Jun Li

22 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×15Crypto×12Vision×4NLP×3Software Eng.×2ML×1Info Theory×1Signal Processing×1

Frequent co-authors

Lijun Li3×

Haoxuan Qu3×

Hossein Rahmani3×

Jun Liu3×

Pengyu Zhu2×

Jing Shao2×

Research Timeline

2026

Unveiling the Resilience of LLM-Enhanced Search Engines against Black-Hat SEO Manipulation

This paper systematically analyzes the resilience of LLM-enhanced search engines against black-hat SEO attacks, finding that while they block most traditional attacks, they remain vulnerable to sophisticated LLM-generated query manipulations.

Scaling Exposes the Trigger: Input-Level Backdoor Detection in Text-to-Image Diffusion Models via Cross-Attention Scaling

The paper introduces SET, a robust input-level backdoor detection framework that detects hidden malicious triggers in text-to-image diffusion models by analyzing systematic differences in how benign and backdoor inputs respond to controlled cross-attention scaling perturbations.

Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration

The paper introduces TICoE, a text-image collaborative framework that achieves precise and faithful concept removal from text-to-image generative models, surpassing existing methods in both precision and content fidelity.

PRAG: End-to-End Privacy-Preserving Retrieval-Augmented Generation

PRAG is an end-to-end privacy-preserving Retrieval-Augmented Generation (RAG) system that maintains high retrieval accuracy and scalability in cloud environments by encrypting both documents and queries.

APIOT: Autonomous Vulnerability Management Across Bare-Metal Industrial OT Networks

The paper introduces APIOT, the first LLM framework capable of autonomously performing the full discovery, exploitation, patching, and verification cycle against bare-metal industrial OT devices.

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization

SecureForge is an automated pipeline that significantly reduces cybersecurity vulnerabilities in LLM-generated code by optimizing system prompts, achieving up to a 48% reduction in output vulnerabilities.

Knowledge Poisoning Attacks on Medical Multi-Modal Retrieval-Augmented Generation

The paper proposes M extsuperscript{3}Att, a knowledge-poisoning framework that injects covert misinformation into medical multimodal RAG systems using paired visual data triggers, demonstrating attacks that generate clinically plausible but incorrect diagnoses.

New Wide-Net-Casting Jailbreak Attacks Risk Large Models

This paper introduces the 'wide-net-casting' jailbreak scenario, demonstrating that querying a group of large language models can expose significant, previously overlooked safety risks, with a novel method achieving 100% jailbreak success in some tests.

Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications

The paper introduces POLARIS, a novel framework that systematically generates comprehensive and verifiable safety tests for LLMs by formalizing natural language policies into First-Order Logic and exploring the resulting Semantic Policy Graph.

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

The paper proposes HiSME, a lightweight hierarchical skill meta-evolving solution that jointly optimizes skills and the skill evolving strategy by learning meta-skills from task execution traces, leading to improved agent performance.

A Unified Framework for the Evaluation of LLM Agentic Capabilities

The paper introduces a unified framework to fairly evaluate LLM agentic capabilities by standardizing diverse benchmarks and separating the effects of the LLM model from the surrounding framework and environment.

SmartDirector: Keyframe-Conditioned Cinematic Video Generation with Narrative Pacing Control

SmartDirector is a novel framework that significantly improves cinematic video generation by using multiple keyframes to provide precise control over narrative structure and temporal pacing.

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

The paper introduces AgentSchool, an advanced LLM-powered multi-agent simulator that models learning as state transitions to provide a robust, ethically viable testbed for educational research and pedagogical reform.

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

This study analyzes over 20,000 real-world coding sessions to show that AI coding agents frequently fail users through subtle misalignment, requiring constant manual correction even when major system damage is avoided.

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

AutoSci is a memory-centric agentic system designed to automate the entire scientific research lifecycle by integrating structured memory, multi-stage execution, and continuous self-improvement.

Practical Cross-Band Channel Prediction for AI-RAN via Physics-Guided Deep Unfolding

The paper proposes GUIDE, a physics-guided deep unfolding framework that enables practical, real-time cross-band channel prediction for AI-RAN by embedding wireless channel physics, significantly improving beamforming gain while maintaining high inference speed.

SafeSteer: Localized On-Policy Distillation for Efficient Safety Alignment

SafeSteer proposes a localized on-policy distillation method that restricts safety alignment to specific safety tokens, thereby achieving strong safety performance with minimal degradation to general capabilities and significantly reducing data requirements.

ToolFG: Towards Well-Grounded Fine-Grained Image Classification

The paper introduces ToolFG, a novel tool-integrated MLLM framework that enhances fine-grained image classification by enabling models to autonomously use external tools to gather verifiable visual cues.

Estimating Mutual Information between Time Series and Temporal Event Sequences Across Diverse Analysis Tasks

The paper proposes a novel nonparametric mutual information estimator to robustly quantify dependence between heterogeneous temporal data, specifically continuous time series and discrete event sequences.

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

The paper introduces the concept of Search-Time Contamination (STC), demonstrating that deep research agents can leak information from public benchmarks via web search, leading to an overestimation of their true reasoning ability.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIRecentJun 3, 2026

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

Yongjie Wang, Xinyue Zhang, Kunhong Yao, Zhiwei Zeng +3 more

View →

cs.AIcs.CLRecentJun 1, 2026