Qi Wang

13 indexed papers

Top categories

AI ×8, Crypto ×5, NLP ×4, ML ×3, Vision ×2, Info Retrieval ×1, Systems and Control ×1

Frequent co-authors

Jiaqi Wang (2×)

Luoyu Chen (2×)

Weiqi Wang (2×)

Zhiyi Tian (2×)

Feng Wu (2×)

Ahmed Asiri (2×)

Research Timeline

2026

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

The paper proposes a tool-mediated LLM architecture for autonomous cyber defense, formally proving its stability and demonstrating that it significantly reduces an attacker's expected payoff in real-world attack graph simulations.

PhishSigma++: Malicious Email Detection with Typed Entity Relations

PhishSigma++ is a novel entity-relation-based detector that improves malicious email detection by focusing on invariant functional relationships between typed entities, significantly outperforming text-centric models under adversarial manipulation.

Ellipsoid Control: A White-list Jailbreak Defense via Benign Latent Modeling

The paper proposes Ellipsoid Control, a white-list defense mechanism that uses benign data geometry to constrain model updates, thereby enhancing jailbreak safety while preserving the utility of harmless inputs.

Steering Beyond the Support: Adversarial Training on Unsupervised Jailbroken Activation Simulation

The paper proposes an unsupervised bi-level adversarial training framework to enhance LLM safety steering, achieving strong zero-shot defense against unseen and evolving jailbreak prompts.

RAISE: RAG Design as an Architecture Search Problem

The paper proposes formulating RAG design as an architecture search problem and introduces RAISE, a comprehensive framework and benchmark for systematically optimizing RAG hyperparameters.

Semantic Triplet Restoration: A Novel Protocol for Hierarchical Table Understanding in Large Language Models

The paper introduces Semantic Triplet Restoration (STR), a novel protocol that converts complex table structures into atomic semantic triplets, improving table question answering by providing explicit semantic context and reducing reliance on layout-dependent serializations.

A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation

The paper introduces VIABLE, the first benchmark for evaluating Vision-Language Models (VLMs) as judges for Visually Impaired Assistance (VIA), finding that current models are largely unreliable and proposing VIA-Judge-Agent to improve evaluation.

RLVR without Ineffective Samples: Group Prioritized Off-Policy Optimization for LLM Reasoning

The paper introduces Group Prioritized Off-Policy Optimization (POPO), a novel framework that efficiently accelerates RL finetuning for LLM reasoning by leveraging effective off-policy training batches without requiring costly additional data rollouts.

Brain-Atlas-Guided Generative Counterfactual Attention for Explainable Cognitive Decline Diagnosis Using Multimodal Connectomes

The paper proposes a novel Generative Counterfactual Attention-guided Network (GCAN) that uses multimodal connectomes and brain atlas knowledge to provide explainable and highly accurate diagnosis of cognitive decline.

Policy and World Modeling Co-Training for Language Agents

The paper proposes PaW, a co-training framework that uses standard RL rollouts to provide auxiliary world model supervision directly during policy training, significantly improving language agent performance.

AdaCodec: A Predictive Visual Code for Video MLLMs

AdaCodec introduces a predictive visual coding scheme for video MLLMs, significantly improving efficiency and performance by transmitting only inter-frame changes and full reference frames when necessary.

From Agent Traces to Trust: Evidence Tracing and Execution Provenance in LLM Agents

This survey provides a systematic framework and taxonomy for evidence tracing and execution provenance in LLM agents, addressing the difficulty of verifying and auditing complex agent behaviors.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

Papers

cs.IR, cs.AI, cs.CL · Recent · Jun 4, 2026

OneReason Technical Report

OneRec Team, Biao Yang, Boyang Ding, Chenglong Chu +80 more

cs.CR, cs.AI · Recent · Jun 3, 2026