Guang Wang

9 indexed papers

Top categories

AI ×9 · Crypto ×5 · ML ×4 · NLP ×1 · Society ×1

Frequent co-authors

Shuai Wang (2×)

Zhun Wang (2×)

Research Timeline

2026

A Framework for Formalizing LLM Agent Security

The paper introduces a contextual security framework for LLM agents, defining security properties and reformulating various attacks and defenses based on the context of execution.

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

This paper introduces a novel framework, the Reasoning Safety Monitor, to detect and prevent logical inconsistencies and adversarial manipulations within the internal reasoning steps of large language models, establishing reasoning safety as a critical security dimension.

Knowledge Poisoning Attacks on Medical Multi-Modal Retrieval-Augmented Generation

The paper proposes M³Att, a knowledge-poisoning framework that injects covert misinformation into medical multimodal RAG systems using paired visual data triggers, demonstrating attacks that generate clinically plausible but incorrect diagnoses.

EnergyMamba: An Uncertainty-Aware Graph-Enhanced Selective State Space Model for Energy Consumption Prediction

EnergyMamba proposes an uncertainty-aware, graph-enhanced selective state space model to significantly improve both the accuracy and reliability of energy consumption prediction by explicitly modeling spatial dependencies.

MindClaw: Closed-Loop Embodied Mental-State Reasoning for Precision Intervention

The paper introduces MindClaw, a closed-loop framework that enables embodied agents to perform real-time mental-state reasoning and intervene with precision, significantly outperforming standard VLM baselines.

E4GEN: Event-level Explainable Extreme-Enhanced Time-series Generation

E4GEN introduces an explainable diffusion framework that significantly improves time-series generation by specifically focusing on and controlling the fidelity of extreme events.

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

The paper introduces CyberGym-E2E, a large-scale, end-to-end benchmark designed to comprehensively evaluate AI agents' capabilities across the entire lifecycle of real-world software vulnerability discovery, proof-of-concept generation, and patch creation.

From Shield to Target: Denial-of-Service Attacks on LLM-Based Agent Guardrails

This paper reveals a denial-of-service vulnerability in LLM-based guardrails for autonomous agents and proposes two attack frameworks.

Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction

The paper introduces Epi2Diff, a framework that maps the reasoning traces of Large Reasoning Models into cognitively grounded episode sequences for human item difficulty prediction, outperforming strong baselines.

Papers

cs.CL · cs.AI · cs.CY · Empirical · Recent · Jun 26, 2026

Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction

Chenguang Wang, Ming Li, Xinyue Zeng, Zhuochun Li +3 more

cs.CR · cs.AI · Empirical