Qing Wang

15 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×9AI×7Crypto×4Signal Processing×1ML×1Biomolecules×1Multiagent×1Networking×1

Frequent co-authors

Yuqing Wang2×

Jialu Liang2×

Qianqian Song2×

Raul Chavez-Santiago1×

Kamran Sayrafian-Pour1×

Ali Khaleghi1×

Research Timeline

2026

Look One Step Ahead: Forward-Looking Incentive Design with Strategic Privacy for Proactive Service Provisioning over Air-Ground Integrated Edge Networks

The paper proposes Look One Step Ahead (LOSA), a novel framework that enables efficient, privacy-preserving, and robust service provisioning in dynamic air-ground integrated networks by decoupling planning into a look-ahead phase and a real-time execution phase.

Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks

The paper introduces Safety Bottleneck Regularization (SBR), a novel defense mechanism that anchors LLM safety by constraining the unembedding layer, effectively preventing harmful fine-tuning (HFT) even when other defenses fail.

OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing

OrchJail introduces an orchestration-guided fuzzing framework to systematically jailbreak tool-calling text-to-image agents by exploiting unsafe multi-step tool-orchestration patterns.

Reflect-Guard: Enhancing LLM Safeguards against Adversarial Prompts via Logical Self-Reflection

Reflect-Guard enhances LLM safety classifiers by integrating logical self-reflection, significantly improving detection of sophisticated adversarial jailbreak prompts.

Modeling Vehicle-Type-Specific Pedestrian Crash Avoidance Behavior in Safety-Critical Interactions Using Smooth-Mamba Deep Reinforcement Learning

The paper develops a novel deep reinforcement learning framework, SMamba-DDPG, to accurately model vehicle-type-specific pedestrian crash avoidance behavior, finding that pedestrians react faster and more cautiously to automated vehicles (AVs) than to human-driven vehicles (HDVs).

What Gets Unmasked First? Trajectory Analysis of Diffusion Models for Graph-to-Text Generation

This paper analyzes the decoding process of masked diffusion models for graph-to-text generation, finding that structural fine-tuning disrupts natural entity-first generation and proposing a structural decoding method to fix it.

MemPro: Agentic Memory Systems as Evolvable Programs

MemPro introduces a system-level evolution framework that treats the entire memory construction-retrieval pipeline as an evolvable program, significantly improving long-horizon agent performance over fixed-pipeline baselines.

Probe Before You Edit: Probing-Guided Molecular Optimization for LLM Agents in Structure-Based Drug Design

The paper introduces PROBE, an optimization framework that guides LLM agents in structure-based drug design by performing controlled 'probe edits' to assess how molecular changes affect both binding affinity and druggability simultaneously.

DrugClaw and DrugAudit: A Primary-Source-Grounded Agent and Authority-Aware Benchmark for Drug-Information Question Answering

The paper introduces DrugClaw, a multi-agent system, and DrugAudit, a new benchmark, demonstrating that DrugClaw excels at answering drug-related questions by grounding answers in primary regulatory sources.

UniD$^3$: A Knowledge Graph-Enhanced RAG Framework for Drug-Disease Discovery and Reasoning

UniD$^3$ is a novel Knowledge Graph-enhanced RAG framework that processes vast biomedical literature to systematically extract, organize, and validate comprehensive drug-disease knowledge, achieving high accuracy in structured data generation.

Trust Region On-Policy Distillation

The paper introduces Trust Region On-Policy Distillation (TrOPD), a robust method that stabilizes the on-policy distillation of large language models by restricting training to regions where teacher supervision is reliable.

Beyond Isolated Behaviors: Hierarchical User Modeling for LLM Personalization

The paper proposes a hierarchical framework, PHF (Practice-Habitus-Field), inspired by Bourdieu's Theory of Practice, to improve LLM personalization by modeling user behaviors at three distinct levels.

CRAB-Bench: Evaluating LLM Agents under Complex Task Dependencies and Human-aligned User Simulation

The paper introduces CRAB-Bench and RUSE, a rigorous evaluation framework that tests LLM agents on complex, interdependent tasks with realistic human user interactions, revealing significant performance gaps in current models.

UniClawBench: A Universal Benchmark for Proactive Agents on Real-World Tasks

The paper introduces UniClawBench, a capability-driven benchmark for evaluating proactive agents in real-world settings, using five foundational capabilities and live Docker containers.

Propagation models for IEEE 802.15.6 standardization of implant communication in body area networks

This paper outlines research on obtaining accurate propagation models for implant communication in Body Area Networks (BANs) using computer simulations, and presents current research on enhancing channel models with ultra wideband signals.

Highlighted terms show continued research focus across papers

Papers

eess.SPEmpiricalRecentJul 24, 2026

Propagation models for IEEE 802.15.6 standardization of implant communication in body area networks

Raul Chavez-Santiago, Kamran Sayrafian-Pour, Ali Khaleghi, Kenichi Takizawa +3 more

View →

cs.CLEmpiricalRecent