Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yi Wang

Yi Wang

24 indexed papers

Recent (6 mo)
24
With code
0
Influential cites
0
Benchmarked
0

Publications per year

24
26

Top categories

Crypto×14AI×12ML×10NLP×5Vision×4Software Eng.×2Comp. Eng.×1Robotics×1

Frequent co-authors

Mingyi Wang2×
Hongyi Wang2×
Chenyi Wang2×
Ming F. Li2×
Zhan Qin2×
Haoyu Wang2×

Research Timeline

2026
DPDSyn: Improving Differentially Private Dataset Synthesis for Model Training by Downstream Task Guidance

DPDSyn improves differentially private dataset synthesis by training a differentially private AI model on the original private data, which is then used to generate synthetic datasets that maintain high utility for downstream tasks.

Listen to the Voices of Everyday Users: Democratizing Privacy Ratings for Sensitive Data Access in Mobile Apps

The paper proposes and evaluates DePRa, a system that democratizes privacy assessment by making everyday users active evaluators of mobile app data access, showing its potential to complement expert audits.

Defense against Poisoning Attacks under Shuffle-DP

The paper proposes the first general defense framework to make all union-preserving Differential Privacy (DP) protocols, specifically those based on shuffle-DP, resilient against poisoning attacks.

ClawGuard: Out-of-Band Detection of LLM Agent Workflow Hijacking via EM Side Channel

ClawGuard introduces a passive, out-of-band security monitor that detects LLM agent workflow hijacking by analyzing unique electromagnetic (EM) emanations generated during agent skill execution.

EVA: Editing for Versatile Alignment against Jailbreaks

The paper proposes EVA, a novel framework that uses direct model editing to surgically correct specific neurons responsible for jailbreaking vulnerabilities in LLMs and VLMs, achieving robust safety alignment without performance degradation.

Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion

The paper introduces MIRAGE, a framework that systematically discovers semantic attacks on online HD map construction by finding plausible environmental variations that bypass standard adversarial defenses, demonstrating attacks that remove or inject critical road boundaries.

An exponential mechanism based on quadratic approximations for fine-tuning machine learning models with privacy guarantees

The paper proposes a novel exponential mechanism using quadratic approximations to fine-tune machine learning models on sensitive data while providing strong differential privacy guarantees.

Benchmarking Autonomous Agents against Temporal, Spatial, and Semantic Evasions

The paper introduces a multi-dimensional evasion framework and a new benchmark (A3S-Bench) to test autonomous agents, demonstrating that stateful, multi-turn attacks significantly increase system risk.

Adversarial Trust Poisoning in Vehicular Collaborative Perception

The paper introduces TrustFlip, a novel physical adversarial attack that exploits consistency-based trust defenses in vehicular collaborative perception by using genuine objects to induce inconsistencies among benign vehicles, thereby poisoning the trust scores of targeted vehicles.

What Does the Server See? Understanding Privacy Leakage from Large Language Models in Split Inference

The paper introduces ActInv and PAF to systematically analyze and quantify privacy leakage from intermediate activations during split inference of LLMs, proposing PriPert for enhanced defense.

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

LoRe is a training-free wrapper that dynamically budgets interaction evaluation at each step of graph solvers, significantly improving scalability and speed while maintaining solution quality.

Deconstructing Spatial Complexity: Hierarchical Decomposition for LLM Spatial Reasoning

This paper introduces MCTS-Guided Group Relative Policy Optimization (M-GRPO) to enhance LLM spatial reasoning by improving the decomposition of complex tasks into optimal sub-tasks.

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

The paper proposes Predictive Routing Replay (PR2) to stabilize reinforcement learning on Mixture of Experts (MoE) LLMs by predicting and incorporating short-horizon router evolution during training and rollout.

Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization

The paper proposes Detector Evasion Policy Optimization (DEPO), a constrained reinforcement learning method that effectively evades AI text detectors while strictly maintaining the original text's semantics.

Benchmarking and Enhancing Text-to-Image Models for Generating Visual Representations in Early Arithmetic Education

The paper introduces a new benchmark, E2V-Bench, to evaluate text-to-image models on generating pedagogically accurate visuals from arithmetic equations, finding that current models often fail due to structural and numerical errors.

Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction

The paper proposes using an auxiliary reconstruction task, specifically one that captures intra-state feature dependencies, to improve the quality of state representations learned by the encoder in neural algorithmic reasoning.

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

OmniOPD introduces a logit-free, chunk-level distillation framework that improves on standard On-Policy Distillation by using semantic similarity and peak-entropy scheduling, achieving state-of-the-art performance even with black-box teachers.

AdaCodec: A Predictive Visual Code for Video MLLMs

AdaCodec introduces a predictive visual coding scheme for video MLLMs, significantly improving efficiency and performance by transmitting only inter-frame changes and full reference frames when necessary.

MOC: Multi-Order Communication in LLM-based Multi-Agent Systems

The paper proposes Multi-Order Communication (MOC) to overcome the limitations of standard first-order message passing in LLM-based multi-agent systems, significantly improving performance by capturing multi-hop dependencies.

Are Economists Open to AI? Text as Data as Survey on Professional Sentiment and Academic Research Trends

The paper introduces TaDaS, a framework that analyzes large-scale text archives to measure professional sentiment, finding that while AI discussion among economists is initially negative, the trend shows increasing openness as AI enters elite academic journals.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.CLRecentJun 1, 2026

AdaCodec: A Predictive Visual Code for Video MLLMs

Haowen Hou, Zhen Huang, Zheming Liang, Qingyi Si +7 more

AdaCodec introduces a predictive visual coding scheme for video MLLMs, significantly improving efficiency and performance by transmitting only inter-frame changes and full reference frames when necess…

View →
cs.AIRecentJun 1, 2026

MOC: Multi-Order Communication in LLM-based Multi-Agent Systems

Yao Guan, Lin Wang, Zhihu Lu, Ziyi Wang +2 more

The paper proposes Multi-Order Communication (MOC) to overcome the limitations of standard first-order message passing in LLM-based multi-agent systems, significantly improving performance by capturin…

View →
cs.CERecentJun 1, 2026

Are Economists Open to AI? Text as Data as Survey on Professional Sentiment and Academic Research Trends

Yi Wang, Lei Ge

The paper introduces TaDaS, a framework that analyzes large-scale text archives to measure professional sentiment, finding that while AI discussion among economists is initially negative, the trend sh…

View →
cs.LGcs.CLRecentMay 31, 2026

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Yuhang Zhou, Lizhu Zhang, Yifan Wu, Mingyi Wang +4 more

OmniOPD introduces a logit-free, chunk-level distillation framework that improves on standard On-Policy Distillation by using semantic similarity and peak-entropy scheduling, achieving state-of-the-ar…

View →
cs.LGcs.AIRecentMay 30, 2026

Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction

Jiafu Huang, Chao Peng, Chenyang Xu, Zhengfeng Yang +6 more

The paper proposes using an auxiliary reconstruction task, specifically one that captures intra-state feature dependencies, to improve the quality of state representations learned by the encoder in ne…

View →
cs.LGcs.AIRecentMay 29, 2026

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

Daize Dong, Junlin Chen, Haolong Jia, Jiawei Wu +8 more

The paper proposes Predictive Routing Replay (PR2) to stabilize reinforcement learning on Mixture of Experts (MoE) LLMs by predicting and incorporating short-horizon router evolution during training a…

View →
cs.LGcs.AIRecentMay 29, 2026

Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization

Mingyi Wang, Zhuoer Shen, Yuheng Bu, Shaofeng Zou

The paper proposes Detector Evasion Policy Optimization (DEPO), a constrained reinforcement learning method that effectively evades AI text detectors while strictly maintaining the original text's sem…

View →
cs.CVcs.AIcs.CLRecentMay 29, 2026

Benchmarking and Enhancing Text-to-Image Models for Generating Visual Representations in Early Arithmetic Education

Junling Wang, Boqi Chen, Heejin Do, Mubashara Akhtar +2 more

The paper introduces a new benchmark, E2V-Bench, to evaluate text-to-image models on generating pedagogically accurate visuals from arithmetic equations, finding that current models often fail due to…

View →
cs.LGcs.AIRecentMay 27, 2026

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

Jintao Li, Yong-Yi Wang, Zheng-An Wang, Heng Fan

LoRe is a training-free wrapper that dynamically budgets interaction evaluation at each step of graph solvers, significantly improving scalability and speed while maintaining solution quality.

View →
cs.AIRecentMay 27, 2026

Deconstructing Spatial Complexity: Hierarchical Decomposition for LLM Spatial Reasoning

Yi Wang, Haojie Lu, Zhaofan Zhang, Li Chen +1 more

This paper introduces MCTS-Guided Group Relative Policy Optimization (M-GRPO) to enhance LLM spatial reasoning by improving the decomposition of complex tasks into optimal sub-tasks.

View →
cs.CRcs.CLcs.LGRecentMay 22, 2026

What Does the Server See? Understanding Privacy Leakage from Large Language Models in Split Inference

Mingyuan Fan, Yu Liu, Fuyi Wang, Cen Chen

The paper introduces ActInv and PAF to systematically analyze and quantify privacy leakage from intermediate activations during split inference of LLMs, proposing PriPert for enhanced defense.

View →
cs.CRcs.AIcs.SERecentMay 21, 2026

Benchmarking Autonomous Agents against Temporal, Spatial, and Semantic Evasions

Jianan Ma, Xiaohu Du, Ruixiao Lin, Yaoxiang Bian +7 more

The paper introduces a multi-dimensional evasion framework and a new benchmark (A3S-Bench) to test autonomous agents, demonstrating that stateful, multi-turn attacks significantly increase system risk…

View →
cs.CRcs.AIRecentMay 21, 2026

Adversarial Trust Poisoning in Vehicular Collaborative Perception

Yutong Liu, Chenyi Wang, Ming F. Li, Qingzhao Zhang

The paper introduces TrustFlip, a novel physical adversarial attack that exploits consistency-based trust defenses in vehicular collaborative perception by using genuine objects to induce inconsistenc…

View →
cs.LGcs.CRRecentMay 19, 2026

An exponential mechanism based on quadratic approximations for fine-tuning machine learning models with privacy guarantees

Hoang Tran, Jorge Ramirez, Jiayi Wang, Alberto Bocchinfuso +2 more

The paper proposes a novel exponential mechanism using quadratic approximations to fine-tune machine learning models on sensitive data while providing strong differential privacy guarantees.

View →
cs.CRcs.AIRecentMay 14, 2026

EVA: Editing for Versatile Alignment against Jailbreaks

Yi Wang, Hongye Qiu, Yue Xu, Sibei Yang +3 more

The paper proposes EVA, a novel framework that uses direct model editing to surgically correct specific neurons responsible for jailbreaking vulnerabilities in LLMs and VLMs, achieving robust safety a…

View →
cs.CVcs.CRcs.LGRecentMay 14, 2026

Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion

Chenyi Wang, Ruoyu Song, Raymond Muller, Jean-Philippe Monteuuis +4 more

The paper introduces MIRAGE, a framework that systematically discovers semantic attacks on online HD map construction by finding plausible environmental variations that bypass standard adversarial def…

View →
cs.CRRecentMay 7, 2026

ClawGuard: Out-of-Band Detection of LLM Agent Workflow Hijacking via EM Side Channel

Leo Linqian Gan, Jeffery Wu, Longyuan Ge, Lanqing Yang +5 more

ClawGuard introduces a passive, out-of-band security monitor that detects LLM agent workflow hijacking by analyzing unique electromagnetic (EM) emanations generated during agent skill execution.

View →
cs.CRcs.DBRecentMay 1, 2026

Defense against Poisoning Attacks under Shuffle-DP

Siyi Wang, Qiyao Luo, Yihua Hu, Lixu Wang +5 more

The paper proposes the first general defense framework to make all union-preserving Differential Privacy (DP) protocols, specifically those based on shuffle-DP, resilient against poisoning attacks.

View →
cs.CRcs.HCRecentApr 27, 2026

Listen to the Voices of Everyday Users: Democratizing Privacy Ratings for Sensitive Data Access in Mobile Apps

Liu Wang, Tianshu Zhou, Haoyu Wang, Yi Wang

The paper proposes and evaluates DePRa, a system that democratizes privacy assessment by making everyday users active evaluators of mobile app data access, showing its potential to complement expert a…

View →
cs.CRRecentApr 17, 2026

DPDSyn: Improving Differentially Private Dataset Synthesis for Model Training by Downstream Task Guidance

Mingxuan Jia, Wen Huang, Weixin Zhao, Xingyi Wang +2 more

DPDSyn improves differentially private dataset synthesis by training a differentially private AI model on the original private data, which is then used to generate synthetic datasets that maintain hig…

View →