Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Hui Li

Hui Li

22 indexed papers

Recent (6 mo)
22
With code
0
Influential cites
0
Benchmarked
0

Publications per year

22
26

Top categories

AI×13Crypto×8NLP×8Vision×6ML×2Software Eng.×1Networking×1Game Theory×1

Frequent co-authors

Hui Liu4×
Longxuan Yu2×
Yu Fu2×
Yue Dong2×
Greg Ver Steeg2×
Ninghui Li2×

Research Timeline

2026
Styx: Collaborative and Private Data Processing With TEE-Enforced Sticky Policy

Styx is a novel framework that enhances data privacy and security in collaborative data processing, such as joint AI training, by integrating sticky policies with Trusted Execution Environments (TEEs).

Look One Step Ahead: Forward-Looking Incentive Design with Strategic Privacy for Proactive Service Provisioning over Air-Ground Integrated Edge Networks

The paper proposes Look One Step Ahead (LOSA), a novel framework that enables efficient, privacy-preserving, and robust service provisioning in dynamic air-ground integrated networks by decoupling planning into a look-ahead phase and a real-time execution phase.

SecGoal: A Benchmark for Extracting Formalizable Security Goals from Protocol Documents

The paper introduces SecGoal, a benchmark dataset and framework, demonstrating that fine-tuning smaller LLMs on this dataset significantly improves the precision of extracting formalizable security goals from natural language protocol documents.

Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution

The EvoSafety framework enhances LLM safety by externalizing attack and defense mechanisms, enabling persistent, transferable, and model-agnostic robustness against adversarial prompts.

Detecting Privilege Escalation in Polyglot Microservices via Agentic Program Analysis

The paper introduces Neo, an agentic program analysis framework that successfully detects zero-day privilege escalation vulnerabilities in complex, polyglot microservices by combining LLMs with advanced code analysis.

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations

The paper proposes ESC-Skills, a skill-centric framework that discovers and self-evolves executable emotional support skills to improve the interpretability and emotional quality of conversational AI.

A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity

The paper demonstrates that the valence structure learned by modern LLMs aligns with human EEG emotional representations, but finds that further supervised alignment is ineffective due to a phenomenon called saturation regularity.

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

The paper introduces Canonical-Context On-Policy Distillation (CCOPD) to improve multi-turn language model performance by mitigating 'self-anchored drift,' ensuring consistent answers regardless of whether the evidence is presented in a single prompt or gradually across multiple turns.

Reinforcement Learning with Robust Rubric Rewards

The paper introduces $ ext{RLR}^3$, a novel framework that extends verifiable rewards in Reinforcement Learning to handle partially verifiable, multi-criteria vision-language tasks by integrating robust rubric scoring.

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

The paper proposes a unified framework that decouples long-video reasoning into semantic and visual evidence, significantly improving performance on the HD-EPIC VQA Challenge.

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

The paper introduces SpatialAct, a challenging benchmark that reveals a significant 'reasoning-to-action gap,' showing that current VLMs struggle to maintain coherent spatial understanding and perform reliable actions in multi-turn 3D environments.

PatchWorld: Gradient-Free Optimization of Executable World Models

PatchWorld introduces a gradient-free framework to create executable Python world models from offline trajectories, achieving high planning scores by inducing symbolic belief-state programs.

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

EvoGens is an evolution-inspired framework that treats scientific idea generation as an evolutionary search, significantly boosting the novelty and diversity of generated research ideas compared to existing LLM-based methods.

SkyShield: Occupancy as a Safety Interface for Low-Altitude UAV Autonomy

The paper introduces SkyShield, the first front-view monocular semantic occupancy benchmark for low-altitude urban UAV flight, along with a novel metric and model to address the unique safety challenges of aerial navigation.

DSL-LLaDA: Scaling Continuous Denoising to 8B Masked Diffusion LMs

The paper introduces DSL-LLaDA, a method that lightly adapts a pre-trained masked diffusion language model to perform continuous denoising in embedding space, significantly improving text generation quality and robustness, especially under low step budgets.

Revise, Don't Freeze: Sampler-Matched Training for Self-Correcting Masked Diffusion Language Models

The paper introduces D3IM, a novel parameter-free sampler that enables direct revision of visible tokens in Masked Diffusion Language Models, and proposes SCOPE to mitigate the model's tendency to perpetuate errors.

Geometry-Aware Implicit Memory for Video World Models

The paper proposes GIM-World, a geometry-aware implicit memory framework that significantly improves long-horizon video world models by explicitly encoding 3D scene geometry into a compact memory state.

OctoT2I: A Self-Evolving Agentic Text-to-Image Router

OctoT2I introduces a self-evolving, agentic routing framework that efficiently selects and combines multiple Text-to-Image models, achieving high performance while significantly boosting inference speed and energy efficiency.

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

This paper investigates the vulnerability of LLM-based automatic grading systems to prompt injection (PI) attacks, demonstrating that current systems are highly susceptible to manipulation that can lead to unfairly high scores.

Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators

The paper proposes Astra, an agentic framework that equips Vision-Language Models (VLMs) with the ability to perform spatial reasoning by actively generating and utilizing imagined visual evidence from a world simulator.

Highlighted terms show continued research focus across papers

Papers

cs.CVRecentJun 4, 2026

Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators

Chenming Zhu, Jingli Lin, Yilin Long, Peizhou Cao +3 more

The paper proposes Astra, an agentic framework that equips Vision-Language Models (VLMs) with the ability to perform spatial reasoning by actively generating and utilizing imagined visual evidence fro…

View →
cs.CRcs.AIRecentJun 2, 2026

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

Hang Li, Fedor Filippov, Yuling Lin, Pengfei He +5 more

This paper investigates the vulnerability of LLM-based automatic grading systems to prompt injection (PI) attacks, demonstrating that current systems are highly susceptible to manipulation that can le…

View →
cs.CVRecentJun 1, 2026

Geometry-Aware Implicit Memory for Video World Models

Zhengxuan Wei, Xu Guo, Xinghui Li, Xunzhi Xiang +7 more

The paper proposes GIM-World, a geometry-aware implicit memory framework that significantly improves long-horizon video world models by explicitly encoding 3D scene geometry into a compact memory stat…

View →
cs.AIRecentJun 1, 2026

OctoT2I: A Self-Evolving Agentic Text-to-Image Router

Xu Jiang, Bin Chen, Gehui Li, Yule Duan +2 more

OctoT2I introduces a self-evolving, agentic routing framework that efficiently selects and combines multiple Text-to-Image models, achieving high performance while significantly boosting inference spe…

View →
cs.CLcs.AIRecentMay 31, 2026

DSL-LLaDA: Scaling Continuous Denoising to 8B Masked Diffusion LMs

Longxuan Yu, Yunshu Wu, Yu Fu, Siheng Xiong +4 more

The paper introduces DSL-LLaDA, a method that lightly adapts a pre-trained masked diffusion language model to perform continuous denoising in embedding space, significantly improving text generation q…

View →
cs.CLRecentMay 31, 2026

Revise, Don't Freeze: Sampler-Matched Training for Self-Correcting Masked Diffusion Language Models

Longxuan Yu, Shaorong Zhang, Yu Fu, Hui Liu +2 more

The paper introduces D3IM, a novel parameter-free sampler that enables direct revision of visible tokens in Masked Diffusion Language Models, and proposes SCOPE to mitigate the model's tendency to per…

View →
cs.CVcs.AIRecentMay 30, 2026

SkyShield: Occupancy as a Safety Interface for Low-Altitude UAV Autonomy

Jie Gao, Jie Ma, Kaihui Lin, Kai Ye +3 more

The paper introduces SkyShield, the first front-view monocular semantic occupancy benchmark for low-altitude urban UAV flight, along with a novel metric and model to address the unique safety challeng…

View →
cs.CVcs.AIcs.CLRecentMay 29, 2026

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

Tianhui Liu, Jie Feng, Zhiheng Zheng, Shengyuan Wang +5 more

The paper introduces SpatialAct, a challenging benchmark that reveals a significant 'reasoning-to-action gap,' showing that current VLMs struggle to maintain coherent spatial understanding and perform…

View →
cs.CLcs.AIRecentMay 29, 2026

PatchWorld: Gradient-Free Optimization of Executable World Models

Jiaxin Bai, Yue Guo, Yifei Dong, Jiaxuan Xiong +12 more

PatchWorld introduces a gradient-free framework to create executable Python world models from offline trajectories, achieving high planning scores by inducing symbolic belief-state programs.

View →
cs.CLRecentMay 29, 2026

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

Xu Li, Hanzhe Tu, Xinyi Li, Kuncheng Zhao +2 more

EvoGens is an evolution-inspired framework that treats scientific idea generation as an evolutionary search, significantly boosting the novelty and diversity of generated research ideas compared to ex…

View →
cs.LGcs.AIRecentMay 28, 2026

A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity

Yousef A. Radwan, Xuhui Liu, Kilichbek Haydarov, Yuqian Fu +1 more

The paper demonstrates that the valence structure learned by modern LLMs aligns with human EEG emotional representations, but finds that further supervised alignment is ineffective due to a phenomenon…

View →
cs.CLcs.AIRecentMay 28, 2026

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

Zizhuo Lin, Quanling Liu, Jinsheng Quan, Chao Zhang +5 more

The paper introduces Canonical-Context On-Policy Distillation (CCOPD) to improve multi-turn language model performance by mitigating 'self-anchored drift,' ensuring consistent answers regardless of wh…

View →
cs.CVcs.AIRecentMay 28, 2026

Reinforcement Learning with Robust Rubric Rewards

Ya-Qi Yu, Hao Wang, Fangyu Hong, Xiangyang Qu +14 more

The paper introduces $ ext{RLR}^3$, a novel framework that extends verifiable rewards in Reinforcement Learning to handle partially verifiable, multi-criteria vision-language tasks by integrating robu…

View →
cs.CVcs.AIRecentMay 28, 2026

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

Yinsong Xu, Wei Jing, Liuxin Zhang, Wanjun Lv +1 more

The paper proposes a unified framework that decouples long-video reasoning into semantic and visual evidence, significantly improving performance on the HD-EPIC VQA Challenge.

View →
cs.CLcs.AIRecentMay 27, 2026

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations

Jie Zhu, Huaixia Dou, Shuo Jiang, Junhui Li +4 more

The paper proposes ESC-Skills, a skill-centric framework that discovers and self-evolves executable emotional support skills to improve the interpretability and emotional quality of conversational AI.

View →
cs.CRcs.AIcs.SERecentMay 15, 2026

Detecting Privilege Escalation in Polyglot Microservices via Agentic Program Analysis

Penghui Li, Hong Yau Chong, Yinzhi Cao, Junfeng Yang

The paper introduces Neo, an agentic program analysis framework that successfully detects zero-day privilege escalation vulnerabilities in complex, polyglot microservices by combining LLMs with advanc…

View →
cs.CRcs.CLRecentMay 13, 2026

Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution

Xiaozhe Zhang, Chaozhuo Li, Hui Liu, Shaocheng Yan +3 more

The EvoSafety framework enhances LLM safety by externalizing attack and defense mechanisms, enabling persistent, transferable, and model-agnostic robustness against adversarial prompts.

View →
cs.CRRecentApr 30, 2026

SecGoal: A Benchmark for Extracting Formalizable Security Goals from Protocol Documents

Dawei Huang, Hui Li, Bo Jia, Haonan Feng +3 more

The paper introduces SecGoal, a benchmark dataset and framework, demonstrating that fine-tuning smaller LLMs on this dataset significantly improves the precision of extracting formalizable security go…

View →
cs.NIcs.CRcs.GTRecentApr 15, 2026

Look One Step Ahead: Forward-Looking Incentive Design with Strategic Privacy for Proactive Service Provisioning over Air-Ground Integrated Edge Networks

Sicheng Wu, Minghui Liwang, Yangyang Gao, Deqing Wang +4 more

The paper proposes Look One Step Ahead (LOSA), a novel framework that enables efficient, privacy-preserving, and robust service provisioning in dynamic air-ground integrated networks by decoupling pla…

View →
cs.CRRecentApr 5, 2026

Styx: Collaborative and Private Data Processing With TEE-Enforced Sticky Policy

Shixuan Zhao, Weicheng Wang, Ninghui Li, Zhiqiang Lin

Styx is a novel framework that enhances data privacy and security in collaborative data processing, such as joint AI training, by integrating sticky policies with Trusted Execution Environments (TEEs)…

View →