Hui Li

27 indexed papers

Top categories

AI ×14, NLP ×10, Crypto ×9, Vision ×6, Software Eng. ×2, Info Retrieval ×2, ML ×2, Robotics ×1

Frequent co-authors

Hui Liu 4×

Xihui Liu 2×

Jie Zhu 2×

Penghui Li 2×

Longxuan Yu 2×

Yu Fu 2×

Research Timeline

2026

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations

The paper proposes ESC-Skills, a skill-centric framework that discovers and self-evolves executable emotional support skills to improve the interpretability and emotional quality of conversational AI.

A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity

The paper demonstrates that the valence structure learned by modern LLMs aligns with human EEG emotional representations, but finds that further supervised alignment is ineffective due to a phenomenon called saturation regularity.

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

The paper introduces Canonical-Context On-Policy Distillation (CCOPD) to improve multi-turn language model performance by mitigating 'self-anchored drift,' ensuring consistent answers regardless of whether the evidence is presented in a single prompt or gradually across multiple turns.

Reinforcement Learning with Robust Rubric Rewards

The paper introduces $\text{RLR}^3$, a novel framework that extends verifiable rewards in Reinforcement Learning to handle partially verifiable, multi-criteria vision-language tasks by integrating robust rubric scoring.

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

The paper proposes a unified framework that decouples long-video reasoning into semantic and visual evidence, significantly improving performance on the HD-EPIC VQA Challenge.

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

The paper introduces SpatialAct, a challenging benchmark that reveals a significant 'reasoning-to-action gap,' showing that current VLMs struggle to maintain coherent spatial understanding and perform reliable actions in multi-turn 3D environments.

PatchWorld: Gradient-Free Optimization of Executable World Models

PatchWorld introduces a gradient-free framework to create executable Python world models from offline trajectories, achieving high planning scores by inducing symbolic belief-state programs.

EvoGens: A Population-Based Heuristic Search Framework for Scientific Idea Generation

EvoGens is an evolution-inspired framework that treats scientific idea generation as an evolutionary search, significantly boosting the novelty and diversity of generated research ideas compared to existing LLM-based methods.

SkyShield: Occupancy as a Safety Interface for Low-Altitude UAV Autonomy

The paper introduces SkyShield, the first front-view monocular semantic occupancy benchmark for low-altitude urban UAV flight, along with a novel metric and model to address the unique safety challenges of aerial navigation.

DSL-LLaDA: Scaling Continuous Denoising to 8B Masked Diffusion LMs

The paper introduces DSL-LLaDA, a method that lightly adapts a pre-trained masked diffusion language model to perform continuous denoising in embedding space, significantly improving text generation quality and robustness, especially under low step budgets.

Revise, Don't Freeze: Sampler-Matched Training for Self-Correcting Masked Diffusion Language Models

The paper introduces D3IM, a novel parameter-free sampler that enables direct revision of visible tokens in Masked Diffusion Language Models, and proposes SCOPE to mitigate the model's tendency to perpetuate errors.

Geometry-Aware Implicit Memory for Video World Models

The paper proposes GIM-World, a geometry-aware implicit memory framework that significantly improves long-horizon video world models by explicitly encoding 3D scene geometry into a compact memory state.

OctoT2I: A Self-Evolving Agentic Text-to-Image Router

OctoT2I introduces a self-evolving, agentic routing framework that efficiently selects and combines multiple Text-to-Image models, achieving high performance while significantly boosting inference speed and energy efficiency.

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

This paper investigates the vulnerability of LLM-based automatic grading systems to prompt injection (PI) attacks, demonstrating that current systems are highly susceptible to manipulation that can lead to unfairly high scores.

Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators

The paper proposes Astra, an agentic framework that equips Vision-Language Models (VLMs) with the ability to perform spatial reasoning by actively generating and utilizing imagined visual evidence from a world simulator.

CoRe: A Continuously Reward-Finetuned LLM Query Rewriter for Multi-Stage Context-Aware Relevance in Web-Scale Video Search

The paper presents CoRe, a query rewriter system that uses the deployed multimodal relevance model as its source for reward and closes the simulation-production gap, allowing for weekly redeployment.

Agent-Native Immune System: Architecture, Taxonomy, and Engineering

This paper introduces the Agent-Native Immune System (ANIS), an endogenous defense architecture for autonomous agents against runtime hijacking.

Symbolon: Symbolic Execution by Learning Code Transformation

The paper presents Symbolon, a framework that learns and applies diverse, context-sensitive code transformations to improve symbolic execution, increasing coverage and reducing costs.

UniClawBench: A Universal Benchmark for Proactive Agents on Real-World Tasks

The paper introduces UniClawBench, a capability-driven benchmark for evaluating proactive agents in real-world settings, using five foundational capabilities and live Docker containers.

A Monolithic Hand with Asymmetric Origami Bending and Dual-chamber Actuators

This paper introduces the asymmetric origami bending (AOB) pattern and asymmetric dual-chamber (ADC) design for creating a soft robotic hand called OSOR, which achieves bio-inspired motions and simplified manufacturing.

Papers

cs.RO · Empirical · Recent · Jul 24, 2026

A Monolithic Hand with Asymmetric Origami Bending and Dual-chamber Actuators

Nan Huang, Yuming Zhu, Zicong Zhang, Jianhui Liu +4 more

cs.CL · Empirical · Recent