He Liu

12 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×9Crypto×7Vision×3ML×2Audio and Speech Processing×1NLP×1Multiagent×1Comp. Eng.×1

Frequent co-authors

Zhe Liu3×

Che Liu2×

Hao Cheng2×

Changtao Miao2×

Tianle Song2×

Yin Wu2×

Research Timeline

2026

GasLiteAA: Optimizing ERC-4337 for Efficient and Secure Gas Sponsorship

GasLiteAA proposes optimizing the ERC-4337 standard by offloading gas sponsorship logic to Trusted Execution Environments (TEE), significantly reducing on-chain gas costs while maintaining security and verifiability.

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

SafeHarbor is a novel, hierarchical memory-augmented framework that establishes context-aware decision boundaries for LLM agents, achieving state-of-the-art safety while minimizing over-refusal.

OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing

OrchJail introduces an orchestration-guided fuzzing framework to systematically jailbreak tool-calling text-to-image agents by exploiting unsafe multi-step tool-orchestration patterns.

DCVD: Dual-Channel Cross-Modal Fusion for Joint Vulnerability Detection and Localization

DCVD proposes a dual-channel cross-modal fusion framework that jointly detects software vulnerabilities and precisely localizes the vulnerable lines, outperforming existing state-of-the-art methods.

LITMUS: Benchmarking Behavioral Jailbreaks of LLM Agents in Real OS Environments

The paper introduces LITMUS, a novel benchmark that rigorously tests LLM agents for dangerous, physical-layer behavioral jailbreaks in real OS environments, revealing that current agents frequently execute high-risk operations despite safety guardrails.

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

SANA-Streaming introduces a novel, efficient framework that enables real-time, high-resolution streaming video-to-video editing by combining a hybrid diffusion transformer with specialized training and hardware co-design.

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

The paper proposes Guided Denoiser Self-Distillation (GDSD), a novel method that bypasses the use of likelihood surrogates (like ELBO) in RL for diffusion language models, achieving state-of-the-art performance on complex benchmarks.

SeClaw: Spec-Driven Security Task Synthesis for Evaluating Autonomous Agents

SeClaw is a new framework that synthesizes security tasks from structured risk specifications to evaluate autonomous LLM agents' behavior in stateful environments, focusing on the process of unsafe actions rather than just the final outcome.

SeClaw: Spec-Driven Security Task Synthesis for Evaluating Autonomous Agents

SeClaw is a new framework that uses specification-driven task synthesis to create comprehensive and controllable security benchmarks for evaluating the unsafe behaviors of autonomous LLM agents.

GigaSpeechBench: A Real-World Multilingual Speech-to-Text Benchmark

The paper introduces GigaSpeechBench, a comprehensive multilingual and multidimensional ASR & AST benchmark with 680 hours of human-annotated speech, featuring 12 low-resource languages, 6 Chinese dialects, 6 English accents, dense terminology, older adult and child speech, and human-annotated translations.

Search Beyond What Can Be Taught: Evolving the Knowledge Boundary in Agentic Visual Generation

The paper constructs datasets and benchmarks to evaluate the performance of visual generators in handling open-ended requests, and proposes a teach-then-search co-training framework to improve their world-knowledge.

ELSA3D: Elastic Semantic Anchoring for Unified 3D Understanding and Generation

The paper introduces ELSA3D, a unified 3D model that uses elastic semantic anchoring to improve interaction between text and 3D representations, achieving state-of-the-art performance with reduced FLOPs and inference latency.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.LGEmpiricalRecentJul 7, 2026

ELSA3D: Elastic Semantic Anchoring for Unified 3D Understanding and Generation

Tianjiao Yu, Xinzhuo Li, Yifan Shen, Onkar Susladkar +3 more

View →

cs.CVcs.AIEmpirical