Yu Huang

9 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×6NLP×3Crypto×3Software Eng.×2ML×2Distributed×1OS×1Optimization and Control×1

Frequent co-authors

Qian Cheng1×

Saad Mohammad Rafid Pial1×

Ruize Tang1×

Yiming Su1×

Emilie Ma1×

Finn Hackett1×

Research Timeline

2026

Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models

The paper introduces Acoustic Interference Attack (AIA), a novel jailbreak method that bypasses Large Audio Language Model (LALM) safety alignments by manipulating the underlying acoustic latent semantics rather than injecting malicious content.

Backdooring Masked Diffusion Language Models

The paper introduces SHADOWMASK, the first systematic backdoor attack targeting Masked Diffusion Language Models (MDLMs), demonstrating near-100% attack success while preserving clean model utility.

Privacy-Preserving Screening for Record Linkage

The paper introduces Appraisal, a novel Screening-then-Linkage framework (PPRS) that significantly improves the scalability and efficiency of Privacy-Preserving Record Linkage by incorporating a lightweight screening phase.

Demystifying Data Organization for Enhanced LLM Training

This paper proposes four guidelines and two novel data ordering methods (STR and SAW) to systematically optimize data organization, significantly enhancing the stability and performance of LLM training.

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

This study analyzes over 20,000 real-world coding sessions to show that AI coding agents frequently fail users through subtle misalignment, requiring constant manual correction even when major system damage is avoided.

Agentic Transformers Provably Learn to Search via Reinforcement Learning

This paper demonstrates that transformer-based policies can provably learn complex tree search mechanisms, such as depth-first search, purely through reinforcement learning in a stochastic environment.

UniScale: Adaptive Unified Inference Scaling via Online Joint Optimization of Model Routing and Test-Time Scaling

UniScale proposes a unified framework that jointly optimizes model routing and test-time scaling to achieve a superior, fine-grained quality-cost trade-off for large language model inference.

PatchWorld: Gradient-Free Optimization of Executable World Models

PatchWorld introduces a gradient-free framework to create executable Python world models from offline trajectories, achieving high planning scores by inducing symbolic belief-state programs.

Specula: Scaling formal specifications for autonomous model checking of system code

Specula is an autonomous system that generates high-quality formal specifications for large, complex code using LLMs, improving understanding and finding bugs.

Highlighted terms show continued research focus across papers

Papers

cs.SEcs.AIcs.DCNEWEmpiricalJul 28, 2026

Specula: Scaling formal specifications for autonomous model checking of system code

Qian Cheng, Saad Mohammad Rafid Pial, Ruize Tang, Yiming Su +5 more

Specula is an autonomous system that generates high-quality formal specifications for large, complex code using LLMs, improving understanding and finding bugs.

View →

cs.LGcs.AI