Yang Liu

39 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×22Crypto×21Vision×9NLP×9Software Eng.×8ML×7Info Retrieval×2Multiagent×1

Frequent co-authors

Jing Chen3×

Yilong Yang3×

Zhuo Ma3×

Yebo Feng3×

Cong Wu3×

Renyang Liu3×

Research Timeline

2026

Opt-Verifier: Unleashing the Power of LLMs for Optimization Modeling via Dual-Side Verification

The paper introduces Opt-Verifier, a novel LLM-based framework that significantly improves the accuracy of automated optimization model generation by implementing dual-side verification from both structural and solution perspectives.

Xetrieval: Mechanistically Explaining Dense Retrieval

Xetrieval introduces an embedding-level framework to mechanistically explain dense retrieval decisions by decomposing high-dimensional embeddings into sparse, human-interpretable features.

Do Physics Foundation Models Learn Generalizable Physics? A Bias-Aware Benchmark Across Physical Regimes and Distribution Shifts

The paper introduces a comprehensive benchmark to test if physics foundation models learn generalizable dynamics, finding that their performance is highly conditional and not universally general.

BAGEN: Are LLM Agents Budget-Aware?

This paper introduces the concept of Budget-Aware Agents (BAGEN), showing that current LLM agents often fail to manage resources proactively, and proposes that incorporating early stop and interval estimation significantly improves efficiency.

LaSR: Context-Aware Speech Recognition via Latent Reasoning

The paper proposes LaSR, a context-aware training paradigm that uses latent reasoning to significantly improve speech recognition, especially for specialized terminology, without adding latency.

Bridging Requirements and Architecture: Multi-Agent Orchestration with External Knowledge and Hierarchical Memory

The paper introduces MAAD, a multi-agent framework that autonomously transforms software requirements into comprehensive, multi-view architectural blueprints, significantly improving completeness and reducing manual validation.

Distilling Neuro-Symbolic Programs into 3D Multi-modal LLMs

The paper introduces APEIRIA, a neuro-symbolic 3D Multi-modal LLM that bridges the gap between interpretable symbolic reasoning and flexible, open-vocabulary 3D understanding.

Unlocking the Black Box of Latent Reasoning: An Interpretability-Guided Approach to Intervention

This paper introduces interpretability-guided, training-free interventions that systematically improve the accuracy and controllability of latent reasoning in LLMs by leveraging structural and causal insights into continuous hidden states.

Training-Free Composed Video Retrieval via Visual Representation-Guided Video-LLM Reasoning

The paper proposes a training-free framework, Visual Representation-Guided Video-LLM Reasoning, to perform composed video retrieval by using visual examples and text instructions, achieving strong performance on the CVPR 2026 challenge.

Community-Aware Assessment of Social Textual Engagement and Resonance: A Human-Centric Perspective on User-Generated Content Evaluation

The paper introduces CASTER, a new human-centric task for evaluating User-Generated Content (UGC) resonance, and proposes MEDEA, an architecture that uses a Social Chain-of-Thought mechanism to simulate community reactions for quality assessment.

Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark

The paper demonstrates that explicit gender cues systematically affect LLM value trade-offs, causing decision flips that are often masked or misattributed by the models themselves.

Outsmarting the Chameleon: Counterfactual Decoupling for Tactical OOD Shifts in Live Streaming Risk Assessment

The paper proposes a novel framework, LPCD, that uses latent causal modeling to robustly assess evolving adversarial risks in live streaming by decoupling malicious intent from superficial tactical shifts.

Benign Inputs, Harmful Outputs: Cross-Modal Jailbreaking via Distributed Semantic Recomposition

The paper introduces Distributed Semantic Recomposition (DSR), a novel cross-modal jailbreaking framework that bypasses existing safety filters by decomposing harmful intent into benign input components, achieving high attack success rates with low input toxicity.

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

This paper studies how to scale robust robot policies by expanding physical domains in a recoverable way.

Regret Minimization with Adaptive Opponents in Repeated Games

This paper introduces Repeated Policy Regret (RP-Regret), a novel game-theoretic metric for analyzing regret in repeated games with adaptive opponents, and proposes algorithms to minimize it.

Through the PRISM: Preference Representation in Intermediate States of Video Diffusion Models

The paper introduces PRISM, a method for decoding preference signals from noisy latents using a lightweight Query-based Aggregation head and a frozen video diffusion backbone, achieving state-of-the-art preference accuracy and noise-robustness.

CoLT: Teaching Multi-Modal Models to Think with Chain of Latent Thoughts

This paper proposes CoLT, a framework that enables multi-modal models to reason through a chain of latent thought representations instead of text tokens, improving performance and reducing inference time.

FARS: A Fully Automated Research System Deployed at Scale

FARS is a fully automated AI-for-AI research system that generated and advanced 166 complete research papers across 67 topics in a large-scale public deployment, with evaluations from 282 reviews.

Knowledge Over Parameters: Evolving Smart Contract Vulnerability Detection

This paper presents EvoVuln, an automated framework that synthesizes and refines detection logic for smart contract vulnerabilities using minimal labeled samples.

O-VAD: Industrial Video Anomaly Detection through Object-Centric Tracking and Reasoning

This paper introduces an agentic framework for industrial video anomaly detection, capable of tracking spatial-temporal dynamics and underlying transformations of objects to identify abnormalities.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.CLEmpiricalRecentJul 20, 2026

O-VAD: Industrial Video Anomaly Detection through Object-Centric Tracking and Reasoning

Mei Yuan, Qi Long, Qifeng Wu, Zhenyang Li +4 more

This paper introduces an agentic framework for industrial video anomaly detection, capable of tracking spatial-temporal dynamics and underlying transformations of objects to identify abnormalities.

View →

cs.CRcs.SEEmpirical