Ruizhe Li

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×2Sound×1AI×1Crypto×1

Frequent co-authors

Yujian Ma1×

Jinqiu Sang1×

Jiaao Yu1×

Ang Li1×

Giulia Pucci1×

Emily Hemendinger1×

Research Timeline

2026

Trust No Tool: Evaluating and Defending LLM Agents under Untrusted Tool Feedback

The paper introduces a new security benchmark and framework to defend LLM agents against 'cognitive poisoning,' where malicious tools build trust through benign feedback before executing a harmful final action.

Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback

This paper systematically evaluates how LLMs uncritically adapt to potentially dangerous user prompts related to eating disorders, finding that specific linguistic cues significantly increase the likelihood of unsafe responses.

From Semantics to Readout: Mechanistic Understanding of Audio Tokens after Fine-Tuning for Temporal Audio Grounding

This paper examines how fine-tuning large audio-language models affects the semantics, decoder accessibility, and temporal output alignment of native audio-token states using temporal audio grounding.

Highlighted terms show continued research focus across papers

Papers

cs.SDNEWEmpiricalJul 28, 2026

From Semantics to Readout: Mechanistic Understanding of Audio Tokens after Fine-Tuning for Temporal Audio Grounding

Yujian Ma, Jinqiu Sang, Ruizhe Li, Jiaao Yu +1 more

View →

cs.AIcs.CL