Yi Yang

21 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×9AI×8Crypto×4Vision×3Robotics×2Audio and Speech Processing×2ML×2HCI×2

Frequent co-authors

Ziyi Yang3×

Diyi Yang3×

Hangjie Yuan1×

Yichen Qian1×

Zhiwei Tang1×

Xianzhe Xu1×

Research Timeline

2026

Checkerboard: A Simple, Effective, Efficient and Learning-free Clean Label Backdoor Attack with Low Poisoning Budget

The paper introduces Checkerboard, a novel, learning-free clean-label backdoor attack that efficiently poisons training data to compromise model integrity with minimal poisoning budget.

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization

SecureForge is an automated pipeline that significantly reduces cybersecurity vulnerabilities in LLM-generated code by optimizing system prompts, achieving up to a 48% reduction in output vulnerabilities.

StoryLens: Preference-Aligned Story Rewriting via Context-Aware Narrative Enrichment

The paper introduces STORYLENSWRITER, a novel framework that significantly improves personalized story rewriting by incorporating context-aware narrative enrichment, outperforming style-only adaptation.

A Unified Framework for the Evaluation of LLM Agentic Capabilities

The paper introduces a unified framework to fairly evaluate LLM agentic capabilities by standardizing diverse benchmarks and separating the effects of the LLM model from the surrounding framework and environment.

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

The paper proposes SAAS, a novel RL framework that equips LLM agents with self-awareness to precisely regulate search behavior, significantly mitigating costly over-search without sacrificing accuracy.

Evolving Skill-Structured Attack Memory Enhances LLM Jailbreaking

The paper proposes MemoAttack, a memory-driven black-box jailbreak framework that systematically models, evolves, and selects attack experiences to significantly enhance LLM jailbreaking success rates.

A Unified and Reproducible Experimentation Framework for Speech Understanding

The paper introduces SURE, a unified framework designed to standardize and improve the comparability and reproducibility of evaluations for advanced speech understanding models.

A Reconfigurable Computing In-Memory Macro with Charge-sharing-based Weighted Accumulator

The paper proposes a highly reconfigurable 256x128 in-memory computing array that significantly improves efficiency and performance for analog computing by introducing novel components for ADC, weighted accumulation, and bitcell design.

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

This paper investigates if upper-face affective cues enhance audiovisual sentence recognition, especially when audio is degraded, finding that while mouth cues are crucial for robustness, upper-face cues improve overall confidence and performance under noise.

Skill or Skip? Learning Selective Skill Invocation in Agentic Tasks via Dual-Granularity Preference Learning

SelSkill introduces a dual-granularity preference learning framework that treats skill use as a 'skill-or-skip' decision, significantly improving agent performance and execution precision in complex agentic tasks.

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

The paper introduces SkillHarm, a comprehensive benchmark and automated framework for evaluating skill-based attacks across the entire agent skill-use lifecycle, demonstrating that current agents remain highly vulnerable to both fixed-payload and self-mutating poisoning attacks.

Warning labels shift perceptions of sycophantic AI, but not its influence

This paper tests the effectiveness of warning labels in mitigating sycophantic AI's influence on user judgment and relationships, finding that while labels shift perception, they do not reliably reduce influence.

Regime-Aware Peer Specialization for Robust RAG under Heterogeneous Knowledge Conflicts

This paper proposes RAPS-DA, a framework that addresses conflicts in retrieval-augmented generation using a regime-aware peer specialization system and a dual-layer selector.

Alignment Is All You Need For X-to-4D Generation

This paper introduces Align4D, a framework for generating coherent video-3D pairs using any-modal input, achieving state-of-the-art quality and consistency in X-to-4D generation.

Clinical Translation of Brain-Computer Interface in China: A Landscape Analysis of Investigator-Initiated Trials, Registered Clinical Trials, and Regulatory Approval

This paper presents the first quantitative analysis of China's Brain-Computer Interface (BCI) translational ecosystem, examining clinical trials, investigator-initiated trials, and regulatory-approved products.

BadWAM: When World-Action Models Dream Right but Act Wrong

This paper introduces BadWAM, a framework for modeling and evaluating World-Action Drift Attacks, a new class of adversarial attacks that break the alignment between a World-Action Model's (WAM's) imagined future and its executed actions.

Adaptive Momentum Enhanced Distributed Multichannel Active Noise Control for Faster Convergence under Communication Delays

This paper proposes an adaptive momentum term for the ASSS-MGDFxLMS algorithm in distributed multichannel active noise control systems to accelerate convergence while maintaining robustness under communication delays.

Robots Acquire Manipulation Skills in Seconds from a Single Human Video

A new framework called HOST enables robots to acquire new skills from a single human video in seconds while retaining previously mastered skills.

Skill Self-Play: Pushing the Frontier of LLM Capability with Co-Evolving Skills

This paper introduces Skill Self-Play (Skill-SP), a co-evolutionary framework for LLM training that bridges the gap between structured verification and open-ended exploration.

ClinFusion: A Vision-Centric Multimodal LLM System for Holistic Medical Understanding

This paper introduces ClinFusion, a vision-centric multimodal large language model designed for holistic medical understanding, featuring a Cascade Spatial-Aware Locality Fusion operator and a vision-grounded evaluation framework.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.CLEmpiricalRecentJul 27, 2026

ClinFusion: A Vision-Centric Multimodal LLM System for Holistic Medical Understanding

Hangjie Yuan, Yichen Qian, Zhiwei Tang, Xianzhe Xu +20 more

View →

cs.CLEmpirical