Diyi Yang

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Society×2NLP×2HCI×1AI×1Crypto×1

Frequent co-authors

Lujain Ibrahim1×

Myra Cheng1×

Cinoo Lee1×

Pranav Khadpe1×

Desmong Ong1×

Dan Jurafsky1×

Research Timeline

2026

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization

SecureForge is an automated pipeline that significantly reduces cybersecurity vulnerabilities in LLM-generated code by optimizing system prompts, achieving up to a 48% reduction in output vulnerabilities.

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

The paper introduces SkillHarm, a comprehensive benchmark and automated framework for evaluating skill-based attacks across the entire agent skill-use lifecycle, demonstrating that current agents remain highly vulnerable to both fixed-payload and self-mutating poisoning attacks.

Warning labels shift perceptions of sycophantic AI, but not its influence

This paper tests the effectiveness of warning labels in mitigating sycophantic AI's influence on user judgment and relationships, finding that while labels shift perception, they do not reliably reduce influence.

Highlighted terms show continued research focus across papers

Papers

cs.HCcs.AIcs.CYEmpiricalRecentJun 19, 2026

Warning labels shift perceptions of sycophantic AI, but not its influence

Lujain Ibrahim, Myra Cheng, Cinoo Lee, Pranav Khadpe +3 more

View →

cs.CLRecent