Built by Teycir Ben Soltane
ArXivCSExplorer
Yu Jiang

4 indexed papers

Recent (6 mo): 4
With code: 0
Influential cites: 0
Benchmarked: 0

Publications per year

2026: 4

Top categories

NLP ×2, Crypto ×2, AI ×2, Software Eng. ×1, Vision ×1, Multimedia ×1

Frequent co-authors

Linfeng Liu (1×)
Tiffany Zhan (1×)
Louie Hong Yao (1×)
Saptarshi Ghosh (1×)
Tianyu Jiang (1×)
Puzhuo Liu (1×)

Research Timeline

2026
Evaluating Privilege Usage of Agents with Real-World Tools

The paper introduces GrantBox, a new security sandbox that evaluates how well LLM agents handle real-world tool privileges, finding that agents remain highly vulnerable to sophisticated attacks.

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

VCap introduces a novel Witness-Adjudicator reward mechanism that provides highly precise, factually grounded feedback for visual captioning, enabling state-of-the-art performance in RL-trained multimodal models.

Cross-Lingual Steering for Figurative Language Generation

The paper demonstrates that the internal signals governing figurative language generation are reusable across multiple languages, showing that a steering direction learned in one language can effectively enhance generation in another.

CODEFUSE-DEBENCH: An Empirical Study on Readability, Recompilability, and Functionality

The paper introduces DEBENCH, a framework that evaluates binary decompilers along three orthogonal dimensions (readability, recompilability, and functionality) and finds that functional recovery is significantly harder than producing readable code.


Papers

cs.CL · Recent · May 28, 2026

Cross-Lingual Steering for Figurative Language Generation

Linfeng Liu, Tiffany Zhan, Louie Hong Yao, Saptarshi Ghosh +1 more

cs.SE, cs.CR · Recent · May 28, 2026

CODEFUSE-DEBENCH: An Empirical Study on Readability, Recompilability, and Functionality

Puzhuo Liu, Yuhan Huang, Jianlei Chi, Peng Di +1 more

cs.CV, cs.AI, cs.CL · Recent · May 27, 2026

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

Xingyu Lu, Jinpeng Wang, Yi-Fan Zhang, Yankai Yang +12 more

cs.CR, cs.AI · Recent · Mar 30, 2026

Evaluating Privilege Usage of Agents with Real-World Tools

Quan Zhang, Lianhang Fu, Lvsi Lian, Gwihwan Go +4 more
