Vidhata Jayaraman — arXiv Papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×1AI×1

Frequent co-authors

Max Hartman1×

Moulik Choraria1×

Yash Savani1×

Lav R. Varshney1×

Research Timeline

2026

Hiding in Plain Sight: Detectability-Aware Antidistillation of Reasoning Models

The paper introduces TraceGuard, a detectability-aware antidistillation method that identifies and poisons 'thought anchors'—sparsely critical sentences—to degrade student model learning without making the defense obvious.

Highlighted terms show continued research focus across papers