Vidhata Jayaraman
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
Crypto ×1, AI ×1
Research Timeline
2026
Hiding in Plain Sight: Detectability-Aware Antidistillation of Reasoning Models
The paper introduces TraceGuard, a detectability-aware antidistillation method that identifies and poisons 'thought anchors'—sparsely critical sentences—to degrade student model learning without making the defense obvious.
Papers
cs.CR, cs.AI | Recent | Apr 25, 2026
Hiding in Plain Sight: Detectability-Aware Antidistillation of Reasoning Models
Max Hartman, Vidhata Jayaraman, Moulik Choraria, Yash Savani +1 more