Aniket Anand
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
ML×1NLP×1
Frequent co-authors
Research Timeline
2026
Measuring, Localizing, and Ablating Alignment Signatures in LLMs
The paper demonstrates that the AI-like style introduced by post-training alignment can be measured, localized, and causally removed using a novel ablation technique called PASTA.
Highlighted terms show continued research focus across papers
Papers
cs.LGcs.CLRecentMay 28, 2026
Measuring, Localizing, and Ablating Alignment Signatures in LLMs
Aniket Anand, Janvijay Singh, Zhewei Sun, Dilek Hakkani-Tür +1 more
The paper demonstrates that the AI-like style introduced by post-training alignment can be measured, localized, and causally removed using a novel ablation technique called PASTA.
View →