Samarth Raina
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
NLP×1AI×1ML×1
Frequent co-authors
Research Timeline
2026
MENTIS: What Belief Changes Under Alignment? Measuring Multi-Scale Latent Torsion in Language Models
The paper introduces MENTIS, a geometry-first framework that measures how preference alignment structurally changes the internal computations of language models, finding that these changes are selective, depth-localized, and concept-dependent.
Highlighted terms show continued research focus across papers