Amit Dhanda

2 indexed papers

Top categories

AI ×2 · NLP ×1 · ML ×1 · Multiagent ×1

Frequent co-authors

Partha Pratim Saha (1×)

Samarth Raina (1×)

Mayur Parvatikar (1×)

Vinija Jain (1×)

Aman Chadha (1×)

Amitava Das (1×)

Research Timeline

2026

Safe Equilibrium Policy Optimization for Strategic Agent Policies

The paper introduces Safe Equilibrium Policy Optimization (σepo) to train language models for multi-agent strategic tasks, achieving improved safety and robustness across various game domains.

MENTIS: What Belief Changes Under Alignment? Measuring Multi-Scale Latent Torsion in Language Models

The paper introduces MENTIS, a geometry-first framework that measures how preference alignment structurally changes the internal computations of language models, finding that these changes are selective, depth-localized, and concept-dependent.

Papers

cs.CL · cs.AI · cs.LG · Recent · May 31, 2026

MENTIS: What Belief Changes Under Alignment? Measuring Multi-Scale Latent Torsion in Language Models

Partha Pratim Saha, Samarth Raina, Mayur Parvatikar, Amit Dhanda +3 more

cs.MA · cs.AI · Recent · May 29, 2026