Aviral Srivastava
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
Crypto×1AI×1
Frequent co-authors
Research Timeline
2026
Attention Is Where You Attack
The paper introduces the Attention Redistribution Attack (ARA), a white-box adversarial method that bypasses safety alignments in LLMs by manipulating the attention mechanism's geometry, showing that safety is not localized but emerges from attention routing.
Highlighted terms show continued research focus across papers