Diptisha Samanta
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories: Crypto ×1, AI ×1
Research Timeline
2026
Automated Framework to Evaluate and Harden LLM System Instructions against Encoding Attacks
The paper introduces an automated framework demonstrating that LLM system instructions are vulnerable to encoding attacks, where structured output requests can bypass safety refusals and leak sensitive content.