Aditya Nawal

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×2AI×2Crypto×2

Frequent co-authors

Manit Baser2×

Mohan Gurusamy2×

Research Timeline

2026

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

This paper introduces AgentREVEAL, a diagnostic framework that demonstrates that the utility of web retrieval in LLM agents creates a safety-utility trade-off, as relevance itself can degrade safety alignment and increase harmful compliance.

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

This paper introduces AgentREVEAL, a diagnostic framework showing that the utility of web retrieval in LLM agents creates a safety-utility trade-off, as relevance itself can degrade safety alignment and increase harmful compliance.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIcs.CRRecentMay 28, 2026

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

Aditya Nawal, Manit Baser, Mohan Gurusamy

View →

cs.CLcs.AIcs.CRRecentMay 28, 2026