Marco Arazzi

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×3AI×2NLP×1ML×1

Frequent co-authors

Vignesh Kumar Kembu3×

Antonino Nocera3×

Saraga Sakthidharan2×

Stjepan Picek1×

Aiman Al Masoud1×

Antony Anju1×

Research Timeline

2026

SecureBreak -- A dataset towards safe and secure models

The paper introduces SecureBreak, a manually annotated, safety-oriented dataset designed to help detect harmful outputs from large language models (LLMs) that bypass existing security alignments.

Security in LLM-as-a-Judge: A Comprehensive SoK

This paper provides the first comprehensive Systematization of Knowledge (SoK) on the security aspects of LLM-as-a-Judge (LaaJ) systems, identifying key vulnerabilities and proposing a taxonomy for future research.

You Snooze, You Lose: Automatic Safety Alignment Restoration through Neural Weight Translation

The paper introduces NeWTral, a framework that restores safety alignment to specialized LLM adapters without sacrificing their domain-specific knowledge, achieving a significant reduction in attack success rates while maintaining high fidelity.

Highlighted terms show continued research focus across papers

Papers

cs.CRRecentMay 6, 2026

You Snooze, You Lose: Automatic Safety Alignment Restoration through Neural Weight Translation

Marco Arazzi, Vignesh Kumar Kembu, Antonino Nocera, Stjepan Picek +1 more

View →

cs.CRcs.AIRecentMar 31, 2026