When Context Flips, Safety Breaks: Diagnosing Brittle Safety in Aligned Language Models | ArxivCSExplorer