Inform, Coach, Relate, Listen: Auditing LLM Caregiving Support Roles

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations

The paper proposes ESC-Skills, a skill-centric framework that discovers and self…

When Does Persona Prompting Actually Help? A Retrieval and Metric Analysis of Expert Role Injection…

Persona prompting does not universally improve LLM performance; instead, it syst…

NICE: A Theory-Grounded Diagnostic Benchmark for Social Intelligence of LLMs

The paper introduces NICE, a novel, theory-grounded diagnostic benchmark for ass…

When Context Flips, Safety Breaks: Diagnosing Brittle Safety in Aligned Language Models

The paper introduces 'brittle safety,' a failure mode where aligned language mod…

Reliable Multilingual Orthopedic Decision Support from Clinical Narratives: Language-Aware Adaptatio…

The paper introduces a reliability-oriented framework, IndicBERT-HPA, for multil…

Gram: Assessing sabotage propensities via automated alignment auditing

The paper introduces Gram, an automated framework that assesses AI agent propens…

Configurable Reward Model for Balanced Safety Alignment

The paper introduces the Configurable Safety Reward Model (CSRM), a novel reward…

On the impact of retrieved content representations in RAG Pipelines

The paper systematically compares multiple content representations for RAG pipel…