Jenny Bao

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×1Crypto×1

Frequent co-authors

Xuanli He1×

Bilgehan Sel1×

Faizan Ali1×

Hoagy Cunningham1×

Jerry Wei1×

Research Timeline

2026

Segment-Level Coherence for Robust Harmful Intent Probing in LLMs

The paper introduces a robust streaming probing objective that requires multiple evidence tokens to support a prediction, significantly improving the detection of harmful intent in LLMs, especially in sensitive CBRN domains.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.CRRecentApr 16, 2026

Segment-Level Coherence for Robust Harmful Intent Probing in LLMs

Xuanli He, Bilgehan Sel, Faizan Ali, Jenny Bao +2 more

View →