Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Ari Holtzman

Ari Holtzman

2 indexed papers

Recent (6 mo)
2
With code
0
Influential cites
0
Benchmarked
0

Publications per year

2
26

Top categories

AI×2ML×1Crypto×1

Frequent co-authors

Todd Nief1×
Harvey Yiyun Fu1×
Mark Muchane1×
Peter West1×

Research Timeline

2026
Can You Keep a Secret? Involuntary Information Leakage in Language Model Writing

Frontier language models involuntarily leak secret information through thematic elements in their writing, even when explicitly instructed to keep the secret hidden.

Subliminal Learning is a LoRA Artifact

The paper demonstrates that the phenomenon of 'subliminal learning,' where behavioral traits are transmitted between language models, is not a fundamental learning mechanism but rather a fragile artifact of LoRA fine-tuning and specific contextual tokens.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.LGRecentMay 30, 2026

Subliminal Learning is a LoRA Artifact

Todd Nief, Harvey Yiyun Fu, Mark Muchane, Ari Holtzman

The paper demonstrates that the phenomenon of 'subliminal learning,' where behavioral traits are transmitted between language models, is not a fundamental learning mechanism but rather a fragile artif…

View →
cs.CRcs.AIRecentMay 11, 2026

Can You Keep a Secret? Involuntary Information Leakage in Language Model Writing

Ari Holtzman, Peter West

Frontier language models involuntarily leak secret information through thematic elements in their writing, even when explicitly instructed to keep the secret hidden.

View →