Seanie Lee
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces T-MAP, a trajectory-aware evolutionary search method, to discover and generate multi-step adversarial prompts that exploit vulnerabilities in autonomous LLM agents through tool execution, significantly improving attack realization rates.
The paper proposes SELFCI, a complementary self-distillation framework that effectively balances the privacy requirements of Contextual Integrity (CI) with the utility of large language models, outperforming existing methods without external supervision.
Papers
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs
Sangwoo Park, Woongyeong Yeo, Seanie Lee, Yumin Choi +5 more
The paper proposes SELFCI, a complementary self-distillation framework that effectively balances the privacy requirements of Contextual Integrity (CI) with the utility of large language models, outper…