Ping Xiong
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes Two-stage Backdoor Hijacking (TSBH) to create persistent, trigger-activated malicious behaviors by manipulating the observable Chain-of-Thought (CoT) process in Large Language Models.
The study found that human judgment of logical fallacies is significantly biased by source labels (e.g., human vs. AI), while LLM evaluations remained comparatively stable across these source conditions.
Papers
Label Over Logic? How Source Cues Bias Human Fallacy Judgments More Than LLMs
The study found that human judgment of logical fallacies is significantly biased by source labels (e.g., human vs. AI), while LLM evaluations remained comparatively stable across these source conditio…