Jianwei Li

2 indexed papers

Top categories

Crypto ×2, AI ×2, ML ×2

Frequent co-authors

Jung-Eun Kim (2×)

Research Timeline

2026

Position: Retire the "Positive Backdoor" Label -- Secret Alignment Requires Strict and Systematic Evaluation

The paper argues that the 'positive backdoor' label should be retired and replaced with 'Secret Alignment,' asserting that all such protective claims require rigorous, standardized evaluation due to inherent brittleness.

Position: Retire the "Positive Backdoor" Label -- Secret Alignment Requires Strict and Systematic Evaluation

The paper argues that the 'positive backdoor' label should be retired and replaced with 'Secret Alignment,' asserting that such protective claims must be rigorously evaluated for security, especially concerning confidentiality, integrity, and availability.

Papers

cs.CR, cs.AI, cs.LG | Recent | May 27, 2026

Position: Retire the "Positive Backdoor" Label -- Secret Alignment Requires Strict and Systematic Evaluation

Jianwei Li, Jung-Eun Kim

cs.CR, cs.AI, cs.LG | Recent | May 27, 2026