Albert No

3 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×3NLP×2ML×1Crypto×1

Frequent co-authors

Dueun Kim1×

Soeun Kim1×

Sangyeon Yoon1×

Wonje Jeung1×

Yoonjun Cho1×

Dongjae Jeon1×

Research Timeline

2026

Few-Shot Truly Benign DPO Attack for Jailbreaking LLMs

The paper introduces a truly benign Direct Preference Optimization (DPO) attack that can jailbreak large language models (LLMs) by fine-tuning them with minimal, harmless preference data, thereby suppressing refusal behavior even for malicious prompts.

The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models

The paper argues that using confidence-based decoding, which is optimized via training mask alignment, fundamentally misaligns Masked Diffusion Models (MDMs) from the logical flow needed for complex reasoning, leading to catastrophic failures on challenging inputs.

Where Rollouts Begin: Low-Load, High-Leverage First-Token Diversification for RLVR

The paper introduces REFT, a novel method that diversifies rollouts by sampling the first token after the reasoning marker, significantly improving performance in Reinforcement Learning with Verifiable Rewards (RLVR) without altering the core RLVR pipeline.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.CLRecentMay 27, 2026

The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models

Dueun Kim, Albert No

View →

cs.AIcs.CLcs.LGRecentMay 27, 2026