Yujiu Yang

5 indexed papers

Top categories

NLP ×4 · AI ×4 · ML ×3 · Vision ×2

Frequent co-authors

Junjie Wang (2×)

Chufan Shi (2×)

Yusong Zhao (1×)

Yuejin Xie (1×)

Youliang Yuan (1×)

Junjie Hu (1×)

Research Timeline

2026

OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

The paper introduces OmniVerifier-M1, a multimodal meta-verifier that uses symbolic outputs and decoupled reinforcement learning to provide robust, fine-grained verification and error localization for large multimodal models.

Integrated and Cross-Architecture Interpretation of LLM Reasoning

The paper introduces an Integrated, cross-Architecture Reasoning (IAR) framework, a unified and robust method for interpreting the opaque reasoning processes of large language models.

Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO

The paper proposes S2L-PO, a framework that uses smaller, naturally diverse models as structured explorers to enhance the policy-level diversity and performance of larger language models during training.

Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning

The paper introduces Temperature-Scaled On-Policy Self-Distillation (TS-OPSD), a method that internalizes temperature-based policy reheating into the model parameters to counter entropy collapse in reinforcement learning.

PaSBench-Video: A Streaming Video Benchmark for Proactive Safety Warning

The paper introduces PaSBench-Video, a streaming video benchmark that tests the ability of multimodal LLMs to issue proactive safety warnings. It finds that current models struggle with temporal precision and produce high false-positive rates.

Papers

cs.CL · cs.AI · cs.CV · Recent · Jun 1, 2026

PaSBench-Video: A Streaming Video Benchmark for Proactive Safety Warning

Yusong Zhao, Yuejin Xie, Youliang Yuan, Junjie Hu +3 more

cs.CL · cs.LG · Recent · May 30, 2026