Yunpu Ma

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×1ML×1AI×1

Frequent co-authors

Junlong Tong1×

Yao Zhang1×

Anhao Zhao1×

Yingqi Fan1×

Xiaoyu Shen1×

Jinhe Bi1×

Research Timeline

2026

EchoRL: Reinforcement Learning via Rollout Echoing

EchoRL proposes a lightweight module to exploit valuable learning signals from advantage-degenerated rollouts in Reinforcement Learning with Verifiable Rewards (RLVR), significantly improving LLM post-training performance.

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

ProactiveLLM introduces a novel framework that enables streaming LLMs to actively decide when to interact with incoming data by leveraging the model's internal states, significantly reducing latency while maintaining quality.

Highlighted terms show continued research focus across papers

Papers

cs.CLRecentMay 30, 2026

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

Junlong Tong, Yao Zhang, Anhao Zhao, Yingqi Fan +2 more

View →

cs.LGcs.AIRecentMay 29, 2026