Bin Cui

3 indexed papers

Top categories

ML ×2, AI ×2, Architecture ×1, NLP ×1

Frequent co-authors

Xupeng Miao (3×)

Chunan Shi (1×)

Yilei Chen (1×)

Yilin Chen (1×)

Weitong Qian (1×)

Beicheng Xu (1×)

Research Timeline

2026

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

AutoSci is a memory-centric agentic system designed to automate the entire scientific research lifecycle by integrating structured memory, multi-stage execution, and continuous self-improvement.

DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning

The paper proposes DARTS, a distribution-aware active rollout trajectory shaping method that accelerates LLM reinforcement learning by actively shaping the long-tail response distribution toward conciseness and certainty.

Multi-Segment Attention: Enabling Efficient KV-Cache Management for Faster Large Language Model Serving

The paper proposes AsymCache, a computation-latency-aware KV cache management system that optimizes LLM inference by aligning cache eviction decisions with GPU attention kernel performance, significantly reducing both Time-to-First-Token (TTFT) and Time-Per-Output-Token (TPOT).

Papers

cs.AR · cs.CL · cs.LG · Recent · Jun 1, 2026

Multi-Segment Attention: Enabling Efficient KV-Cache Management for Faster Large Language Model Serving

Chunan Shi, Yilei Chen, Yilin Chen, Xupeng Miao +1 more

cs.AI · Recent · May 29, 2026