Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Mingyuan Xiang

Mingyuan Xiang

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

Distributed×1Architecture×1ML×1

Frequent co-authors

Jianru Ding1×
Ryien Hosseini1×
Pouya Mahdi Gholami1×
Henry Hoffmann1×

Research Timeline

2026
Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

The paper proposes scheduling LLM agent workloads at the conversation level rather than the turn level, significantly reducing latency and improving energy efficiency by transforming unpredictable multi-turn inference into a stable, two-phase process.

Highlighted terms show continued research focus across papers

Papers

cs.DCcs.ARcs.LGRecentJun 1, 2026

Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

Jianru Ding, Ryien Hosseini, Pouya Mahdi Gholami, Mingyuan Xiang +1 more

The paper proposes scheduling LLM agent workloads at the conversation level rather than the turn level, significantly reducing latency and improving energy efficiency by transforming unpredictable mul…

View →