ArXivCSExplorer

Pouya Mahdi Gholami

1 indexed paper

Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0

Publications per year

2026: 1

Top categories

Distributed ×1, Architecture ×1, ML ×1

Frequent co-authors

Jianru Ding (1×)
Ryien Hosseini (1×)
Mingyuan Xiang (1×)
Henry Hoffmann (1×)

Research Timeline

2026
Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

The paper proposes scheduling LLM agent workloads at the conversation level rather than the turn level, significantly reducing latency and improving energy efficiency by transforming unpredictable multi-turn inference into a stable, two-phase process.


Papers

cs.DC · cs.AR · cs.LG · Recent · Jun 1, 2026

Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

Jianru Ding, Ryien Hosseini, Pouya Mahdi Gholami, Mingyuan Xiang +1 more
