Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Henry Hoffmann

Henry Hoffmann

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

Distributed×1Architecture×1ML×1

Frequent co-authors

Jianru Ding1×
Ryien Hosseini1×
Pouya Mahdi Gholami1×
Mingyuan Xiang1×

Research Timeline

2026
Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

The paper proposes scheduling LLM agent workloads at the conversation level rather than the turn level, significantly reducing latency and improving energy efficiency by transforming unpredictable multi-turn inference into a stable, two-phase process.

Highlighted terms show continued research focus across papers

Papers

cs.DCcs.ARcs.LGRecentJun 1, 2026

Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving

Jianru Ding, Ryien Hosseini, Pouya Mahdi Gholami, Mingyuan Xiang +1 more

The paper proposes scheduling LLM agent workloads at the conversation level rather than the turn level, significantly reducing latency and improving energy efficiency by transforming unpredictable mul…

View →