Guang Zhang
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper proposes $D^3$, a dynamic graph-constrained scheduling framework that optimizes LLM training order by modeling sample interactions as a dynamic influence graph.
The paper proposes DiReCT, a novel framework that treats data selection during LLM annealing as a constrained optimization problem based on the spectral geometry of the loss landscape, achieving state-of-the-art performance.
Papers
D$^3$: Dynamic Directional Graph-Constrained Data Scheduling for LLM Training
The paper proposes $D^3$, a dynamic graph-constrained scheduling framework that optimizes LLM training order by modeling sample interactions as a dynamic influence graph.