Yaxin Luo

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×1AI×1ML×1

Frequent co-authors

Jiacheng Cui1×

Xiaohan Zhao1×

Xinyi Shang1×

Jiacheng Liu1×

Xinyue Bi1×

Zhaoyi Li1×

Research Timeline

2026

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

The paper introduces LLMSurgeon, a framework that estimates the domain-level data mixture of a Large Language Model (LLM) using only generated text, thereby providing a post-hoc method to audit the model's 'digital DNA'.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIcs.LGRecentMay 28, 2026

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

Yaxin Luo, Jiacheng Cui, Xiaohan Zhao, Xinyi Shang +4 more

View →