Sina Alemohammad
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
NLP×1AI×1ML×1
Frequent co-authors
Research Timeline
2026
Not All Synthetic Data Is Yours to Learn From
Weak self-training on synthetic data can amplify a language model's existing capabilities, but this effect is strictly dependent on the compatibility between the source and student models, not on the data's intrinsic quality.
Highlighted terms show continued research focus across papers