Huazheng Wang
2 indexed papers
Research Timeline
The paper proposes Speculative Pipeline Decoding (SPD), a framework that uses pipeline parallelism to accelerate LLM inference by processing multiple tokens in parallel, and reports higher speedup with zero latency bubbles.
EvoPool is an evolutionary multi-agent framework that efficiently generates high-quality, specialized supervision labels and is reported to significantly outperform LLM annotation baselines across complex, label-scarce domains.
Papers
EvoPool: Evolutionary Programmatic Annotation for Label-Efficient Specialized Supervision