You Wu
3 indexed papers
Research Timeline
Branch Landing (BRL) is a novel forward-edge CFI framework for RISC-V that uses Bloom filters to overcome the source authorization limitations of existing hardware CFI, achieving low overhead for fine-grained source control.
BlockBatch introduces a novel framework that efficiently accelerates diffusion language model (dLLM) inference by simultaneously executing multiple block-size branches for a single request, achieving significant speedup while maintaining accuracy.
GRKV introduces a training-free KV-cache merging method that uses global regression to distribute information from evicted tokens, solving the over-merging problem inherent in span-based retention.
Papers
GRKV: Global Regression for Training-Free KV Cache Compression in Long-Context LLMs
Junjie Peng, You Wu, Haoyi Wu, Jialong Han +3 more