Ye Wu

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×1

Frequent co-authors

Guanlong Wu1×

Zhaohan li1×

Yao Zhang1×

Zheng Zhang1×

Jianyu Niu1×

Yinqian Zhang1×

Research Timeline

2026

CachePrune: Privacy-Aware and Fine-Grained KV Cache Sharing for Efficient LLM Inference

CachePrune introduces a privacy-aware, fine-grained KV cache sharing mechanism that allows LLM inference systems to safely reuse cache entries across users' requests, significantly improving efficiency while eliminating side-channel leakage.

Highlighted terms show continued research focus across papers

Papers

cs.CRRecentMay 22, 2026

CachePrune: Privacy-Aware and Fine-Grained KV Cache Sharing for Efficient LLM Inference

Guanlong Wu, Zhaohan li, Yao Zhang, Zheng Zhang +3 more

View →