Qi Fan

7 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×4Crypto×3ML×3NLP×2Info Retrieval×1Vision×1Databases×1

Frequent co-authors

Yingqi Fan2×

Junlong Tong2×

Xiaoyu Shen2×

Mark Vero2×

Fabian Kaczmarczyck2×

Ivan Petrov2×

Research Timeline

2026

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

The paper introduces Honeyval, a comprehensive evaluation framework, to rigorously test LLM-powered HTTP honeypots, demonstrating that these systems provide substantially longer and harder-to-detect interactions compared to traditional methods.

PhoneWorld: Scaling Phone-Use Agent Environments

The paper introduces PhoneWorld, a scalable pipeline that automatically converts real-world GUI trajectories and screenshots into controllable, reproducible phone-use environments, significantly improving agent performance across multiple mobile benchmarks.

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

The paper introduces Honeyval, a comprehensive evaluation framework, to rigorously test LLM-powered HTTP honeypots, demonstrating that these honeypots provide substantially longer and harder-to-detect interactions compared to traditional methods.

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

ProactiveLLM introduces a novel framework that enables streaming LLMs to actively decide when to interact with incoming data by leveraging the model's internal states, significantly reducing latency while maintaining quality.

Inference Cost Attacks for Retrieval-Augmented Large Language Models

This paper introduces a novel attack, RA-ICA, that targets RAG-enhanced LLMs by poisoning external knowledge bases to drastically increase inference costs, achieving up to a 13.12x increase in token consumption.

Geometry-Aware Implicit Memory for Video World Models

The paper proposes GIM-World, a geometry-aware implicit memory framework that significantly improves long-horizon video world models by explicitly encoding 3D scene geometry into a compact memory state.

CompRank: Efficient LLM Reranking via Token-Level Compression and Decoding-Free Scoring

This paper proposes CompRank, a token-efficient reranking framework for large language models that reduces redundant computation and achieves strong reranking performance.

Highlighted terms show continued research focus across papers

Papers

cs.IREmpiricalRecentJun 10, 2026

CompRank: Efficient LLM Reranking via Token-Level Compression and Decoding-Free Scoring

Xuan Lu, Haohang Huang, Yingqi Fan, Junlong Tong +4 more

This paper proposes CompRank, a token-efficient reranking framework for large language models that reduces redundant computation and achieves strong reranking performance.

View →

cs.CVRecentJun 1, 2026