Qi Liu

14 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×8ML×3NLP×3Robotics×2Vision×2Crypto×2Distributed×1Info Retrieval×1

Frequent co-authors

Jiaqi Liu3×

Dong Jing2×

Tianqi Zhang2×

Zhiwu Lu2×

Mingyu Ding2×

Anqi Liu2×

Research Timeline

2026

Ciphertext-Policy ABE for $\mathsf{NC}^1$ Circuits with Constant-Size Ciphertexts from Succinct LWE

The paper presents a lattice-based Ciphertext-Policy Attribute-Based Encryption (CP-ABE) scheme that supports $\mathsf{NC}^1$ access policies while maintaining constant-size ciphertexts.

Breaking the Secret: Economic Interventions for Combating Collusion in Embodied Multi-Agent Systems

The paper proposes a mutagenic incentive intervention approach that mitigates collusion in embodied multi-agent systems by reshaping agents' payoff structures, effectively inducing defection and maintaining system efficiency.

EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents

The paper introduces EgoBench, the first interactive multimodal benchmark designed to jointly evaluate advanced AI agents' capabilities in visual perception, multi-hop reasoning, and dynamic tool usage in real-world, egocentric scenarios.

Entropy-KL Divergence-based Token Masking: A Novel Approach for Selective Fine-tuning of Large Language Models

The paper proposes EKSFT, a selective fine-tuning method that masks high-entropy or high-KL divergence tokens during Supervised Fine-Tuning (SFT) to prevent distribution shift and improve subsequent Reinforcement Learning (RL) performance.

Configurable Reward Model for Balanced Safety Alignment

The paper introduces the Configurable Safety Reward Model (CSRM), a novel reward model that can be jointly optimized for calibrated safety compliance and reward modeling, significantly improving LLM safety alignment across diverse and unseen safety configurations.

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

The paper introduces SAVE, a framework that uses on-policy feedback and the value function to self-supervise and improve reward models, significantly enhancing RLHF performance across multiple benchmarks.

Repurposing Adversarial Perturbations for Continual Learning: From Defense to Active Alignment

The paper introduces AdvCL, a framework that repurposes adversarial perturbations as a geometric control signal to stabilize continual learning in large language models, significantly reducing forgetting and enhancing robustness.

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

The paper introduces AutoMedBench, a novel workflow-aware benchmark that evaluates autonomous medical-AI agents across a five-stage research process, revealing that agents struggle most with validation and submission.

TempoVLA: Learning Speed-Controllable Vision-Language-Action Policies

TempoVLA is a novel Vision-Language-Action model that enables controllable execution speed for robot manipulation by explicitly conditioning the policy on the desired speed.

Looped World Models

Introduces Looped World Models, a looped architecture for world modelling that iteratively refines latent environment states for up to 100x parameter efficiency.

Toward Calibrated Mixture-of-Experts Under Distribution Shift

This paper studies the behavior of mixture-of-experts (MoE) models under distribution shift and proposes an adversarial reweighting method to improve their calibration.

Learning Action Priors for Cross-embodiment Robot Manipulation

This paper proposes a two-stage training framework to pretrain action modules with motion priors before Vision-Language-Action (VLA) alignment, improving VLA performance and reducing optimization challenges.

From Bootstrapping to Sequence Modeling: A Unified Generative Framework for Personalized Landing-Page Modeling

This paper proposes GLAN, a sequence modeling framework for Personalized Landing Page Modeling on online platforms, addressing the limitations of previous reinforcement learning approaches.

Application-Driven Architecture Exploration for Cross-Layer Heterogeneous Systems

The paper presents CHASE, an application-driven framework that explores physically feasible Cross-layer Heterogeneous System architectures for executing workloads with diverse requirements.

Highlighted terms show continued research focus across papers

Papers

cs.DCEmpiricalRecentJul 25, 2026

Application-Driven Architecture Exploration for Cross-Layer Heterogeneous Systems

Yuchen Fan, Minghong Sun, Jikui Ma, Yunpeng Xu +18 more

The paper presents CHASE, an application-driven framework that explores physically feasible Cross-layer Heterogeneous System architectures for executing workloads with diverse requirements.

View →

cs.IREmpiricalRecent