Huang

50 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×31ML×13Vision×11NLP×10Crypto×8Info Retrieval×3Architecture×3Signal Processing×1

Frequent co-authors

Jian Weng3×

Zhen Huang3×

Yang Yang2×

Guomin Yang2×

Yingjiu Li2×

Rui Shi2×

Research Timeline

2026

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

The paper reframes Parameter-Efficient Fine-Tuning (PEFT) from a mere cost-saving alternative to a robust architecture for creating persistent, personalized models that layer specific behaviors onto large shared foundation models.

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

This paper introduces Imaginative Perception Tokens (IPT) to improve spatial reasoning in vision language models.

Formalizing the Binding Problem

This paper formalizes the binding problem using information theory and develops a probing method to measure binding information in deep learning representations, demonstrating that binding is crucial for strong visual recognition.

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

The paper proposes Skill-RM, a unified framework that treats reward modeling as an agentic task to consistently integrate diverse evaluation criteria, achieving superior performance over traditional methods.

MOSAIC: Efficient Mixture-of-Agent Scheduling via Adaptive Aggregation and Inference Concurrency

MOSAIC is a novel scheduling framework that significantly accelerates Mixture-of-Agents (MoA) workloads by jointly optimizing expert placement and utilizing confidence-aware adaptive aggregation.

AI Agents Enable Adaptive Computer Worms

The paper demonstrates a novel, self-sustaining computer worm powered by AI agents that generates tailored attack strategies in real-time, representing a significant shift from traditional, vulnerability-exploiting malware.

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

The paper introduces a Contextual Integrity (CI) framework and a new benchmark (DelegateCI-Bench) to rewrite user queries sent to cloud LLMs, ensuring only task-essential information is retained while preserving utility and maximizing privacy.

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

The paper introduces RUBAS, a rubric-based reinforcement learning framework that improves agent safety by providing fine-grained, multi-dimensional rewards for complex tool-use scenarios.

Uncertainty-Aware End-to-End Co-Design of Neural Network Processors: From Training and Mapping to Fabrication

This paper presents a unified framework for end-to-end co-design of neural network processors.

DIST-FL: Enhancing Security for TEE-based Aggregation in Federated Learning

The paper proposes DIST-FL, a distributed system using multiple TEEs and an append-only ledger to enhance the security and robustness of federated learning aggregation against server-side adversaries.

Revisiting Privacy Amplification by Subsampling in Selective Release DPSGD

The paper proposes DPSR-CG, a novel differentially private selective release mechanism that rigorously maintains strict privacy guarantees while significantly improving model utility compared to existing methods.

OneReason Technical Report

The paper proposes OneReason, a framework that enhances the reasoning capability of generative recommendation models by focusing on improving item perception and structuring user behavior into coherent latent interests.

Steering LLM Viewpoints through Fabricated Evidence Injection

This paper introduces Ghostwriter, an attack framework demonstrating that LLMs are highly vulnerable to adopting misleading viewpoints when provided with fabricated, yet credible-looking, evidence.

PriSrv+: Privacy and Usability-Enhanced Wireless Service Discovery with Fast and Expressive Matchmaking Encryption

The paper introduces PriSrv+, an advanced service discovery protocol that significantly enhances privacy, usability, and efficiency in wireless networks through a novel matchmaking encryption scheme called FEME.

PriSrv: Privacy-Enhanced and Highly Usable Service Discovery in Wireless Communications

The paper proposes PriSrv, a novel private service discovery protocol that enhances wireless communication security and privacy by enabling fine-grained, multi-layered matching and mutual authentication.

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

This paper proposes a training-free framework called ReasonAlloc to mitigate inference bottlenecks in large language models by recasting decoding-time key-value compression as a hierarchical budget allocation problem.

Fundamentals of NOMA in Low-Earth Orbit Coordinated Multi-Satellite Networks

This paper investigates the downlink performance of CoMS-NOMA networks from a system-level perspective.

CORE-Bench: A Comprehensive Benchmark for Code Retrieval in the Era of Agentic Coding

This paper introduces CORE-Bench, a comprehensive benchmark for code retrieval in agentic coding.

CompRank: Efficient LLM Reranking via Token-Level Compression and Decoding-Free Scoring

This paper proposes CompRank, a token-efficient reranking framework for large language models that reduces redundant computation and achieves strong reranking performance.

Agents-K1: Towards Agent-native Knowledge Orchestration

This paper introduces Agents-K1, an end-to-end knowledge orchestration pipeline that converts raw documents into agent-native scientific knowledge graphs.

Highlighted terms show continued research focus across papers

Papers

cs.AIEmpiricalRecentJun 11, 2026

Agents-K1: Towards Agent-native Knowledge Orchestration

Zongsheng Cao, Bihao Zhan, Jinxin Shi, Jiong Wang +21 more

This paper introduces Agents-K1, an end-to-end knowledge orchestration pipeline that converts raw documents into agent-native scientific knowledge graphs.

View →

cs.IREmpiricalRecent