Yi Zhou

17 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×12Crypto×11NLP×7ML×5Vision×4Robotics×1HCI×1Multiagent×1

Frequent co-authors

Chunyi Zhou6×

Tianyi Zhou5×

Shouling Ji5×

Dongrui Liu3×

Leitao Yuan3×

Jing Shao3×

Research Timeline

2026

Unveiling the Security Risks of Federated Learning in the Wild: From Research to Practice

This paper argues that much of the existing research on Federated Learning (FL) security is based on idealized assumptions, and provides a practical evaluation framework showing that real-world attack performance is often less severe and more unstable than predicted.

ACIArena: Toward Unified Evaluation for Agent Cascading Injection

The paper introduces ACIArena, a unified and comprehensive evaluation framework designed to systematically test the robustness of Multi-Agent Systems against complex Agent Cascading Injection attacks.

Compiling Activation Steering into Weights via Null-Space Constraints for Stealthy Backdoors

The paper proposes a novel method to inject reliable, sustained backdoors into LLMs by compiling an activation steering vector into model weights, ensuring the backdoor only activates upon a specific trigger.

ArmSSL: Adversarial Robust Black-Box Watermarking for Self-Supervised Learning Pre-trained Encoders

ArmSSL is a novel watermarking framework that provides robust, black-box ownership verification for self-supervised learning encoders while maintaining high utility and resisting adversarial attacks.

Profiling for Pennies: Unveiling the Privacy Iceberg of LLM Agents

The paper introduces the PrivacyIceberg framework to systematically categorize and empirically demonstrate the high risk of automated, deep personal profiling using LLM agents, revealing a significant gap between public concern and platform safeguards.

PropGuard: Safeguarding LLM-MAS via Propagation-Aware Exploration and Remediation

PropGuard introduces a propagation-aware framework to safeguard LLM-MAS against malicious attacks by constructing a dual-view graph, identifying suspicious propagation paths, and applying source-guided remediation.

From Compression to Accountability: Harmless Copyright Protection for Dataset Distillation

The paper proposes SubPopMark, a novel subpopulation-driven framework that injects harmless, verifiable markers into distilled datasets to prevent copyright infringement and data leakage.

Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning

This paper systematically investigates how various plasticity interventions affect the vulnerability of deep reinforcement learning agents to backdoor attacks, finding that most interventions mitigate threats while one specific intervention exacerbates them.

Evo-Attacker: Memory-Augmented Reinforcement Learning for Long-Horizon Tool Attacks on LLM-MAS

Evo-Attacker introduces a memory-augmented reinforcement learning framework to perform generalized, long-horizon tool attacks on LLM-MAS, significantly outperforming existing methods.

AI, Take the Wheel: What Drives Delegation and Trust in Human-Computer Cooperative Question Answering?

This study investigates human-AI collaboration in question answering, finding that while collaboration is beneficial, humans make suboptimal decisions by both under-relying on correct AI suggestions and over-relying when the AI is misleading.

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex, open-world agentic scenarios.

Versatile Framework with Semantic and Structural guidance for Image Reconstruction from Brain Activity

The paper proposes MindDiffuser, a two-stage framework that significantly improves image reconstruction from brain activity by combining semantic guidance from text-to-image models with structural refinement using shallow visual features.

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

The paper introduces MiraBench, a new benchmark that evaluates the action-conditioned reliability of robotic world models, finding that visual fidelity is insufficient and that optimism bias is a pervasive issue across current systems.

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

The paper introduces AgentDoG 1.5, a lightweight and scalable alignment framework that significantly improves AI agent safety and security for complex open-world agent deployments.

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

COLLEAGUE.SKILL introduces an automated system that distills heterogeneous traces of human expertise and role-specific knowledge into portable, inspectable, and usable AI skill packages.

Sandboxed Coding Agents are Competitive Omni-modal Task Solvers

The paper demonstrates that specialized coding agents, using only text and image access within a sandbox, can effectively solve complex omnimodal tasks, often outperforming state-of-the-art native omnimodal models.

RiskFlow: Fast and Faithful Safety-Critical Traffic Scenario Generation

RiskFlow is a novel framework that generates realistic and safety-critical multi-agent traffic scenarios by reformulating trajectory generation as a single-pass transport problem in the action space.

Highlighted terms show continued research focus across papers

Papers

cs.ROcs.AIRecentJun 4, 2026

RiskFlow: Fast and Faithful Safety-Critical Traffic Scenario Generation

Qi Lan, Yining Tang, Yu Shen, Yi Zhou +3 more

RiskFlow is a novel framework that generates realistic and safety-critical multi-agent traffic scenarios by reformulating trajectory generation as a single-pass transport problem in the action space.

View →

cs.CLcs.CVRecentMay 30, 2026