Wei Lu

10 indexed papers

Top categories

NLP ×8, Crypto ×5, AI ×4, ML ×3, Info Retrieval ×2, Vision ×1, Software Eng. ×1

Frequent co-authors

Yawei Luo (2×)

Wenhang Shi (2×)

Jinhao Dong (2×)

Yiren Chen (2×)

Zhe Zhao (2×)

Shuqing Bian (2×)

Research Timeline

2026

Geometry-Aware Localized Watermarking for Copyright Protection in Embedding-as-a-Service

The paper proposes GeoMark, a geometry-aware localized watermarking framework that robustly protects Embedding-as-a-Service (EaaS) against model stealing and copyright infringement while preserving utility.

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

The paper introduces Canonical-Context On-Policy Distillation (CCOPD) to improve multi-turn language model performance by mitigating 'self-anchored drift,' ensuring consistent answers regardless of whether the evidence is presented in a single prompt or gradually across multiple turns.

Implicit Identity Technologies for LLMs: Fingerprinting and Watermarking across Datasets, Models, and Generated Content

This paper introduces 'implicit identity' as a unifying framework to survey and categorize LLM fingerprinting and watermarking techniques for verifying ownership and provenance across datasets, models, and generated content.

DiscourseFlip: An Oblique Discourse-Level Opinion Manipulation Attack against Black-box Retrieval-Augmented Generation

The paper introduces DiscourseFlip, a novel graph-guided attack that demonstrates how coordinated poisoning across a multi-topic query space can manipulate the overall opinion generated by black-box Retrieval-Augmented Generation (RAG) systems.

DiscourseFlip: An Oblique Discourse-Level Opinion Manipulation Attack against Black-box Retrieval-Augmented Generation

The paper introduces DiscourseFlip, a novel black-box, graph-guided attack that manipulates opinions across an entire multi-topic query network, demonstrating a significant leap in scope and effectiveness over existing RAG attack methods.

Scaling Agentic Capabilities via Grounded Interaction Synthesis

The paper introduces Grounded Agentic Interaction Synthesis (GAIS), a framework that generates high-quality, diverse, and complex agentic training data by anchoring tasks to real-world protocols, significantly improving base model performance.

Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning

The paper introduces State-Adaptive Prompt Optimization (SAPO), a novel training strategy that treats prompts as dynamic variables to achieve robust fine-tuning, significantly mitigating catastrophic forgetting and improving generalization in LLMs.

Sequential Data Poisoning in LLM Post-Training

The paper introduces the threat model of sequential data poisoning, demonstrating that multiple, collaborating attackers can exploit compound vulnerabilities in LLM post-training pipelines that are invisible when analyzing individual stages.

Reinforcement learning to improve large language model-based automated code compliance systems

This paper presents P4IR, a two-stage framework for automated code compliance using supervised fine-tuning and Group Relative Policy Optimization, achieving significant improvements over baselines and leading LLMs.

Alignment Is All You Need For X-to-4D Generation

This paper introduces Align4D, a framework for generating coherent video-3D pairs using any-modal input, achieving state-of-the-art quality and consistency in X-to-4D generation.

Papers

cs.CV | Empirical | Recent | Jul 2, 2026

Alignment Is All You Need For X-to-4D Generation

Qiaowei Miao, Kehan Li, Yawei Luo, Yi Yang

This paper introduces Align4D, a framework for generating coherent video-3D pairs using any-modal input, achieving state-of-the-art quality and consistency in X-to-4D generation.

cs.SE | cs.AI | cs.CL | Empirical