Yang Wang

16 indexed papers

Top categories

AI ×10, ML ×6, Crypto ×5, Vision ×3, NLP ×3, Sound ×1, Robotics ×1, Software Eng. ×1

Frequent co-authors

Yuyang Wang (2×)

Research Timeline

2026

FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation

FedFG introduces a robust federated learning framework using flow-matching generation to simultaneously enhance client privacy and defend against sophisticated poisoning attacks.

Dummy-Aware Weighted Attack (DAWA): Breaking the Safe Sink in Dummy Class Defenses

The paper introduces Dummy-Aware Weighted Attack (DAWA), a novel evaluation method that significantly reduces the reported robustness of Dummy Classes-based defenses by simultaneously targeting both the true and dummy class labels.

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

The paper introduces ProjLens, an interpretability framework showing that backdoor vulnerabilities in Multimodal Large Language Models (MLLMs) are encoded within a low-rank subspace of the projector, causing a measurable semantic shift in poisoned inputs.

When Are LLM Inferences Acceptable? User Reactions and Control Preferences for Inferred Personal Information

This study investigates how users react to personal information inferred from their own ChatGPT histories, finding that acceptability is governed by context-sensitive norms about how an inference is generated, retained, and transmitted, not by the inference content alone.

Universal Graph Backdoor Defense: A Feature-based Homophily Perspective

The paper proposes a universal graph backdoor defense framework that addresses feature-based graph backdoor attacks, which are more challenging than traditional subgraph-based attacks, by leveraging local feature consistency.

FedMPT: Federated Multi-label Prompt Tuning of Vision-Language Models

FedMPT introduces a novel federated learning framework for Multi-Label Recognition (MLR) using Vision-Language Models (VLMs) by leveraging generalizable conditions to mitigate label overfitting and improve robustness.

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

The paper introduces Harness-Bench, a diagnostic benchmark that measures how different system 'harnesses' affect LLM agent performance in realistic workflows, showing that agent capability must be reported at the model-harness configuration level.

What drives performance in molecular MPNNs? An operator-level factorial benchmark

The paper introduces an operator-level factorial benchmark for molecular MPNNs, finding that message construction (specifically concatenation-based mixing) is the primary determinant of performance, rather than the complexity of the node update mechanism.

DeMaVLA: A Vision-Language-Action Foundation Model for Generalizable Deformable Manipulation

DeMaVLA is a generalizable Vision-Language-Action foundation model designed for deformable object manipulation, achieving strong real-world performance on folding tasks by leveraging large-scale real-world data and corrective learning.

Not All Synthetic Data Is Yours to Learn From

Weak self-training on synthetic data can amplify a language model's existing capabilities, but this effect is strictly dependent on the compatibility between the source and student models, not on the data's intrinsic quality.

MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding

The paper introduces MechVQA, a comprehensive dataset and benchmark for mechanical drawing understanding, and proposes the MechVL model, which significantly improves Multimodal LLMs' performance on these specialized tasks.

Knowledge Boundary Probing and Demand-Guided Intervention for LLM-Based Power System Code Generation

The paper addresses the reliability of open-weight LLMs for power system code generation by identifying structured API-knowledge boundary errors and proposing a boundary-aware intervention that significantly boosts accuracy without fine-tuning.

Understanding LLM Behavior in Multi-Target Cross-Lingual Summarization

The paper introduces a new benchmark for multi-target cross-lingual summarization (MTXLS) and proposes an activation steering method that significantly improves LLM performance by guiding the generation process using English representations.

TabPrep: Closing the Feature Engineering Gap in Tabular Benchmarks

The paper introduces TabPrep, a feature engineering pipeline that systematically improves performance across various tabular machine learning models by addressing structural data patterns ignored by current benchmarks.

HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image

HumanNOVA introduces a photorealistic, universal, and rapid model capable of generating high-quality 3D human avatars from a single input RGB image.

MOSS-Audio Technical Report

MOSS-Audio is a unified audio-language model designed for comprehensive understanding of speech, environmental sounds, and music, achieving strong performance across various audio-grounded tasks.

Papers

cs.LG · Recent · Jun 1, 2026

TabPrep: Closing the Feature Engineering Gap in Tabular Benchmarks

Andrej Tschalzev, Nick Erickson, Yuyang Wang, Huzefa Rangwala +3 more

cs.CV · Recent · Jun 1, 2026