Yanfeng Wang

5 indexed papers

Top categories

AI ×3, Info Retrieval ×2, NLP ×2, Vision ×1, ML ×1, Image and Video Processing ×1

Frequent co-authors

Ziyang Cheng 2×

Yu Wang 2×

Tengfei Zhang 1×

Ziheng Zhao 1×

Lisong Dai 1×

Xiaoman Zhang 1×

Research Timeline

2026

Agentic Active Omni-Modal Perception for Multi-Hop Audio-Visual Reasoning

The paper introduces MOV-Bench, a challenging benchmark for multi-hop audio-visual reasoning, and proposes AOP-Agent, an agentic framework that significantly improves open-source Omni-LLMs' ability to perform active cross-modal perception.

SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents

The paper introduces SkillBrew, a multi-objective framework that treats skill bank curation as a constrained optimization problem to build efficient and well-curated skill repositories for LLM agents.

LaSR: Context-Aware Speech Recognition via Latent Reasoning

The paper proposes LaSR, a context-aware training paradigm that uses latent reasoning to significantly improve speech recognition, especially for specialized terminology, without adding latency.

MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation

The paper introduces MCP-Persona, a novel benchmark designed to evaluate LLM agents' performance on real-world, personalized applications using the Model Context Protocol (MCP), revealing that current state-of-the-art agents struggle with such personalized tool use.

A Vision-language Framework for Comparative Reasoning in Radiology

This paper introduces MedReCo and MedReCo-VLM, a framework that enables entity-aware cross-image reasoning for medical imaging, allowing AI to compare current scans with prior studies and analogous cases based on structured clinical reports.

Papers

cs.CV, cs.IR, cs.LG · Recent · Jun 4, 2026

A Vision-language Framework for Comparative Reasoning in Radiology

Tengfei Zhang, Ziheng Zhao, Lisong Dai, Xiaoman Zhang +4 more

cs.AI · Recent · Jun 1, 2026