Yu Wang

50 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×18Crypto×14Vision×12NLP×10Robotics×4Software Eng.×4Distributed×3Multiagent×2

Frequent co-authors

Haoyu Wang7×

Ruoyu Wang3×

Xinyu Wang3×

Hao Ouyang3×

Qiuyu Wang3×

Yujun Shen3×

Research Timeline

2026

Chronos: A Physics-Informed Full-History Framework for Non-Markovian Long-Horizon Manipulation

This paper introduces Chronos, a physics-informed framework for non-Markovian long-horizon manipulation, which elevates observation history to the latent state of the policy dynamics and achieves higher success rates and fewer parameters than Markovian VLA baselines in both simulated and real-world experiments.

Energy-Aware Scheduling for Serverless LLM Serving on Shared GPUs

This paper presents Festina, a profiling-guided, power-aware control plane for minimizing energy consumption in serverless large language model (LLM) serving.

WorldDirector: Building Controllable World Simulators with Persistent Dynamic Memory

The paper introduces WorldDirector, a framework for creating controllable video worlds with persistent dynamic object memory and exact visual identities.

MentalThink: Shaping Thoughts in Mental SVG World

The paper introduces MentalThink, a visual-symbolic reasoning paradigm that equips Multimodal Language Models with an executable mechanism for mental visualization using SVG graphics.

ArchEval: Measuring AI Agents as Computer Architects

This paper introduces ArchEval, a benchmark and platform for evaluating LLM agents on computer architecture design and optimization.

Scaling Mixture-of-Experts Video Pretraining for Embodied Intelligence

This paper introduces LingBot-Video, a video pretraining paradigm for embodied intelligence using a DiT-based approach, Mixture-of-Experts framework, and extensive robot-oriented data.

Infinite Worlds with Versatile Interactions

The paper introduces LingBot-World 2.0, an advanced version of a language model with unbounded interaction horizon, rapid response time, diverse interactive elements, and agentic harness integration.

Motion-Conditioned Multi-View Fusion for Myocardial Infarction Localization from Echocardiography

Proposed MCF-Net framework fuses myocardial motion cues with foundation model representations for reliable segment-level MI localization in echocardiography.

AlphaOracle: Oracle bone script decipherment via human-workflow-inspired deep learning

Introduces AlphaOracle, a human-workflow-inspired framework for deciphering undeciphered oracle bone script characters using a large digitized corpus, reducing analysis time and agreeing with expert interpretations.

FinSAgent: Corpus-Aligned Multi-Agent RAG Framework for Evidence-Grounded SEC Filing Question Answering

This paper proposes FinSAgent, an evidence-grounded multi-agent framework for financial question answering over SEC filings, which improves retrieval coverage and answer correctness through corpus-side conditioning.

No Training, Better Flights: Test-Time Scaled VLMs for UAV Navigation

This paper proposes a test-time scaling approach for Vision-Language Models in Unmanned Aerial Vehicle navigation, enabling self-correction and generation of more accurate and reliable flight plans.

Defense Against LLM Backdoors using Critical Neuron Isolation Pruning

The paper introduces DeCNIP, a method for identifying and neutralizing backdoors in large language models using representational analysis and neuron isolation pruning.

CARA: Concept-Aware Risk Attention for Interpretable Collision Anticipation

The paper proposes CARA, an interpretable framework for collision anticipation in autonomous driving using domain-grounded risk concepts, aligning them with video frames, and organizing them into evolving concept trajectories.

IR275K: A Benchmark for Infrared Multi-Frame Super-Resolution Toward Efficient Remote Sensing

This paper introduces IR275K, a curated benchmark for multi-frame super-resolution in infrared remote sensing, and evaluates CGMamba, a lightweight state-space model, achieving state-of-the-art performance.

On the Runtime Analysis of Reinforcement Learning Hyper-Heuristics

This paper rigorously proves that a Reinforcement Learning Hyper-heuristic (RLHH) optimizes the LeadingOnes benchmark function with optimal expected runtime using two random local search operators, outperforming the Generalised Random Gradient HH.

Music-JEPA: Learning a World Model of Sound from Action

This paper proposes a method for learning a world model of piano sound using Joint Embedding Predictive Architectures (JEPA), treating music as an action-conditioned system.

Libra: Taming Attention Workload Skew in Long-Context LLM Training with Bounded Sequence Pool

Libra is a load balancing approach for long-context LLM training that groups packed sequences into fixed-size pools and reduces attention workload variance, improving end-to-end throughput and straggler-attention speedup.

Application-Driven Architecture Exploration for Cross-Layer Heterogeneous Systems

The paper presents CHASE, an application-driven framework that explores physically feasible Cross-layer Heterogeneous System architectures for executing workloads with diverse requirements.

Data Pyramid for Embodied Manipulation

This paper organizes embodied data sources for multimodal foundation models into a pyramid, focusing on real-robot, UMI-style, egocentric and exocentric, simulation, and general vision-language data.

CoRenew: A large language model agent-based policy simulation platform for multifamily residential redevelopment

The paper introduces CoRenew, an open-source platform for simulating negotiations among stakeholders in multifamily residential redevelopment using LLM-based agents, with validations against survey responses and real-life negotiation data.

Highlighted terms show continued research focus across papers

Papers

cs.MAEmpiricalRecentJul 28, 2026

CoRenew: A large language model agent-based policy simulation platform for multifamily residential redevelopment

Yudi Zhang, Yuming Lin, Li Tian, Yu Wang +1 more

View →

cs.ROcs.CVSurvey