Hu Wei
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces CrystalXRD-Bench, a new benchmark designed to test Vision-Language Models (VLMs) on the complex task of identifying crystallographic Miller indices (HKLs) from rendered X-ray Diffraction (XRD) patterns, finding that current models struggle significantly with this multi-step scientific reasoning.
The paper introduces BilliardPhys-Bench, a new benchmark that demonstrates that current multimodal LLMs struggle with complex physical reasoning and predicting object dynamics in simulated environments.
Papers
BilliardPhys-Bench: Benchmarking Physical Reasoning and Visual Dynamics of Multimodal LLMs
Ben Wang, Xiaogang Li, Ruochen Gao, Peiyao Xiao +5 more
The paper introduces BilliardPhys-Bench, a new benchmark that demonstrates that current multimodal LLMs struggle with complex physical reasoning and predicting object dynamics in simulated environment…