Vikas Chandra

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Vision×1AI×1

Frequent co-authors

Zhipeng Cai1×

Zhuang Liu1×

Yunyang Xiong1×

Zechun Liu1×

Yangyang Shi1×

Research Timeline

2026

VLM3: Vision Language Models Are Native 3D Learners

The paper proposes VLM3, a simple, scalable method that demonstrates standard Vision Language Models (VLMs) can natively learn 3D understanding by focusing on architectural simplicity and specific data techniques.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIRecentMay 28, 2026

VLM3: Vision Language Models Are Native 3D Learners

Zhipeng Cai, Zhuang Liu, Yunyang Xiong, Zechun Liu +2 more

View →