Jaemin Cho
2 indexed papers
Research Timeline
This paper demonstrates that Large Language Models (LLMs) can serve as accurate and selective surrogates for costly GPU kernel performance measurements, significantly expanding the search space for optimizing deep learning kernels.
This paper introduces Imaginative Perception Tokens (IPT) to improve spatial reasoning in vision-language models.
Papers
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models
Mahtab Bigverdi, Lindsey Li, Weikai Huang, Yiming Liu +7 more
This paper introduces Imaginative Perception Tokens (IPT) to improve spatial reasoning in vision-language models.