Sherzod Hakimov

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

NLP×2Vision×1AI×1Robotics×1

Frequent co-authors

David Schlangen2×

Mattia D'Agostini1×

Ivan Samodelkin1×

Chalamalasetti Kranti1×

Research Timeline

2026

Multi-Turn Multi-Agent Dialogue for Collaborative Reconstruction Improves VLM Performance on Spatial Reasoning, But Only Barely

The paper evaluates the performance of Vision-Language Models (VLMs) in a collaborative dialogue task requiring spatial reconstruction, finding that while detailed text representations improve results, the models still struggle with complex visual spatial reasoning.

The Image Reconstruction Game: Drawing Common Ground Through Iterative Multimodal Dialogue

The paper introduces the Image Reconstruction Game, a benchmark showing that the quality of the descriptive model is the primary determinant of image reconstruction success, while the generator's role is secondary.

Highlighted terms show continued research focus across papers

Papers

cs.CVcs.AIcs.CLRecentJun 1, 2026

The Image Reconstruction Game: Drawing Common Ground Through Iterative Multimodal Dialogue

Sherzod Hakimov, Mattia D'Agostini, Ivan Samodelkin, David Schlangen

View →

cs.CLcs.RORecentMay 29, 2026