Yangyu Huang*, Tianyi Gao*, Haoran Xu, Qihao Zhao, Yang Song, Zhipeng Gui, Tengchao Lv, Lei Cui, Scarlett Li, Furu Wei
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
Microsoft Foundry Labs Project
We introduce GeoMap-Bench, a vision-language benchmark for geologic map understanding. It consists of 25 task types, which measure abilities across 5 aspects. Our benchmark reveals a significant performance gap between state-of-the-art MLLMs and human experts, we further explore agentic baselines to improve the performance.