VisVM: Scaling Inference-time Search with Vision Value Model for Improved Visual Comprehension

Year
2025
Type(s)
Author(s)
Xiyao Wang and Zhengyuan Yang and Linjie Li and Hongjin Lu and Yuancheng Xu and Chung-Ching Lin and Kevin Lin and Furong Huang and Lijuan Wang
Source
In International Conference on Computer Vision (ICCV), 2025
Url
https://arxiv.org/abs/2412.03704
BibTeX