Visual inputs during natural perception are highly ambiguous: objects are frequently occluded, lighting conditions vary, and object identification depends significantly on prior experiences. However, ...
Abstract: The use of visuals as collaboration catalysts has recently gained attention in research on group work, knowledge management, sense making, and collaboration in general. A special feature of ...
description [ICCV 2025][Multimodal VLM][Visual Question Answering] This work is the first to formally define and systematically investigate **focus ambiguity** in visual question answering — the ...