Abstract: Scene-Text Visual Question Answering (ST-VQA) aims to understand scene text in images and answer questions related to the text content. Most existing methods heavily rely on the accuracy of ...
This repository contains the dataset and code for our paper, UAV Coarse Visual Localization in Large-Scale Continuous Scenes. Thank you for your kind attention. Build train and test sets using ...
Abstract: Investigating auditory perception and cognition in realistic, controlled environments is made possible by virtual reality (VR). However, when visual information is presented, sound ...