AVQA: Merging Sound andAVQA: Merging Sound andVisionto answer questions.AI systems connecting audio and visualsComputer Vision and Pattern RecognitionAudio-Visual Question Answering: Bridging Sound and SightAVQA connects audio and visual elements in videos to answer questions.2025-10-09T23:47:12+00:00 ― 6 min read