News

Abstract: Knowledge-based visual question answering (VQA) requires external knowledge beyond the image to answer the question ... Recent works have resorted to using a powerful large language model ...