In authors or contributors

1 resource

  • Aishwarya Agrawal, Dhruv Batra, Devi Par... | Oct 27th, 2016 | journalArticle

    Recently, a number of deep-learning based models have been proposed for the task of Visual Question Answering (VQA). The performance of most models is clustered around 60-70%. In this paper we propose systematic methods to analyze the behavior of these models as a first step towards recognizing their strengths and weaknesses, and identifying the most fruitful directions for progress. We analyze two models, one each from two major classes of VQA models -- with-attention and without-attention...

Last update from database: 27/10/2025, 21:15 (UTC)