Publications with the keyword: vision and language
- [14]
- Seeing past words: Testing the cross-modal capabilities of pretrained V&L models on counting tasks (Parcalabescu, L; Gatt, A; Frank, A and Calixto, I), In Proceedings of the Workshop Beyond Language: Multimodal Semantic Representations (MMSR'21), 2021.
- [13]
- Gradations of error severity in automatic image description (van Miltenburg, E; Lu, W-T; Krahmer, E; Gatt, A; Chen, G; Li, L and van Deemter, K), In Proceedings of the 13th International Conference on Natural Language Generation (INLG'20), Association for Computational Linguistics, 2020.
- [12]
- Transfer learning from language models to image caption generators: Better models may not transfer better (Tanti, M; Gatt, A and Camilleri, KP), arXiv preprint arXiv:1901.01216, 2019.
- [11]
- Quantifying the amount of visual information used by neural caption generators (Tanti, M; Gatt, A and Camilleri, K), In Computer Vision – ECCV 2018 Workshops: Proceedings of the Workshop on Shortcomings in Vision and Language (Leal-Taixé, L; Roth, S, eds.), Springer, 2019.
- [10]
- Pre-gen metrics: Predicting caption quality metrics without generating captions (Tanti, M; Gatt, A and Muscat, A), In Computer Vision – ECCV 2018 Workshops: Proceedings of the Workshop on Shortcomings in Vision and Language (Leal-Taixé, L; Roth, S, eds.), Springer, 2019.
- [9]
- Visually Grounded Generation of Entailments from Premises (Jafaritazehjani, S; Gatt, A and Tanti, M), In Proceedings of the 12th International Conference on Natural Language Generation (INLG'19), Association for Computational Linguistics, 2019.
- [8]
- Where to put the image in an image caption generator (Tanti, M; Gatt, A and Camilleri, K), Natural Language Engineering, volume 24, 2018.
- [7]
- Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions (Gatt, A; Tanti, M; Muscat, A; Paggio, P; Farrugia, R; Borg, C; Camilleri, K; Rosner, M and van der Plas, L), In Proceedings of the 11th edition of the Language Resources and Evaluation Conference (LREC'18), 2018.
- [6]
- Predicting visual spatial relations in the Maltese language (Muscat, A and Gatt, A), In Breaking Barriers: Junior College Multidisciplinary Conference, University of Malta Junior College, 2018.
- [5]
- Grounded textual entailment (Vutrong, H; Greco, C; Erofeeva, A; Jafaritazehjani, S; Linders, G; Tanti, M; Testoni, A; Bernardi, R and Gatt, A), In Proceedings of the 27th International Conference on Computational Linguistics (COLING'18), Association for Computational Linguistics, 2018.
- [4]
- What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator? (Tanti, M; Gatt, A and Camilleri, K), In Proceedings of the 10th International Conference on Natural Language Generation (INLG'17), Association for Computational Linguistics, 2017.
- [3]
- Reference Production as Search: The Impact of Domain Size on the Production of Distinguishing Descriptions (Gatt, A; Krahmer, E; van Deemter, K and van Gompel, RPG), Cognitive Science, volume 41, 2017.
- [2]
- Viewing time affects overspecification: Evidence for two strategies of attribute selection during reference production (Koolen, R; Gatt, A; van Gompel, RPG; Krahmer, E and van Deemter, K), In Proceedings of the 38th Annual Meeting of the Cognitive Science Society (CogSci'16), Cognitive Science Society, 2016.
- [1]
- Production of referring expressions: Preference trumps discrimination (Gatt, A; Krahmer, E; van Gompel, RPG and van Deemter, K), In Proceedings of the 35th Annual Meeting of the Cognitive Science Society (CogSci'13), Cognitive Science Society, 2013.