publications
Publications with the keyword: evaluation
View all publications
- [29]
- Gradations of error severity in automatic image description (van Miltenburg, E; Lu, W-T; Krahmer, E; Gatt, A; Chen, G; Li, L and van Deemter, K), In Proceedings of the 13th International Conference on Natural Language Genration (INLG'20), Association for Computational Linguistics, 2020.
- [28]
- Unmasking Contextual Stereotypes: Measuring and Mitigating BERT's Gender Bias (Bartl, M; Nissim, M and Gatt, A), In Proceedings of the 2nd Workshop on Gender Bias in Natural Language Processing (GeBNLP 2020), Association for Computational Linguistics, 2020.
- [27]
- On the interaction of automatic evaluation and task framing in headline style transfer (De Mattei, L; Cafagna, M; Lai, H; Nissim, M; Dell'Orletta, F and Gatt, A), In Proceedings of the 1st Workshop on Evaluating NLG Evaluation (EvalNLGEval'20), Association for Computational Linguistics, 2020.
- [26]
- Human evaluation of automatically generated text: Current trends and best practice guidelines (van der Lee, C; Gatt, A; van Miltenburg, E and Krahmer, E), Computer Speech and Language, volume 67, 2020.
- [25]
- CHANGE-IT: Change headlines, adapt news, generate (De Mattei, L; Cafagna, M; Dell'Orletta, F; Nissim, M and Gatt, A), In Proceedings of the 7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA'20), 2020.
- [24]
- Best Practices for the Human Evaluation of Automatically Generated Text (van der Lee, C; Gatt, A; van Miltenburg, E; Wubben, S and Krahmer, E), In Proceedings of the 12th International Conference on Natural Language Generation (INLG'19), Association for Computational Linguistics, 2019.
- [23]
- Generation of referring expressions: Assessing the incremental algorithm (van Deemter, K; Gatt, A; van der Sluis, I and Power, R), Cognitive science, volume 36, 2012.
- [22]
- Assessing the Incremental Algorithm: a Response to Krahmer et al. (van Deemter, K; Gatt, A; van der Sluis, I and Power, R), Cognitive science, volume 36, 2012.
- [21]
- A repository of data and evaluation resources for natural language generation (Belz, Anja and Gatt, Albert), In Proceedings of the 8th Language Resources and Evaluation Conference (LREC'12), ELRA, 2012.
- [20]
- What is in a text and what does it do: Qualitative Evaluations of an NLG system–the BT-Nurse–using content analysis and discourse analysis. (Sambaraju, R; Reiter, E; Logie, R; McKinlay, A; McVittie, C; Gatt, A and Sykes, C), In Proceedings of the 13th European Workshop on Natural Language Generation (ENLG'11), Association for Computational Linguistics, 2011.
- [19]
- Textual properties and task based evaluation: investigating the role of surface properties, structure and content (Gatt, A and Portet, F), In Proceedings of the 6th International Natural Language Generation Conference (INLG'10), Association for Computational Linguistics, 2010.
- [18]
- Introducing shared tasks to NLG: the TUNA shared task evaluation challenges (Gatt, Albert and Belz, Anja), Chapter in Empirical methods in natural language generation: Data-oriented methods and empirical evaluation (Krahmer, E.; Theune, M., eds.), Springer-Verlag, 2010.
- [17]
- Generating referring expressions in context: The GREC task evaluation challenges (Belz, A; Kow, E; Viethen, J and Gatt, A), Chapter in Empirical methods in natural language generation (Krahmer, E.; Theune, M., eds.), Springer, 2010.
- [16]
- Beyond DICE: Measuring the quality of a referring expression (van Deemter, K and Gatt, A), In Proceedings of the Workshop on Production of Referring Expressions: Bridging Computational and Psycholinguistic Approaches (PRE-CogSci'09), 2009.
- [15]
- A hearer-oriented evaluation of referring expression generation (Khan, IH; van Deemter, K; Ritchie, G; Gatt, A and Cleland, AA), In Proceedings of the 12th European Workshop on Natural Language Generation, Association for Computational Linguistics, 2009.
- [14]
- The TUNA-REG Challenge 2009: Overview and evaluation results (Gatt, Albert; Belz, Anja and Kow, Eric), In Proceedings of the 12th European Workshop on Natural Language Generation (ENLG'09), Association for Computational Linguistics, 2009.
- [13]
- Text Content and Task Performance in the Evaluation of a Natural Language Generation System. (Gatt, A and Portet, F), In Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP'09), 2009.
- [12]
- Towards a balanced corpus of multimodal referring expressions in dialogue (van der Sluis, I; Piwek, P; Gatt, A and Bangerter, A), In Proceedings of the AISB 2008 Convention Communication, Interaction and Social Intelligence, AISB, 2008.
- [11]
- The importance of narrative and other lessons from an evaluation of an NLG system that summarises clinical data (Reiter, Ehud; Gatt, Albert; Portet, François and Van Der Meulen, Marian), In Proceedings of the 5th International Natural Language Generation Conference (INLG'08), Association for Computational Linguistics, 2008.
- [10]
- Xml format guidelines for the tuna corpus (Gatt, A; van der Sluis, I and van Deemter, K), Technical report, Technical report, Dept of Computing Science, University of Aberdeen, 2008.
- [9]
- The TUNA Challenge 2008: Overview and evaluation results (Gatt, Albert; Belz, Anja and Kow, Eric), In Proceedings of the 5th International Natural Language Generation Conference (INLG'08), Association for Computational Linguistics, 2008.
- [8]
- Attribute selection for referring expression generation: New algorithms and evaluation methods (Gatt, Albert and Belz, Anja), In Proceedings of the 5th International Conference on Natural Language Generation (INLG'08), Association for Computational Linguistics, 2008.
- [7]
- Intrinsic vs. extrinsic evaluation measures for referring expression generation (Belz, Anja and Gatt, Albert), In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL'08), Association for Computational Linguistics, 2008.
- [6]
- Evaluating algorithms for the generation of referring expressions: Going beyond toy domains (Van der Sluis, Ielka; Gatt, Albert and Van Deemter, Kees), In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP'07), RANLP, 2007.
- [5]
- Content determination in GRE: Evaluating the evaluator (van Deemter, Kees and Gatt, Albert), In Proceedings of the 2nd UCNLG Workshop: Language Generation and Machine Translation, Association for Computational Linguistics, 2007.
- [4]
- Evaluating algorithms for the generation of referring expressions using a balanced corpus (Gatt, A; van der Sluis, I and vann Deemter, K), In Proceedings of the 11th European Workshop on Natural Language Generation (ENLG'07), Association for Computational Linguistics, 2007.
- [3]
- Corpus-based evaluation of Referring Expressions Generation (Gatt, Albert; Van Der Sluis, Ielka and Van Deemter, Kees), In Proceedings of the Workshop on Shared Tasks and Comparative Evaluation in NLG, 2007.
- [2]
- The attribute selection for GRE challenge: Overview and evaluation results (Belz, Anja and Gatt, Albert), In Proceedings of UCNLG+MT: Language Generation and Machine Translation, Association for Computational Linguistics, 2007.
- [1]
- Building a semantically transparent corpus for the generation of referring expressions (van Deemter, K; van der Sluis, I and Gatt, A), In Proceedings of the 4th International Natural Language Generation Conference (INLG'06), Association for Computational Linguistics, 2006.