Eduardo Calò, Lydia Penkert, Saad Mahamood (2025). Lessons from a User Experience Evaluation of NLP Interfaces. To appear in the findings of NAACL 2025.
Patrícia Schmidtová, Saad Mahamood, Simone Balloccu et al. (2024). Automatic Metrics in Natural Language Generation: A survey of Current Evaluation Practices. Proceedings of INLG 2024. (Awarded Best Evaluation Paper)
Srinivas Ramesh Kamath, Fahime Same, Saad Mahamood (2024). Generating Hotel Highlights from Unstructured Text using LLMs. Proceedings of INLG 2024.
Marcel Nawrath, Agnieszka Nowak, Tristan Ratz et al. (2024). On the Role of Summary Content Units in Text Summarization Evaluation. Published at NAACL 2024.
Saad Mahamood (2024). ReproHum #0124-03: Reproducing Human Evaluations of end-to-end approaches for Referring Expression Generation. Published at HumEval '24 at LREC-COLING 2024.
Saad Mahamood (2023). Reproduction of Human Evaluations in: “It’s not Rocket Science: Interpreting Figurative Language in Narratives”. Published at HumEval '23.
Lining Zhang, João Sedoc, Simon Mille et al. (2023). Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization. Proceedings of ACL 2023.
Kaustubh Dhole, Varun Gangal, Sebastian Gehrmann et al. (2023). NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation. Northern European Journal of Language Technology (NEJLT).
Emiel van Miltenburg, Miruna Clinciu, Ondřej Dušek et al. (2023). Barriers and enabling factors for error analysis in NLG research. Northern European Journal of Language Technology (NEJLT).
Research Experience
Elected to serve on the board for the ACL Special Interest Group for Natural Language Generation (SIGGEN) since January 2025; helps maintain the open source SimpleNLG realiser project on GitHub.
Background
AI Lead with expertise in natural language generation; working remotely for Shopware in Düsseldorf, Germany; leads a team of data scientists to build meaningful applied solutions, while actively participating in research.