Dr Nedjma Ousidhoum

: Available for postgraduate supervision

Teams and roles for Nedjma Ousidhoum

Lecturer
School of Computer Science and Informatics

I am a Lecturer (Assistant Professor) at the School of Computer Science and Informatics at Cardiff University. I lead the Cardiff NLP Group and am also a Visiting Academic at the University of Cambridge.

Previously, I was a Postdoctoral Research Associate at the University of Cambridge, working with Andreas Vlachos. I completed my PhD at the Hong Kong University of Science and Technology (HKUST), supervised by Yangqiu Song and Dit-Yan Yeung.

My research focuses on Natural Language Processing and Computational Social Science, particularly in automated fact-checking, human-centred NLP, bias (and related tasks), and low-resource languages. For more information, please check my personal webpage

Date
Type

2025

Rizvi, N. et al., 2025. From granular grief to binary belief: a collaborative optimization of annotation techniques for anti-autistic language. Proceedings of the ACM on Human-Computer Interaction 9 (7), pp.1-23. CSCW297. (10.1145/3757478)
Muhammad, S. H. et al., 2025. AfriHate: A multilingual collection of hate speech and abusive language datasets for African languages. Presented at: NAACL 2025 Albuquerque, New Mexico, USA 29 April - 4 May 2025. Published in: Chiruzzo, L. , Ritter, A. and Wang, L. eds. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies. Vol. 1.Albuquerque, New Mexico: Association for Computational Linguistics. , pp.1854-1871. (10.18653/v1/2025.naacl-long.92)
Winata, G. I. et al., 2025. Worldcuisines: a massive-scale benchmark for multilingual and multicultural visual question answering on global cuisines. Presented at: The 2025 Conference of the Nations of the Americas New Mexico, USA 29 April - 4 May 2025. Published in: Chiruzzo, L. , Ritter, A. and Wang, L. eds. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies. Vol. 1.USA: Association for Computational Linguistics. , pp.3242-3264. (10.18653/v1/2025.naacl-long.167)

2024

Myung, J. et al., 2024. BLEND: A benchmark for LLMs on everyday knowledge in diverse cultures and languages. Presented at: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks Vancouver, BC, Canada 9-15 December 2024. Published in: Globerson, A. et al., NeurIPS Proceedings: Advances in Neural Information Processing Systems. Vol. 37.Curran Associates, Inc.. , pp.78104-78146.
Antypas, D. et al. 2024. Words as trigger points in social media discussions. [Online].arXiv. (10.48550/arXiv.2405.10213)Available at: https://doi.org/10.48550/arXiv.2405.10213.
Ousidhoum, N. et al. 2024. SemRel2024: A collection of semantic textual relatedness datasets for 13 languages. Presented at: SemRel2024 Bangkok, Thailand 11-16 August 2024. Published in: Ku, L. , Martins, A. and Srikumar, V. eds. Findings of the Association for Computational Linguistics. Association for Computational Linguistics. , pp.2512 – 2530. (10.18653/v1/2024.findings-acl.147)
Ousidhoum, N. et al. 2024. SemEval Task 1: semantic textual relatedness for African and Asian languages. Presented at: The 18th International Workshop on Semantic Evaluation (SemEval-2024) Mexico City, Mexico 20 - 21 June 2024. Published in: Atul Kr., O. et al., Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024). Association for Computational Linguistics. , pp.1963 – 1978. (10.18653/v1/2024.semeval-1.272)

2023

Muhammad, S. et al., 2023. AfriSenti: a twitter sentiment analysis benchmark for African languages. Presented at: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP) Singapore 6 - 10 December 2023. Published in: Bouamor, H. , Pino, J. and Bali, K. eds. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. , pp.13968 – 13981. (10.18653/v1/2023.emnlp-main.862)
Schlichtkrull, M. , Ousidhoum, N. and Vlachos, A. 2023. The intended uses of automated fact-checking artefacts: why, how and who. Presented at: EMNLP 2023 Singapore December 2023. Published in: Bouamor, H. , Pino, J. and Bali, K. eds. Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics. , pp.8618 – 8642. (10.18653/v1/2023.findings-emnlp.577)

Publications (see Google Scholar for the full list)

Nedjma Ousidhoum, Meriem Beloucif, Saif M. Mohammad. Building Better: Avoiding Pitfalls in Developing Language Resources when Data is Scarce. ACL 2025.
Shamsuddeen Hassan Muhammad*, Nedjma Ousidhoum*, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas, Meriem Beloucif, Christine de Kock, Nirmal Surange, Daniela Teodorescu, Ibrahim Said Ahmad, David Ifeoluwa Adelani, Alham Fikri Aji, Felermino Ali, Ilseyar Alimova, Vladimir Araujo, Nikolay Babakov, Naomi Baes, Ana-Maria Bucur, Andiswa Bukula, Guanqun Cao, Rodrigo Tufino Cardenas, Rendi Chevi, Chiamaka Ijeoma Chukwuneke, Alexandra Ciobotaru, Daryna Dementieva, Murja Sani Gadanya, Robert Geislinger, Bela Gipp, Oumaima Hourrane, Oana Ignat, Falalu Ibrahim Lawan, Rooweither Mabuya, Rahmad Mahendra, Vukosi Marivate, Andrew Piper, Alexander Panchenko, Charles Henrique Porto Ferreira, Vitaly Protasov, Samuel Rutunda, Manish Shrivastava, Aura Cristina Udrea, Lilian Diana Awuor Wanzare, Sophie Wu, Florian Valentin Wunderlich, Hanif Muhammad Zhafran, Tianhui Zhang, Yi Zhou, Saif M. Mohammad. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages. ACL 2025, Best Resource Paper Award [Equal contribution].
Shamsuddeen Hassan Muhammad*, Nedjma Ousidhoum*, Idris Abdulmumin, Seid Muhie Yimam, Jan Philip Wahle, Terry Ruas, Meriem Beloucif, Christine De Kock, Tadesse Destaw Belay, Ibrahim Said Ahmad, Nirmal Surange, Daniela Teodorescu, David Ifeoluwa Adelani, Alham Fikri Aji, Felermino Ali, Vladimir Araujo, Abinew Ali Ayele, Oana Ignat, Alexander Panchenko, Yi Zhou, Saif M. Mohammad. SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection. SemEval 2025 (ACL 2025), Best Task Award [Equal contribution].
Naba Rizvi, Harper Strickland, Daniel Gitelman, Tristan Cooper, Alexis Morales-Flores, Michael Golden, Aekta Kallepalli, Akshat Alurkar, Haaset Owens, Saleha Ahmedi, Isha Khirwadkar, Imani Munyaka, Nedjma Ousidhoum. AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context. ACL 2025.
Genta Indra Winata, Frederikus Hudi, Patrick Amadeus Irawan, David Anugraha, Rifki Afina Putri, Yutong Wang, Adam Nohejl, Ubaidillah Ariq Prathama, Nedjma Ousidhoum, Afifa Amriani, Anar Rzayev, Anirban Das, Ashmari Pramodya, Aulia Adila, Bryan Wilie, Candy Olivia Mawalim, Ching Lam Cheng, Daud Abolade, Emmanuele Chersoni, Enrico Santus, Fariz Ikhwantri, Garry Kuwanto, Hanyang Zhao, Haryo Akbarianto Wibowo, Holy Lovenia, Jan Christian Blaise Cruz, Jan Wira Gotama Putra, Junho Myung, Lucky Susanto, Maria Angelica Riera Machin, Marina Zhukova, Michael Anugraha, Muhammad Farid Adilazuarda, Natasha Santosa, Peerat Limkonchotiwat, Raj Dabre, Rio Alexander Audino, Samuel Cahyawijaya, Shi-Xiong Zhang, Stephanie Yulia Salim, Yi Zhou, Yinxuan Gui, David Ifeoluwa Adelani, En-Shiun Annie Lee, Shogo Okada, Ayu Purwarianti, Alham Fikri Aji, Taro Watanabe, Derry Tanti Wijaya, Alice Oh, Chong-Wah Ngo. WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines. NAACL 2025, Best Theme Paper Award.
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, David Ifeoluwa Adelani, Ibrahim Said Ahmad, Saminu Mohammad Aliyu, Nelson Odhiambo Onyango, Lilian DA Wanzare, Samuel Rutunda, Lukman Jibril Aliyu, Esubalew Alemneh, Oumaima Hourrane, Hagos Tesfahun Gebremichael, Elyas Abdi Ismail, Meriem Beloucif, Ebrahim Chekol Jibril, Andiswa Bukula, Rooweither Mabuya, Salomey Osei, Abigail Oppong, Tadesse Destaw Belay, Tadesse Kebede Guge, Tesfa Tegegne Asfaw, Chiamaka Ijeoma Chukwuneke, Paul Röttger, Seid Muhie Yimam, Nedjma Ousidhoum. AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages.NAACL 2025.
Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Victor Gutierrez-Basulto, Yazmin Ibanez-Garcia, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages. NeurIPS 2024 (Benchmarks & Datasets Track), Best Non-Archival Paper Award, C3NLP Workshop (ACL 2024).
Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine De Kock, Oumaima Hourrane, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Krishnapriya Vishnubhotla, Seid Muhie Yimam, Saif M. Mohammad. SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages. SemEval 2024 (NAACL 2024), Best Task Description Paper – Honourable Mention.
Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine De Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata, Seid Muhie Yimam, Saif M. Mohammad. SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages. Findings of ACL 2024.
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani, Seid Muhie Yimam, Ibrahim Sa'id Ahmad, Meriem Beloucif, Saif M. Mohammad, Sebastian Ruder, Oumaima Hourrane, Pavel Brazdil, Felermino Ali, Davis David, Salomey Osei, Bello Shehu Bello, Falalu Ibrahim, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Baye Messelle, Hailu Beshada Balcha, Sisay Adugna Chala, Hagos Tesfahun Gebremichael, Bernard Opoku, Steven Arthur. AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.EMNLP 2023, Best Non-Archival Paper Award, AfricaNLP Workshop.
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Said Ahmad, Nedjma Ousidhoum, Abinew Ali Ayele, Saif M. Mohammad, Meriem Beloucif, Sebastian Ruder. SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval). SemEval 2023 (ACL 2023 Shared Task).
Nedjma Ousidhoum*, Zhangdie Yuan*, Andreas Vlachos. Varifocal Question Generation for Fact-Checking. EMNLP 2022. [Equal contribution]
Nedjma Ousidhoum. On the Importance and Challenges of the Experimental Design of Multilingual Toxic Content Detection. PhD Thesis, Hong Kong University of Science and Technology, 2021.
Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song, Dit-Yan Yeung. Probing Toxic Content in Large Pre-Trained Language Models. ACL-IJCNLP 2021.
Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung. Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets. EMNLP 2020.
Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-Yan Yeung. Multilingual and Multi-Aspect Hate Speech Analysis. EMNLP 2019.

I am a Lecturer (Assistant Professor) at the School of Computer Science and Informatics at Cardiff University. I lead the CardiffNLP group and am also a Visiting Academic at the University of Cambridge.

My research focuses on Natural Language Processing and Computational Social Science, particularly in automated fact-checking, toxic content detection, and low-resource languages.

For an up-to-date list of publications and CV, please visit my GitHub webpage or my Google Scholar.

Honours and awards

Best Resource Paper Award – ACL 2025
BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
Best Task Award – SemEval 2025 (co-located with ACL 2025)
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
Best Theme Paper Award – NAACL 2025
WORLDCUISINES: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Best Non-Archival Paper – C3NLP (co-located with ACL 2024)
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
Best Task Paper – Honourable Mention – SemEval 2024 (co-located with NAACL 2024)
SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages
Outstanding Senior Area Chair – EMNLP 2023
Best Non-Archival Paper – AfricaNLP (co-located with ICLR 2023)
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

Academic positions

June 2023 – Present Lecturer (Assistant Professor), School of Computer Science and Informatics, Cardiff University
April 2021 – June 2023 Postdoctoral Researcher, University of Cambridge (Advisor: Prof. Andreas Vlachos)
2014 – 2021 PhD Student in Computer Science, HKUST, Hong Kong
April 2014 – August 2014 Postgraduate Intern, Hong Kong University of Science and Technology (HKUST)
December 2012 – April 2014 Research Assistant, University of Science and Technology Houari Boumedienne (USTHB), Algiers, Algeria
2010 – 2012 Master in Software Engineering, USTHB, Algeria
2007 – 2010 Bachelor in Computer Science, USTHB, Algeria

Speaking engagements

Panelist, NLP for Positive Impact Workshop (co-located with ACL 2025)
NLP for Low-resource Languages – flash talk, Multilingualism in the Era of Artificial Intelligence Workshop, July 2024
NLP for Low-resource Languages Discussion, CollaborativeNLP Workshop, July 2024
On Benchmarking and Building Resources: The Inevitable and the Preventable Pitfalls, Queen Mary University of London (Seminar), March 2024
What Is Needed vs. What Is Built in NLP: Toxic Language Detection and Automated Fact-checking Models as Use Cases, Uppsala NLP Seminar, May 2023
What Is Needed vs. What Is Built in NLP: Toxic Language Detection and Automated Fact-checking Models as Use Cases, CohereAI Community Talks (Recording), May 2023
Expectations vs. Reality: Doing Multilingual Toxic Content Detection in NLP, Aston Institute of Forensic Linguistics, May 2023
What Is Needed vs. What Is Built in NLP: Toxic Language Detection and Automated Fact-checking Models as Use Cases, Cardiff NLP Seminar, January 2023
Being a Researcher in Arabic NLP, panel discussion at WiNLP Workshop (co-located with EMNLP 2022), December 2022
Arabic Toxic Content Detection in NLP, “Arabic AI and Toxic Online Content Detection” panel at IWABigDAI, May 2022
Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP, Language Technology Group Seminar (Hamburg Universität), February 2022
Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP, Cambridge NLIP Seminar, January 2022
Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP, MilaNLP Group Seminar (Bocconi University, Milan), September 2021
Challenges in Toxic Content Detection, Language and Multimodal AI Lab (LAMA) Seminar, Imperial College London, August 2021
Normalizing the Experimental Design of Multilingual Hate Speech Detection, Digital Technologies Research Center Seminar (National Research Council, Canada), November 2020

Committees and reviewing

Ethics Chair, EACL 2026
Co-organiser, MELT Workshop (co-located with COLM 2025)
Co-organiser, Cardiff NLP Workshop
Student Volunteer Chair, EMNLP 2025
Senior Area Chair, ACL 2025 (Multilinguality and Cross-Lingual NLP)
Area Chair, COLING 2025 (Ethics and Bias)
Action Editor, ACL Rolling Review; Area Chair, EMNLP 2024, EMNLP 2025, AACL 2025
Reviewer, Computational Linguistics and OSNEM journals
Area Chair (Low-resource and Endangered Languages), LREC-COLING 2024
Area Chair (Ethics in NLP), EACL 2024
Senior Area Chair, EMNLP 2023 (Outstanding SAC Award)
Diversity and Inclusion Chair, ACL 2023
Financial Accessibility Chair (part of D&I), NAACL 2022
Reviewer, ACL Rolling Review and major *CL conferences (ACL/EMNLP/…) 2019–2022

Natural Language

Contact Details

[email protected]
+44 29225 14939

Dr Nedjma Ousidhoum

Teams and roles for Nedjma Ousidhoum

Lecturer

2025

2024

2023

Articles

Conferences

Websites

Publications (see Google Scholar for the full list)

Honours and awards

Academic positions

Speaking engagements

Committees and reviewing

Contact Details

External profiles

Dr Nedjma Ousidhoum

Teams and roles for Nedjma Ousidhoum

Lecturer

Overview

Publication

2025

2024

2023

Articles

Conferences

Websites

Research

Publications (see Google Scholar for the full list)

Biography

Honours and awards

Academic positions

Speaking engagements

Committees and reviewing

Supervisions

Contact Details

External profiles