Skip to main content
Hsuvas Borkakoty

Mr Hsuvas Borkakoty

(he/him)

Teams and roles for Hsuvas Borkakoty

Overview

I am a PhD student under the Supervision of Dr. Luis Espinosa-Anke. My area of research is to understand how Wikipedia can attribute to detect temporal changes in NLP, as well as how we can use NLP to improve content reliability of Wikipedia.

Publication

2024

  • Myung, J. et al. 2024. BLEND: A benchmark for LLMs on everyday knowledge in diverse cultures and languages. Presented at: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks, Vancouver, BC, Canada, 9-15 December 2024 Presented at Globerson, A. et al. eds.NeurIPS Proceedings: Advances in Neural Information Processing Systems, Vol. 37. Curran Associates, Inc. pp. 78104-78146.
  • Borkakoty, H. and Espinosa-Anke, L. 2024. HOAXPEDIA: A unified Wikipedia hoax articles dataset. Presented at: 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, 12-16 November 2024 Presented at Lucie-Aimée, L. et al. eds.Proceedings of the First Workshop on Advancing Natural Language Processing for Wikipedia. Association for Computational Linguistics pp. 53–66.

2023

Conferences

  • Myung, J. et al. 2024. BLEND: A benchmark for LLMs on everyday knowledge in diverse cultures and languages. Presented at: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks, Vancouver, BC, Canada, 9-15 December 2024 Presented at Globerson, A. et al. eds.NeurIPS Proceedings: Advances in Neural Information Processing Systems, Vol. 37. Curran Associates, Inc. pp. 78104-78146.
  • Borkakoty, H. and Espinosa-Anke, L. 2024. HOAXPEDIA: A unified Wikipedia hoax articles dataset. Presented at: 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, 12-16 November 2024 Presented at Lucie-Aimée, L. et al. eds.Proceedings of the First Workshop on Advancing Natural Language Processing for Wikipedia. Association for Computational Linguistics pp. 53–66.
  • Borkakoty, H. and Espinosa-Anke, L. 2023. WIKITIDE: A Wikipedia-based timestamped definition pairs dataset. Presented at: R A N L P 2 0 2 3 International conference recent advances in natural language processing, 4-6 September 2023Proceedings of Recent Advances in Natural Language Processing. Shoumen, Bulgaria: INCOMA Ltd pp. 207-216., (10.26615/978-954-452-092-2_023)

Research

My topic of research is to study Wikipedia as a temporal data source for Large Language Models and to study Large Language Models for content moderation in Wikipedia. Apart from my topic, I have also collaborated with my peers in CardiffNLP group as well as peers from different universities on topics such as Understanding metaphors and analogies, Developing Cultural Benchmark for LLMs, and Detection of temporal change in Social Media data.

Teaching

CMT307 (PGR Demonstrator): Applied Machine Learning

CMT316 (PGR Demonstrator): Applied Machine Learning- Computer Vision and Natural Language Processing

CM2307 (PGR Demonstrator): Algorithms and Data Structure

Contact Details

Research themes

Specialisms

  • Natural language processing