Mr Jose Camacho Collados
Lecturer
School of Computer Science and Informatics
- CamachoColladosJ@caerdydd.ac.uk
- +44 29208 79108
- Abacws, Room 5.68, Senghennydd Road, Cathays, Cardiff, CF24 4AG
Overview
I am a UKRI Future Leaders Fellow and a Lecturer at the School of Computer Science and Informatics of Cardiff University. Previously, I was a Google Doctoral Fellow in the area of Natural Language Processing and completed my PhD at Sapienza University of Rome.
My main research interest is Natural Language Processing (NLP), where I have worked in different areas such as semantics, multilinguality and social media.
Please check my personal website for more details.
Publications
2023
- Doval, Y., Camacho Collados, J., Espinosa-Anke, L. and Schockaert, S. 2023. Meemi: a simple method for post-processing and integrating cross-lingual word embeddings. Natural Language Engineering 29(3), pp. 746-768. (10.1017/S1351324921000280)
- Owen, D., Antypas, D., Hassoulas, A., Pardinas, A., Espinosa-Anke, L. and Camacho Collados, J. 2023. Enabling early health care intervention by detecting depression in users of web-based forums using Language models: longitudinal analysis and evaluation. JMIR AI 2, article number: e41205. (10.2196/41205)
2022
- Loureiro, D., Mário Jorge, A. and Camacho-Collados, J. 2022. LMMS reloaded: Transformer-based sense embeddings for disambiguation and beyond. Artificial Intelligence 305, article number: 103661. (10.1016/j.artint.2022.103661)
2021
- Camacho Collados, J., Liberatore, F. and Ushio, A. 2021. Back to the basics: a quantitative analysis of statistical and graph-based term weighting schemes for keyword extraction. Presented at: EMNLP 2021 Conference, online and at Punta Cana, Dominican Republic, 7-11 November 2021. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 8089-8103.
- Ushio, A., Espinosa-Anke, L., Schockaert, S. and Camacho Collados, J. 2021. BERT is to NLP what AlexNet is to CV: can pre-trained language models identify analogies? Presented at: 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Bangkok, Thailand, 1-6 August 2021.
- Li, N., Bouraoui, Z., Camacho Collados, J., Espinosa-Anke, L., Gu, Q. and Schockaert, S. 2021. Modelling general properties of nouns by selectively averaging contextualised embeddings. Presented at: 30th International Joint Conference on Artificial Intelligence (IJCAI 2021), Virtual, 21-26 August 2021.
- Ushio, A., Camacho Collados, J. and Schockaert, S. 2021. Distilling relation embeddings from pre-trained language models. Presented at: 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), Punta Cana, Dominican Republic, 7-11 November 2021.
2020
- Chiang, H., Camacho-Collados, J. and Pardos, Z. 2020. Understanding the source of semantic regularities in word embeddings. Presented at: SIGNLL Conference Computational Natural Language Learning (CoNLL 2020), Virtual, 19-20 November 2020. Proceedings of the 24th Conference on Computational Natural Language Learning. Association for Computational Linguistics, pp. 119-131.
- Bouraoui, Z., Camacho Collados, J. and Schockaert, S. 2020. Inducing relational knowledge from BERT. Presented at: Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), New York, NY, USA, 7-12 February 2020. Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34, No. 5, pp. 7456-7463. (10.1609/aaai.v34i05.6242)
- Bouraoui, Z., Camacho Collados, J., Espinosa-Anke, L. and Schockaert, S. 2020. Modelling semantic categories using conceptual neighborhood. Presented at: Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), New York, NY, USA, 7-12 February 2020.
- Camacho Collados, J., Doval, Y., Martínez-Cámara, E., Espinosa-Anke, L., Barbieri, F. and Schockaert, S. 2020. Learning cross-lingual word embeddings from Twitter via distant supervision. Proceedings of the International AAAI Conference on Web and Social Media 14(1), pp. 72-82.
- Hee Lee, J., Camacho Collados, J., Espinosa-Anke, L. and Schockaert, S. 2020. Capturing word order in averaging based sentence embeddings. Presented at: European Conference on Artificial Intelligence (ECAI2020), Santiago de Compostela, Spain, 29 August - 2 September.
- Ito, T., Camacho Collados, J., Sakaji, H. and Schockaert, S. 2020. Learning company embeddings from annual reports for fine-grained industry characterization. Presented at: FinNLP-2020 @ IJCAI-PRICAI 2020: The Second Workshop on Financial Technology and Natural Language Processing, Yokohama, Japan, 11-13 July 2020.
- Owen, D., Camacho Collados, J. and Espinosa-Anke, L. 2020. Towards preemptive detection of depression and anxiety in Twitter. Presented at: Social Media Mining for Health Applications Workshop & Shared Task 2020, Barcelona, Spain, 8-13 December 2020.
- Tuxworth, D., Antypas, D., Espinosa-Anke, L., Camacho-Collados, J., Preece, A. and Rogers, D. 2020. Deriving disinformation insights from geolocalized Twitter callouts. Presented at: Workshop On Deriving Insights From User-Generated Text @KDD2021, 14-18 August 2021.
2019
- Sinoara, R., Camacho Collados, J., Rossi, R., Navigli, R. and Rezende, S. 2019. Knowledge-enhanced document embeddings for text classification. Knowledge-Based Systems 163, pp. 955-971. (10.1016/j.knosys.2018.10.026)
- Camacho Collados, J., Espinosa-Anke, L., Jameel, S. and Schockaert, S. 2019. A latent variable model for learning distributional relation vectors. Presented at: IJCAI-19: International Joint Conference on Artificial Intelligence, Macau, China, 10-16 August 2019.
- Camacho Collados, J., Espinosa-Anke, L. and Schockaert, S. 2019. Relational word embeddings. Presented at: 57th Annual Meeting of the Association for Computational Linguistics (ACL), Florence, Italy, 28 July - 2 August 2019.
2018
- Camacho Collados, J. and Pilehvar, M. T. 2018. From word to sense embeddings: a survey on vector representations of meaning. Journal of Artificial Intelligence Research 63, pp. 743-788. (10.1613/jair.1.11259)
- Barbieri, F. and Camacho-Collados, J. 2018. How Gender and Skin Tone Modifiers Affect Emoji Semantics in Twitter. Presented at: 7th Conference on Lexical and Computational Semantics (*SEM 2018), New Orleans, Louisiana, 5-6 June 2018. Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (*SEM). Stroudsburg, PA: The Association for Computational Linguistics, pp. 101-106. (10.18653/v1/S18-2011)
- Quijano-Sánchez, L., Liberatore, F., Camacho Collados, J. and Camacho-Collados, M. 2018. Applying automatic text-based detection of deceptive language to police reports: Extracting behavioral patterns from a multi-step classification model to understand how we lie to the police. Knowledge-Based Systems 149, pp. 155-168. (10.1016/j.knosys.2018.03.010)
2017
- Camacho Collados, J., Pilehvar, M. T., Collier, N. and Navigli, R. 2017. SemEval-2017 Task 2: Multilingual and cross-lingual semantic word similarity. Presented at: 11th International Workshop on Semantic Evaluations (SemEval-2017), Vancouver, Canada, 3rd-4th August 2017. Proceedings of the 11th International Workshop on Semantic Evaluations (SemEval-2017). Stroudsburg, PA: The Association for Computational Linguistics, pp. 15-26. (10.18653/v1/S17-2002)
- Delli Bovi, C., Camacho Collados, J., Raganato, A. and Navigli, R. 2017. EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text. Presented at: The 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, Canada, 30th July - 4th August 2017. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA: The Association for Computational Linguistics, pp. 594-600. (10.18653/v1/P17-2094)
- Pilehvar, M. T., Camacho Collados, J., Navigli, R. and Collier, N. 2017. Towards a seamless integration of word senses into downstream NLP applications. Presented at: The 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, Canada, 30th July - 4th August 2017. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg, PA: The Association for Computational Linguistics, pp. 1857-1869. (10.18653/v1/P17-1170)
- Mancini, M., Camacho Collados, J., Iacobacci, I. and Navigli, R. 2017. Embedding words and senses together via joint knowledge-enhanced training. Presented at: 21st Conference on Computational Natural Language Learning (CoNLL 2017), Vancouver, Canada, 3rd-4th August 2017. Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). Stroudsburg, PA: The Association for Computational Linguistics, pp. 100-111. (10.18653/v1/K17-1012)
2016
- Camacho Collados, J., Pilehvar, M. T. and Navigli, R. 2016. NASARI: integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities. Artificial Intelligence 240, pp. 36-64. (10.1016/j.artint.2016.07.005)
- Camacho Collados, J. and Navigli, R. 2016. Find the word that does not belong: a framework for an intrinsic evaluation of word vector representations. Presented at: 1st Workshop on Evaluating Vector Space Representations for NLP, Berlin, 12 August 2016. Proceedings of the 1st Workshop on Evaluating Vector Space Representations for NLP. Stroudsburg, PA: The Association for Computational Linguistics, pp. 43-50. (10.18653/v1/W16-2508)
2015
- Camacho Collados, J., Pilehvar, M. T. and Navigli, R. 2015. A framework for the construction of monolingual and cross-lingual word similarity datasets. Presented at: ACL-IJCNLP 2015: 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Beijing, China, 26-31 July 2015. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). Association for Computational Linguistics, pp. 1-7.
- Camacho Collados, J., Pilehvar, M. T. and Navigli, R. 2015. NASARI: A novel approach to a semantically-aware representation of items. Presented at: NAACL HLT 2015, Denver, CO, 31 May - 5 June. Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 567-577.
- Camacho Collados, J., Pilehvar, M. T. and Navigli, R. 2015. A unified multilingual semantic representation of concepts. Presented at: ACL-IJCNLP 2015, Beijing, China, 26-31 July 2015. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 741-751.
2014
- Camacho Collados, J., Billami, M. B., Jacquey, E. and Kister, L. 2014. Approche statistique pour le filtrage terminologique des occurrences de candidats termes en texte intégral. Presented at: JADT 2014, Paris, 3-6 June 2014. Proceedings of the 12th International Conference on the Statistical Analysis of Textual Data.
- Billami, M., Camacho Collados, J., Jacquey, E. and Kister, L. 2014. Annotation sémantique et validation terminologique en texte intégral en SHS. Presented at: TALN 2014, Marseille, 1-4 July 2014. Actes de la 21e conférence sur le Traitement Automatique des Langues Naturelles.
2013
- Camacho Collados, J. 2013. Splitting complex sentences for natural language processing applications: Building a simplified Spanish corpus. Procedia Social and Behavioral Sciences 95, pp. 464-472. (10.1016/j.sbspro.2013.10.670)
- Camacho Collados, J. 2013. Syntactic simplification for machine translation. BULAG: Bulletin de Linguistique Appliquée et Générale 38.
Teaching
I am currently teaching the following master's modules:
- CMT307: Applied Machine Learning, MSc Data Science and Analytics.
- CMT316: Applications of Machine Learning: Natural Language Processing and Computer Vision, MSc Artificial Intelligence.
Areas of supervision
I am currently supervising or co-supervising the following PhD students:
- Aleks Edwards, with Alun Preece and Hélène de Ribaupierre.
- David Owen, with Luis Espinosa-Anke.
- Joanne Boisson, with Luis Espinosa-Anke.
- Asahi Ushio, with Steven Schockaert.
- Dimosthenis Antypas, with Alun Preece.
- David Tuxworth, with Alun Preece, Luis Espinosa-Anke and Martin Innes.