Dr Dawn Knight BA, MA, PhD (Nottingham), FLSW
Reader
School of English, Communication and Philosophy
- KnightD5@caerdydd.ac.uk
- +44 29208 76325
- John Percival Building, Room 3.57, Colum Drive, Cardiff, CF10 3EU
- Available for postgraduate supervision
Overview
I am a member of the Centre for Language and Communication Research.
I am the Principal Investigator on a 3½-year, £1.8 million ESRC/AHRC-funded interdisciplinary and multi-institutional project which began in March 2016. The project, entitled ‘CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction’, will create a large-scale, open-source corpus of contemporary Welsh.
The creation of CorCenCC is community-driven with impact being generated through a user-informed design, harnessing opportunities afforded by mobile technologies, specifically crowdsourcing (via an app) and community collaboration. Academic partners (CIs) on this project include colleagues from Cardiff, Swansea, Lancaster and Bangor Universities. More information is available on our website.
Other contributors and collaborators include software engineers, Welsh language experts and a range of external stakeholders including the Welsh Government, National Assembly for Wales, Gwasg y Lolfa, Welsh for Adults, Welsh Joint Education Committee and University of Wales Dictionary of the Welsh Language.
Professional memberships
Editorial positions and other external activity
- Member of the ESRC’s Centres for Doctoral Training (CDT) Peer Review College (2016+).
- General Secretary for BAAL, the British Association for Applied Linguistics (2013 - present). BAAL is a professional association based in the UK (with an international professional membership of just under 1000 members), which provides a forum for people interested in language and applied linguistics. Responsibilities of this post include: communicating messages from outside bodies/individuals to the Chair, the EC or the membership (e.g. notice of meetings, research opportunities, etc.); responding where necessary or raising issues with the EC/Chair; and writing letters on behalf of BAAL to various bodies.
- Former Meetings Secretary for BAAL (2010-2013), where I was responsible for coordinating the organisation of the Association's annual conference. Prior to that, I was the Postgraduate Development and Liaison Officer for BAAL (2007-2009).
- Co-organiser of the IVACS (Inter-Varietal and Applied Corpus Studies) 2006 and IVACS 2014 conferences.
- Editor (with Professor Svenja Adolphs) of the Routledge Handbook of English Language and the Digital Humanities [under contract].
- Reviews Editor for the Yearbook of Corpus Linguistics and Pragmatics, 2012-2015 (Springer Verlag).
- Reviewer for International Journal of Corpus Linguistics (IJCL), Journal of Pragmatics, Context and Discourse, Corpora Journal and the BAAL annual book prize.
- Programme committee member: Big Data and Natural Language Processing workshop hosted at IEEE Big Data, December 2016.
- Programme committee member: 9th International Corpus Linguistics conference, July 2017, University of Birmingham; Challenges in the Management of Large Corpora + Big Data and Natural Language Processing joint meeting, July 2017, University of Birmingham.
- Advisory Editorial Board member for the Journal of Corpus Linguistics and Pragmatics (Springer Verlag).
- Advisory board member for Language, Texts and Society (LTS) – a journal produced at the University of Nottingham.
- Advisory board member for CLiC – a corpus tool for the analysis of literary texts, led by Professor Mahlberg, University of Birmingham (funded by the AHRC).
Membership of professional bodies and learned societies
- Associate Fellow of the Higher Education Academy (AFHEA), 2013 – present.
- Member, BAAL (British Association for Applied Linguistics).
- Executive Committee member, CRiLLS (Centre for Research in Linguistics and Language Sciences, Newcastle University), 2011 – 2015.
- Member, CRAL (Centre for Research in Applied Linguistics), 2006 – 2011.
- Member, IVACS (Inter-Varietal Applied Corpus Studies), 2004 – present
- Member, AILA (International Association of Applied Linguistics), 2004 – present
- Member, Language Teaching and Technology; Language Learning and Teaching and iLaB (ICT) research clusters in ECLS, 2012 – 2015.
Previous academic positions
- 2016 – present: Reader in Applied Linguistics, Cardiff University
- 2015 – 2016: Senior Lecturer in Applied Linguistics, Cardiff University
- 2014 – 2015: Senior Lecturer in Applied Linguistics, Newcastle University
- 2011 – 2014: Lecturer in Applied Linguistics, Newcastle University
- 2006 – 2011: Research Assistant (then Associate, then Fellow), The University of Nottingham
Speaking engagements
Invitations to address conferences, workshops and seminars
- Knight, D. (2017). Qualitative Analysis of NSS Responses. Invited presentation delivered to the Business Intelligence Unit, 3/4/17, Cardiff University.
- Knight, D. (2017). Big Data and Corpus Construction: Introducing CorCenCC. Invited seminar presentation at the Investigating (with) Big Data event run by the Cardiff University Digital Humanities Network, 24/5/17, Cardiff University.
- Knight, D. (2017). Research funding and building networks in the Arts, Humanities and Social Sciences: the case of CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh). Invited seminar presentation as part of the Cardiff School of Journalism, Media and Cultural Studies 2016/17 research seminar series, 5/4/17, Cardiff University.
- Knight, D. (2017). Constructing corpora of minoritized languages: A focus on CorCenCC. Invited plenary presentation delivered as part of the Corpus Linguistics in the South Conference, 4/3/17, Birkbeck University.
- Knight, D. (2016). Constructing E-Language Corpora: a focus on CorCenCC (The National Corpus of Contemporary Welsh). Invited plenary presentation at the 4th Computer-Mediated Communication and Social Media Corpora for the Humanities conference, 27-28/9/16, University of Ljubljana, Slovenia.
- Knight, D. (2016). Innovations in corpus-based research. Invited seminar presentation at the Tokyo Chapter of the Japanese Association of Language Teachers (JALT) meeting, 9/9/16, Tokyo, Japan.
- Knight, D. (2016). The application of corpora: supporting and informing the pedagogic landscape. Invited plenary presentation at the InForm Conference, 16/7/16, Durham University.
- Knight, D. (2016). Corpora and Pedagogy: developing the community-driven National Corpus of Contemporary Welsh. Invited presentation at the Welsh for Adults annual conference, 8/7/16, Cardiff.
- Knight, D. (2016). The National Corpus of Contemporary Welsh: A community driven approach to linguistic corpus construction. Invited presentation at the UCREL Corpus Research Seminar Series, 9/6/16, Lancaster University.
- Knight, D. (2015). Dispelling the myths: the ubiquity of corpora in linguistic research. Invited keynote presentation at the annual Cardiff University ENCAP Postgraduate Conference, 2/6/15, Cardiff University.
- Knight, D. (2015). Multimodal Corpus Linguistics. Invited presentation delivered at the joint seminar between Lund University Humanities Lab and the Linnaeus Centre CCL, 26/5/15, Lund University.
- Knight, D. (2015). Analysing Literature using Corpora. Invited presentation delivered as part of the Cardiff BookTalk series, 30/4/15, Cardiff University.
- Knight, D. (2015). Multimodal Corpus Linguistics. Invited presentation delivered as part of the Vlunch Seminar Series, School of Computer Science and Informatics, Cardiff University, 30th April.
- Knight, D. (2014). Practical applications for corpora. Invited workshop at Welsh tutors conference, 5/12/14, Cardiff University.
- Knight, D. (2014). (Re)defining context in corpus linguistics. Invited keynote presentation at Information Visualization seminar series, 5/11/14, Potsdam University of Applied Sciences.
- Knight, D. and Murphy, B. (2014). Exploring the meta in 'meta-data': corpus investigations in sociolinguistic contexts. Invited keynote presentation at IVACS 2014, 13/6/14, Newcastle University.
- Knight, D. (2013). A corpus-based approach to Digital Discourse. Invited keynote presentation at the BAAL Language and New Media SIG event ‘Research Methods and Approaches for Analysing Social Media’, 22/11/13, Leicester University.
- Knight, D. (2013). Record – Transcribe – Code – Analyse: Tackling Multimodal Data. Invited keynote presentation at the annual Newcastle University ECLS Postgraduate Conference, 20/6/13, Newcastle University.
- Knight, D. (2013). Recording and analysing real-life interaction ‘in the wild’. Invited keynote presentation at the Cardiff School of English PhD Applied Linguistics (Lexical Studies) Annual conference, 21/3/13, Cardiff University, Wales, UK.
- Knight, D. (2013). Gesture and talk ‘in the wild’. Invited keynote presentation at the BAAL Corpus Linguistics SIG event “Building and Mining Small Specialised Corpora”. Edinburgh, 22/2/13.
- Carter, R. and Knight, D. (2012). CANELC – The Cambridge and Nottingham eLanguage Corpus. Invited keynote presentation at the ELT Insights Seminar, 24/1/13, Cambridge University Press, Cambridge.
- Knight, D. and Adolphs, S. (2011). Multimodal Corpora for Sign Language Research. Invited keynote presentation at the 2nd Symposium in Applied Sign Linguistics. “Documenting Sign Languages for Learning and Teaching Purposes”. Bristol, June 2011.
- Knight, D. (2011). Mobile and Location-based Data: Capture, Representation and Analysis. Paper presented at the CAQDAS digital social research showcase event, 23rd March 2011, Oxford, UK.
Conference papers, workshops, posters and conference demonstrations
- Knight, D., Fitzpatrick, T. and Morris, S. (2017). CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh): An overview. Paper presented as part of the annual British Association for Applied Linguistics (BAAL) conference, September 2017, University of Leeds.
- Morris, S., Fitzpatrick, T. and Knight, D. (2017). Creating pedagogic wordlists in an under-resourced language. Poster presented as part of the annual British Association for Applied Linguistics (BAAL) conference, September 2017, University of Leeds.
- Rees, M., Watkins, G., Needs, J., Morris, S. and Knight, D. (2017). Creating a Bespoke Corpus Sampling Frame for a Minoritised Language: CorCenCC, the National Corpus of Contemporary Welsh. Paper presented at the CL2017 conference, University of Birmingham, Birmingham, 24-28 July 2017.
- Piao, S., Rayson, P., Watkins, G., Knight, D. and Donnelly, K. (2017). Towards a Welsh Semantic Tagger: Creating Lexicons for A Resource Poor Language. Paper presented at the Corpus Linguistics Conference 2017, July 2017, University of Birmingham.
- Piao, S., Rayson, P., Knight, D., Watkins, G. and Donnelly, K. (2017). Towards a Welsh Semantic Tagger: Creating Lexicons for A Resource Poor Language. Paper presented at the CL2017 conference, University of Birmingham, Birmingham, 24-28 July 2017.
- Needs, J., Knight, D., Morris, S., Fitzpatrick, T., Thomas, E. and Neale, S. (2017). "How will you make sure the material is suitable for children?": User-informed design of Welsh corpus-based learning/teaching tools. Paper presented at the CL2017 conference, University of Birmingham, Birmingham, 24-28 July 2017.
- Neale, S., Spasić, I., Needs, J., Watkins, G., Morris, S., Fitzpatrick, T., Marshall, L. and Knight, D. (2017). The CorCenCC Crowdsourcing App: A Bespoke Tool for the User-Driven Creation of the National Corpus of Contemporary Welsh. Paper presented at the CL2017 conference, University of Birmingham, Birmingham, 24-28 July 2017.
- Knight, D., Morris, S., Fitzpatrick, T. and Anthony, L. (2016). Charting the vocabulary of a minoritised language: Challenges and opportunities in the creation and application of the National Corpus of Contemporary Welsh. Paper presented at the Vocab@Tokyo international conference, September 2016, Tokyo, Japan.
- Fitzpatrick, T., Knight, D. and Morris, S. (2016). Creating pedagogical wordlists: a comparison of thematic and corpus approaches. Poster presented at the Pacific Second Language Research Forum (PacSLRF2016), September 2016, Tokyo, Japan.
- Knight, D., Fitzpatrick, T. and Morris, S. (2016). CorCenCC - Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh). WISERD (Wales Institute of Social and Economic Research, Data and Methods), July 2016, Swansea University.
- Handford, M. and Knight, D. (2016). Corpus-informed discourse analysis: a methodology for exploring context in spoken corpora. Paper presented at the IVACS 2016 conference, June 2016, Bath Spa University.
- Knight, D., Neale, S., Spasic, I., Morris, S. and Fitzpatrick, T. (2016). Crowdsourcing corpus construction: contextualizing plans for CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh). Paper presented at the IVACS 2016 conference, June 2016, Bath Spa University.
- Needs, J., Rees, M., Watkins, G., Morris, S., Knight, D. and Fitzpatrick, T. (2016). CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes – The National Corpus of Contemporary Welsh): Challenges and applications in a minoritised language context. Paper presented at the IVACS 2016 conference, June 2016, Bath Spa University.
- Piao, S., Rayson, P., Archer, D., Bianchi, F., Dayrell, C., El-Haj, M., Jiménez, R-M., Knight, D., Křen, M., Löfberg, L., Nawab, R., Shafi, J., Teh, P-L. and Mudraya, O. (2016). Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages. Paper delivered at the LREC (Language Resources Evaluation) 2016 Conference, May 2016, Slovenia.
- Adolphs, S., Knight, D., Ofemile, A. and Clark, L. (2016). Crowdsourcing new communities of discourse: analysing human-computer interaction in different contexts. Paper presented at the AAAL (American Association of Applied Linguistics) conference, March 2016, Florida, USA.
- Knight, D. and Adolphs, S. (2015). Language in the Digital Age: Revisiting the Speech-Writing Continuum. Paper delivered as part of the Topics in Corpus Linguistics for Social Media Research workshop, Corpus Linguistics 2015, July 2015, Lancaster University.
- Knight, D. (2014). The ‘spokenness’ of e-language. Paper presented at the Corpus Linguistics in the South Conference, November 2014, Reading University.
- Seedhouse, P. and Knight, D. (2014). Using technology to solve a research problem. Paper presented at the British Association for Applied Linguistics (BAAL) annual meeting, September 2014, Warwick University.
- Adolphs, S. and Knight, D. (2014). Capturing Formulaic Sequences 'in the wild'. Paper presented as part of the New Insights into the Acquisition, Assessment and Pedagogy of Formulaic Language colloquium held at the American Association for Applied Linguistics conference, 23/3/14, Portland, Oregon.
- Knight, D. (2013). Gesture and talk ‘in the wild’. Paper presented at the American Association for Applied Linguistics 2013 Conference (AAAL), March 2013, Dallas, Texas.
- Knight, D. (2012). Gesture and talk ‘in the wild’. Paper presented at the British Association for Applied Linguistics (BAAL) annual meeting, September 2012, Southampton University.
- Adolphs, S. and Knight, D. (2012). Formality and professional discourse in online contexts. Paper presented at the British Association for Applied Linguistics (BAAL) annual meeting, September 2012, Southampton University.
- Knight, D. and Adolphs, S. (2012). CANELC: Cambridge and Nottingham eLanguage Corpus. Paper presented at the Inter-Varietal Applied Corpus Studies (IVACS) symposium, January 2012, Cambridge University.
- Walsh, S. and Knight, D. (2012). Investigating small group teaching in a higher education context. Paper presented at the CUP English Profile Seminar, February 2012, Cambridge University.
- Knight, D. and Walsh, S. (2012). Investigating small group teaching in a higher education context. Paper presented at the Inter-Varietal Applied Corpus Studies (IVACS) symposium, January 2012, Cambridge University.
- Knight, D., Mullany, L., Adolphs, S., Harvey, K., Hunt, D., Smith, C. and Atkins, S. (2011). New Developments in multi-modal Corpus Analysis. Paper presented at World Congress of Applied Linguistics (AILA), August 2011, Beijing, China.
- Knight, D. and Adolphs, S. (2011). Experiencing space and place: A multi-modal corpus approach. Paper presented at the BAAL annual conference, September 2011, Bristol UWE.
- Knight, D. and Adolphs, S. (2010). Space, place and SMS: capturing context and network in multimodal corpus development. Paper presented at the BAAL annual conference, September 2010, Edinburgh.
- Adolphs, S., Carter, R. and Knight, D. (2010). Second phase multi-modal corpora: Heterogeneous datasets for linguistic analysis. Paper delivered at the 5th Inter-Varietal Applied Corpus Studies (IVACS) conference, June 2010, University of Edinburgh.
- Knight, D., Tennent, P., Adolphs, S. and Carter, R. (2010). Developing heterogeneous corpora using the Digital Replay System (DRS). Paper delivered at the LREC 2010 (Language Resources Evaluation Conference) Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, Giessen, Germany.
- Knight, D., Carter, R. and Adolphs, S. (2010). Corpora and context: A discussion of ‘Thrill’. Paper presented at the 31st ICAME conference, May 2010, Giessen University, Germany.
- Knight, D. (2010). Language, Corpora and Context: A 'Thrilling' Case Study. Paper presented at the IVACS Annual Research Symposium, January 2010, University of Leeds.
- Adolphs, S. and Knight, D. (2009). Language, Corpus and Context: Record, represent and replay. Presentation delivered as part of the Second and Foreign Language Pedagogy Seminar Series, October 2009, School of Education, Nottingham University.
- Carter, R., Adolphs, S. and Knight, D. (2009). Language, Corpus and Context: ubiquitous computing and corpus development. Paper delivered at the BAAL annual conference, September 2009, Newcastle.
- Knight, D., Adolphs, S., Carter, R. and Tennent, P. (2009). A multi-modal approach to the construction and analysis of spoken corpora. A two-hour workshop co-ordinated at the Corpus Linguistics 2009 conference, July 2009, Liverpool University.
- Knight, D. (2009). Collecting and collating heterogeneous datasets for multi-modal corpora. Paper presented at the Corpus Linguistics 2009 conference, July 2009, Liverpool University.
- Adolphs, S., Carter, R., Knight, D., Brundell, P. and Tennent, P. (2009). Constructing and interrogating linguistic corpora using heterogeneous datasets. A two-hour workshop co-ordinated at the 5th International Conference on e-Social Science (ICeSS), Cologne, June 2009.
- Adolphs, S., Knight, D. and Carter, R. (2009). Redefining context in communication: a multi-modal perspective. Paper presented at the 30th ICAME conference, May 2009, Lancaster University.
- Knight, D., Adolphs, S. and Carter, R. (2009). Multi-modal corpus construction and analysis. Poster presented at the 30th ICAME conference, May 2009, Lancaster University.
- Knight, D. and Adolphs, S. (2009). Corpus Perspectives: from production to reception. Paper presented at the Inter-Varietal Applied Corpus Studies (IVACS) Annual Research Symposium, January 2009, University of Edinburgh.
- Adolphs, S. and Knight, D. (2008). Analysing Discourse Markers: A Multi-Modal Approach. Paper presented at the BAAL 2008 annual conference, September 2008, University of Swansea.
- Brundell, P., Tennent, P., Greenhalgh, C., Knight, D., Crabtree, A., O’Malley, C., Ainsworth, S., Clarke, D., Carter, R. and Adolphs, S. (2008). Digital Replay System (DRS): A Tool for Interaction Analysis. Paper delivered at the International Conference for the Learning Sciences 2008 (ICLS), Utrecht, The Netherlands, June-July 2008.
- Brundell, P., Knight, D., Tennent, P., Naeem, A., Adolphs, S., Ainsworth, S., Carter, R., Clarke, D., Crabtree, A., Greenhalgh, C., O’Malley, C., Pridmore, T. and Rodden, T. (2008). The experience of using Digital Replay System for social science research. Paper presented at the 4th International Conference on e-Social Science (ICeSS), the University of Manchester, June 2008.
- Knight, D. and Evans, D. (2008). Multi-Modal Corpora, Discourse and Gesture. Paper presented at AAAL 2008, Washington DC, US.
- Knight, D. (2008). Gesturing power in dyadic conversations: A study of academic supervisory meetings. Paper delivered at the 4th Inter-Varietal Applied Corpus Studies (IVACS) conference, June 2008, University of Limerick.
- Knight, D. and Tennent, P. (2008). Introducing DRS: A tool for the future of Corpus Linguistic research and analysis. Poster presentation with demo, delivered at the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès Mansour Eddahbi, Marrakech, Morocco.
- Knight, D., Adolphs, S., Tennent, P. and Carter, R. (2008). The Nottingham Multi-Modal Corpus: A Demonstration. Paper delivered during the ‘Multimodal Corpora’ workshop held at the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès Mansour Eddahbi, Marrakech, Morocco.
- Adolphs, S., Knight, D. and Evans, D. (2007). Multi-modal corpora. Presentation delivered as part of the CRAL seminar series, University of Nottingham, October 2007.
- Adolphs, S. and Knight, D. (2007). HeadTalk. Presentation delivered as part of the Technologies for Enhancing Visual Methods workshop at the 3rd International eSocial Science Conference (ICeSS), University of Michigan, US.
- Tennent, P. and Knight, D. (2007). Multi-modal corpora: Adapting gesture recognition techniques for linguistic analysis. Poster delivered at the 3rd International eSocial Science Conference (ICeSS), October 2007, University of Michigan, US.
- Adolphs, S., Carter, R., Knight, D. and Evans, D. (2007). e-Social Science and Applied Linguistics: a multimodal corpus case study. Paper delivered at the New Horizons in Linguistics symposium, September 2007, University of Oxford.
- Knight, D., Evans, D., Carter, R. and Adolphs, S. (2007). Multi-modal corpus design, construction and use. Paper delivered at the BAAL 2007 annual conference, September 2007, University of Edinburgh.
- Knight, D., Evans, D., Adolphs, S. and Carter, R. (2007). Approaching the problems: Capturing, coding and analysing gesture in multi-modal communication data. Paper delivered at the Corpus Linguistics 2007 Conference, July 2007, University of Birmingham.
- Knight, D. (2007). HeadTalk: The development and exploration of multi-modal linguistic corpora. Paper delivered at the Annual Nottingham University Postgraduate Symposium, 2007.
- Knight, D. (2006). Little old ladies and dodgy old men: An exploration of the representation of old age in everyday spoken discourse. Paper delivered at GLoBE conference, September 2006, University of Warsaw.
- Carter, R., Knight, D. and Adolphs, S. (2006). Head-talk: Towards a Multi-Modal Corpus. Paper delivered at the BAAL 2006 annual conference, September 2006, University College, Cork.
- Knight, D., Bayoumi, S., Mills, S., Crabtree, A., Adolphs, S., Pridmore, T. and Carter, R. (2006). Beyond the Text: Construction and Analysis of Multi-Modal Linguistic Corpora. Paper presented at 2nd International Conference on e-Social Science (ICeSS), Manchester, 28 - 30 June 2006.
- French, A., Wright, M., Greenhalgh, C., Knight, D., Brundell, P., O'Malley, C., Ainsworth, S., Clarke, D. and Rodden, T. (2006). ‘Replaytool’ software in practice. Poster presented at the 2nd annual international eSocial Science Conference (ICeSS), June 2006, Manchester University.
- Knight, D., Adolphs, S. and Carter, R. (2006). The Multi-Modal Corpus: Coding and representing data - the issues. Paper delivered at the 3rd Inter-Varietal Applied Corpus Studies conference, May 2006, University of Nottingham.
- Knight, D. and Adolphs, S. (2006). Analysing Spoken Corpora: Methodological Issues and Technological Challenges. Delivered at the BAAL SIG Seminar (Special Interest Group: Corpus Linguistics), April 2006, The Open University, Milton Keynes.
- Knight, D. (2006). Developing a Multi-Modal Corpus: Data Coding Issues. Paper delivered at the Inter-Varietal Applied Corpus Studies (IVACS) Annual Research Symposium, February 2006, University of Limerick.
Publications
2023
- Knight, D., Fitzpatrick, T., Morris, S., Tovey-Walsh, B., Prosser, H. and Davies, E. 2023. Corpus to curriculum: Developing word lists for adult learners of Welsh. Applied Corpus Linguistics 3(2), article number: 100052. (10.1016/j.acorp.2023.100052)
- Vilar-Lluch, S., McClaughlin, E., Knight, D., Adolphs, S. and Nichele, E. 2023. The language of vaccination campaigns during COVID-19. Medical Humanities (10.1136/medhum-2022-012583)
- Adolphs, S. et al. 2023. Communicating health threats: Linguistic evidence for effective public health messaging during the Covid-19 pandemic. University of Nottingham.
- Knight, D., O'Keeffe, A., Fitzgerald, C., Mark, G., McNamara, J. and Farr, F. 2023. Indicating engagement in online workplace meetings: The role of backchannelling head nods. International Journal of Corpus Linguistics (IJCL)
- Khallaf, N. et al. 2023. Open-source thesaurus development for under-resourced languages: a Welsh case study. Presented at: LDK 2023 – 4th Conference on Language, Data and Knowledge, Vienna, Austria, 12-15 September 2023.
2022
- McClaughlin, E. et al. 2022. The reception of public health messages during the COVID-19 pandemic. Applied Corpus Linguistics 3(1), article number: 100037. (10.1016/j.acorp.2022.100037)
- Ezeani, I., El-Haj, M., Morris, J. and Knight, D. 2022. Introducing the Welsh text summarisation dataset and baseline systems. Presented at: 13th ELRA Language Resources and Evaluation Conference (LREC 2022), Marseille, France, 20-25 June 2022.
- El-Haj, M., Ezeani, I., Morris, J. and Knight, D. 2022. Creation of an evaluation corpus and baseline evaluation scores for Welsh text summarisation. Presented at: 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France, 20 June 2022.
- Clos, J., McClaughlin, E., Barnard, P., Nichele, E., Knight, D., McAuley, D. and Adolphs, S. 2022. PriPA: a tool for privacy-preserving analytics of linguistic data. Presented at: Legal and Ethical Issues in Human Language Technologies 2022, Marseille, France, 24 June 2022.
- Morris, J., Ezeani, I., Gruffydd, I., Young, K., Davies, L., El-Haj, M. and Knight, D. 2022. Welsh automatic text summarisation. Presented at: Wales Academic Symposium on Language Technologies 2022, Bangor, Wales, 28/01/2022. Language and Technology in Wales, Vol. 2. Bangor: Canolfan Bedwyr.
2021
- McClaughlin, E. et al. 2021. Privacy preserving corpus linguistics: investigating the trajectories of public health messaging online. University of Nottingham.
- Muralidaran, V., Spasic, I. and Knight, D. 2021. A systematic review of unsupervised approaches to grammar induction. Natural Language Engineering 27(6), pp. 647-689. (10.1017/S1351324920000327)
- Knight, D., Morris, S., Arman, L., Needs, J. and Rees, M. 2021. Building a national corpus: a Welsh language case study. Basingstoke: Palgrave Macmillan.
- Knight, D., Loizides, F., Neale, S., Anthony, L. and Spasic, I. 2021. Developing computational infrastructure for the CorCenCC corpus - the National Corpus of Contemporary Welsh. Language Resources and Evaluation 55, pp. 789-816. (10.1007/s10579-020-09501-9)
- McClaughlin, E. et al. 2021. Public health messaging by political leaders: a corpus linguistic analysis of COVID-19 speeches delivered by Boris Johnson. University of Nottingham. Available at: https://doi.org/10.17639/3fgb-fn44
- Corcoran, P., Palmer, G., Arman, L., Knight, D. and Spasic, I. 2021. Creating Welsh language word embeddings. Applied Sciences 11(15), article number: 6896. (10.3390/app11156896)
- Espinosa-Anke, L., Palmer, G., Filimonov, M., Corcoran, P., Spasic, I. and Knight, D. 2021. English–Welsh cross-lingual embeddings. Applied Sciences 11(14), article number: 6541. (10.3390/app11146541)
- Knight, D., Morris, S. and Fitzpatrick, T. 2021. Corpus design and construction in minoritised language contexts - Cynllunio a chreu corpws mewn cyd-destunau Ieithoedd lleiafrifoledig: The National Corpus of Contemporary Welsh - Corpws Cenedlaethol Cymraeg Cyfoes. Basingstoke: Palgrave Macmillan.
- McClaughlin, E. et al. 2021. Using online news comments to gather fast feedback on issues with public health messaging: The Guardian as a case study. Project Report. [Online]. University of Nottingham. Available at: https://nottingham-repository.worktribe.com/output/5717332
- Palmer, G., Corcoran, P., Arman, L., Knight, D. and Spasic, I. 2021. A closer look at Welsh word embeddings. In: Prys, D. ed. Language and Technology in Wales: Volume 1. Bangor: Bangor University, pp. 21-29.
- Muralidaran, V., Palmer, G., Arman, L., O'Hare, K., Knight, D. and Spasic, I. 2021. A practical implementation of a porter stemmer for Welsh. In: Prys, D. ed. Language and Technology in Wales: Volume 1. Bangor: Bangor University, pp. 30-43.
2020
- Chen, Y., Adolphs, S. and Knight, D. 2020. Multimodal discourse analysis. In: Friginal, E. and Hardy, J. eds. The Routledge Handbook of Corpus Approaches to Discourse Analysis. London: Routledge
- Knight, D. and Adolphs, S. 2020. Multimodal corpora. In: Paquot, M. and Gries, S. T. eds. A Practical Handbook of Corpus Linguistics. Springer International Publishing, pp. 351-369.
- Knight, D., Morris, S., Fitzpatrick, T., Rayson, P., Spasić, I. and Môn Thomas, E. 2020. The national corpus of contemporary Welsh: project report | Y corpws cenedlaethol Cymraeg cyfoes: adroddiad y prosiect. Project Report. CorCenCC.
- Muralidaran, V., Spasic, I. and Knight, D. 2020. A cognitive approach to parsing with neural networks. Presented at: International Conference on Statistical Language and Speech Processing (SLSP), Cardiff, UK, 14–16 Oct 2020. Statistical Language and Speech Processing, Vol. 12379. Springer Verlag, pp. 71-84. (10.1007/978-3-030-59430-5_6)
- Adolphs, S., Knight, D., Smith, C. and Price, D. 2020. Crowdsourcing formulaic phrases: towards a new type of spoken corpus. Corpora 15(2), pp. 141-168. (10.3366/COR.2020.0192)
- Adolphs, S. and Knight, D. eds. 2020. The Routledge handbook of English language and digital humanities. Routledge Handbooks in English Language Studies. Abingdon: Routledge.
2019
- Ezeani, I., Piao, S., Neale, S., Rayson, P. and Knight, D. 2019. Leveraging pre-trained embeddings for Welsh Taggers. Presented at: 4th Workshop on Representation Learning for NLP, Florence, Italy, July 2019. ACL Anthology: Proceedings of the 4th Workshop on Representation Learning for NLP, Vol. W19-43. Association for Computational Linguistics. (10.18653/v1/W19-4332)
- Spasic, I., Owen, D., Knight, D. and Artemiou, A. 2019. Unsupervised multi-word term recognition in Welsh. Presented at: Celtic Language Technology Workshop 2019, Dublin, Ireland, 19 August 2019. In: Lynn, T. et al. eds. Proceedings of the Celtic Language Technology Workshop. European Association for Machine Translation.
2018
- Piao, S., Rayson, P., Knight, D. and Watkins, G. 2018. Towards a Welsh semantic annotation system. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan, 7 - 12 May 2018.
- Neale, S., Donnelly, K., Watkins, G. and Knight, D. 2018. Leveraging lexical resources and constraint grammar for rule-based part-of-speech tagging in Welsh. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan, 7 - 12 May 2018.
2017
- Knight, D., Walsh, S. and Papagiannidis, S. 2017. I’m having a spring clear out: a corpus-based analysis of e-transactional discourse. Applied Linguistics 38(2), pp. 234-257. (10.1093/applin/amv019)
- Neale, S. et al. 2017. The CorCenCC crowdsourcing app: a bespoke tool for the user-driven creation of the national corpus of contemporary Welsh. Presented at: The 9th International Corpus Linguistics Conference, Birmingham, UK, 24-28 July 2017.
2016
- Walsh, S. and Knight, D. 2016. Analysing spoken discourse in University small group teaching. In: Corrigan, K. P. and Mearns, A. eds. Creating and Digitizing Language Corpora: Volume 3: Databases for Public Engagement. Basingstoke: Palgrave Macmillan, pp. 291-319.
- Knight, D. et al. 2016. Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages. Presented at: LREC 2016, Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Portorož, Slovenia, 23-28 May 2016.
- Seedhouse, P. and Knight, D. 2016. Applying digital sensor technology: A problem-solving approach. Applied Linguistics 37(1), pp. 7-32. (10.1093/applin/amv065)
2015
- Knight, D. 2015. e-Language: communication in the digital age. In: Baker, P. and McEnery, T. eds. Corpora and Discourse Studies: Integrating Discourse and Corpora. Palgrave Advances in Language and Linguistics. Basingstoke: Palgrave Macmillan, pp. 20-40. (10.1057/9781137431738_2)
- Crabtree, A., Tennent, P., Brundell, P. and Knight, D. 2015. Digital records and the digital replay system. In: Halfpenny, P. J. and Procter, R. eds. Innovations in Digital Research Methods. London: Sage.
- Dörk, M. and Knight, D. 2015. WordWanderer: A navigational approach to text visualisation. Corpora 10(1), pp. 83-94. (10.3366/cor.2015.0067)
- Adolphs, S. and Knight, D. 2015. Beyond monomodal spoken corpora. In: Baker, P. and McEnery, T. eds. Corpora and Discourse Studies: Integrating Discourse and Corpora. Palgrave Advances in Language and Linguistics. Houndmills, Basingstoke: Palgrave Macmillan, pp. 41-62.
2014
- Knight, D., Adolphs, S. and Carter, R. 2014. CANELC – constructing an e-language corpus. Corpora 9(1), pp. 29-56. (10.3366/cor.2014.0050)
2013
- Knight, D., Adolphs, S. and Carter, R. 2013. Formality in digital discourse: a study of hedging in CANELC. In: Romero-Trillo, J. ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies. Yearbook of corpus linguistics and pragmatics, Vol. 1. Springer Netherlands, pp. 131-152. (10.1007/978-94-007-6250-3_7)
- Knight, D. 2013. Corpus linguistics: methods, theory and practice by Tony McEnery and Andrew Hardie [Book Review]. In: Romero-Trillo, J. ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies. Yearbook of corpus linguistics and pragmatics, Vol. 1. Springer Netherlands, pp. 275-277. (10.1007/978-94-007-6250-3_13)
2011
- Knight, D. 2011. Multimodality and active listenership: a corpus approach. Corpus and discourse. London: Bloomsbury.
- Knight, D. 2011. The future of multimodal corpora. Revista Brasileira de Linguística Aplicada 11(2), pp. 391-415. (10.1590/S1984-63982011000200006)
- Adolphs, S., Knight, D. and Carter, R. 2011. Capturing context for heterogeneous corpus analysis: some first steps. International journal of corpus linguistics 16(3), pp. 305-324. (10.1075/ijcl.16.3.02ado)
2010
- Knight, D., Tennent, P., Adolphs, S. and Carter, R. 2010. Developing heterogeneous corpora using the Digital Replay System (DRS). Presented at: Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, 18 May 2010. In: Kipp, M. et al. eds. Proceedings of the LREC 2010 (Language Resources Evaluation Conference) Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, Malta. European Language Resources Association, pp. 16-21.
- Adolphs, S. and Knight, D. 2010. Building a spoken corpus: What are the basics? In: O’Keeffe, A. and McCarthy, M. eds. The Routledge handbook of corpus linguistics. Routledge handbooks in applied linguistics. Oxford: Routledge.
2009
- Knight, D., Evans, D., Carter, R. and Adolphs, S. 2009. HeadTalk, HandTalk and the corpus: towards a framework for multi-modal, multi-media corpus development. Corpora 4(1), pp. 1-32. (10.3366/E1749503209000203)
- Knight, D. 2009. A multi-modal corpus approach to the analysis of backchanneling behaviour. PhD Thesis, University of Nottingham.
2008
- Brundell, P. et al. 2008. The experience of using Digital Replay System for social science research. Presented at: 4th International Conference on e-Social Science (ICeSS), Manchester, UK, 18-20 June 2008. Proceedings of the 4th International Conference on e-Social Science (ICeSS), Manchester, 18-20 June 2008. ICeSS, pp. 1-10.
- Knight, D. and Tennent, P. 2008. Introducing DRS (The Digital Replay System): A tool for the future of corpus linguistic research and analysis. Presented at: Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakesh, Morocco, 26 May - 1 June 2008. In: Calzolari, N. et al. eds. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 26-31.
- Knight, D., Adolphs, S., Tennent, P. and Carter, R. 2008. The Nottingham Multi-Modal Corpus: a demonstration. Presented at: 6th Language Resources and Evaluation Conference (LREC), Marrakesh, Morocco, 28-30 May 2008. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 1-7.
- Knight, D. and Adolphs, S. 2008. Multi-modal corpus pragmatics: the case of active listenership. In: Romero-Trillo, J. ed. Pragmatics and corpus linguistics: a mutualistic entente. Mouton series in pragmatics, Vol. 2. Mouton de Gruyter, pp. 175-190.
- Brundell, P. et al. 2008. Digital Replay System (DRS): a tool for interaction analysis. Presented at: ICLS2008: International Perspectives in the Learning Sciences Cre8ing a learning world, Utrecht, The Netherlands, 23-28 June 2008.
2006
- Knight, D., Bayoumi, S., Mills, S., Crabtree, A., Adolphs, S., Pridmore, T. and Carter, R. 2006. Beyond the text: construction and analysis of multi-modal linguistic corpora. Presented at: 2nd International Conference on e-Social Science, Manchester, UK, 28-30 June 2006. Proceedings of the 2nd International Conference on e-Social Science, Manchester, 28 - 30 June 2006. ICeSS.
Articles
- Knight, D., Fitzpatrick, T., Morris, S., Tovey-Walsh, B., Prosser, H. and Davies, E. 2023. Corpus to curriculum: Developing word lists for adult learners of Welsh. Applied Corpus Linguistics 3(2), article number: 100052. (10.1016/j.acorp.2023.100052)
- Vilar-Lluch, S., McClaughlin, E., Knight, D., Adolphs, S. and Nichele, E. 2023. The language of vaccination campaigns during COVID-19. Medical Humanities (10.1136/medhum-2022-012583)
- Knight, D., O'Keeffe, A., Fitzgerald, C., Mark, G., McNamara, J. and Farr, F. 2023. Indicating engagement in online workplace meetings: The role of backchannelling head nods. International Journal of Corpus Linguistics (IJCL)
- McClaughlin, E. et al. 2022. The reception of public health messages during the COVID-19 pandemic. Applied Corpus Linguistics 3(1), article number: 100037. (10.1016/j.acorp.2022.100037)
- Muralidaran, V., Spasic, I. and Knight, D. 2021. A systematic review of unsupervised approaches to grammar induction. Natural Language Engineering 27(6), pp. 647-689. (10.1017/S1351324920000327)
- Knight, D., Loizides, F., Neale, S., Anthony, L. and Spasic, I. 2021. Developing computational infrastructure for the CorCenCC corpus - the National Corpus of Contemporary Welsh. Language Resources and Evaluation 55, pp. 789-816. (10.1007/s10579-020-09501-9)
- Corcoran, P., Palmer, G., Arman, L., Knight, D. and Spasic, I. 2021. Creating Welsh language word embeddings. Applied Sciences 11(15), article number: 6896. (10.3390/app11156896)
- Espinosa-Anke, L., Palmer, G., Filimonov, M., Corcoran, P., Spasic, I. and Knight, D. 2021. English–Welsh cross-lingual embeddings. Applied Sciences 11(14), article number: 6541. (10.3390/app11146541)
- Adolphs, S., Knight, D., Smith, C. and Price, D. 2020. Crowdsourcing formulaic phrases: towards a new type of spoken corpus. Corpora 15(2), pp. 141-168. (10.3366/COR.2020.0192)
- Knight, D., Walsh, S. and Papagiannidis, S. 2017. I’m having a spring clear out: a corpus-based analysis of e-transactional discourse. Applied Linguistics 38(2), pp. 234-257. (10.1093/applin/amv019)
- Seedhouse, P. and Knight, D. 2016. Applying digital sensor technology: A problem-solving approach. Applied Linguistics 37(1), pp. 7-32. (10.1093/applin/amv065)
- Dörk, M. and Knight, D. 2015. WordWanderer: A navigational approach to text visualisation. Corpora 10(1), pp. 83-94. (10.3366/cor.2015.0067)
- Knight, D., Adolphs, S. and Carter, R. 2014. CANELC – constructing an e-language corpus. Corpora 9(1), pp. 29-56. (10.3366/cor.2014.0050)
- Knight, D. 2011. The future of multimodal corpora. Revista Brasileira de Linguística Aplicada 11(2), pp. 391-415. (10.1590/S1984-63982011000200006)
- Adolphs, S., Knight, D. and Carter, R. 2011. Capturing context for heterogeneous corpus analysis: some first steps. International journal of corpus linguistics 16(3), pp. 305-324. (10.1075/ijcl.16.3.02ado)
- Knight, D., Evans, D., Carter, R. and Adolphs, S. 2009. HeadTalk, HandTalk and the corpus: towards a framework for multi-modal, multi-media corpus development. Corpora 4(1), pp. 1-32. (10.3366/E1749503209000203)
Book sections
- Palmer, G., Corcoran, P., Arman, L., Knight, D. and Spasic, I. 2021. A closer look at Welsh word embeddings. In: Prys, D. ed. Language and Technology in Wales: Volume 1. Bangor: Bangor University, pp. 21-29.
- Muralidaran, V., Palmer, G., Arman, L., O'Hare, K., Knight, D. and Spasic, I. 2021. A practical implementation of a porter stemmer for Welsh. In: Prys, D. ed. Language and Technology in Wales: Volume 1. Bangor: Bangor University, pp. 30-43.
- Chen, Y., Adolphs, S. and Knight, D. 2020. Multimodal discourse analysis. In: Friginal, E. and Hardy, J. eds. The Routledge Handbook of Corpus Approaches to Discourse Analysis. London: Routledge
- Knight, D. and Adolphs, S. 2020. Multimodal corpora. In: Paquot, M. and Gries, S. T. eds. A Practical Handbook of Corpus Linguistics. Springer International Publishing, pp. 351-369.
- Walsh, S. and Knight, D. 2016. Analysing spoken discourse in University small group teaching. In: Corrigan, K. P. and Mearns, A. eds. Creating and Digitizing Language Corpora: Volume 3: Databases for Public Engagement. Basingstoke: Palgrave Macmillan, pp. 291-319.
- Knight, D. 2015. e-Language: communication in the digital age. In: Baker, P. and McEnery, T. eds. Corpora and Discourse Studies: Integrating Discourse and Corpora. Palgrave Advances in Language and Linguistics. Basingstoke: Palgrave Macmillan, pp. 20-40. (10.1057/9781137431738_2)
- Crabtree, A., Tennent, P., Brundell, P. and Knight, D. 2015. Digital records and the digital replay system. In: Halfpenny, P. J. and Procter, R. eds. Innovations in Digital Research Methods. London: Sage.
- Adolphs, S. and Knight, D. 2015. Beyond monomodal spoken corpora. In: Baker, P. and McEnery, T. eds. Corpora and Discourse Studies: Integrating Discourse and Corpora. Palgrave Advances in Language and Linguistics. Houndmills, Basingstoke: Palgrave Macmillan, pp. 41-62.
- Knight, D., Adolphs, S. and Carter, R. 2013. Formality in digital discourse: a study of hedging in CANELC. In: Romero-Trillo, J. ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies. Yearbook of corpus linguistics and pragmatics, Vol. 1. Springer Netherlands, pp. 131-152. (10.1007/978-94-007-6250-3_7)
- Knight, D. 2013. Corpus linguistics: methods, theory and practice by Tony McEnery and Andrew Hardie [Book Review]. In: Romero-Trillo, J. ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies. Yearbook of corpus linguistics and pragmatics, Vol. 1. Springer Netherlands, pp. 275-277. (10.1007/978-94-007-6250-3_13)
- Adolphs, S. and Knight, D. 2010. Building a spoken corpus: What are the basics? In: O’Keeffe, A. and McCarthy, M. eds. The Routledge handbook of corpus linguistics. Routledge handbooks in applied linguistics. Oxford: Routledge.
- Knight, D. and Adolphs, S. 2008. Multi-modal corpus pragmatics: the case of active listenership. In: Romero-Trillo, J. ed. Pragmatics and corpus linguistics: a mutualistic entente. Mouton series in pragmatics, Vol. 2. Mouton de Gruyter, pp. 175-190.
Books
- Knight, D., Morris, S., Arman, L., Needs, J. and Rees, M. 2021. Building a national corpus: a Welsh language case study. Basingstoke: Palgrave Macmillan.
- Knight, D., Morris, S. and Fitzpatrick, T. 2021. Corpus design and construction in minoritised language contexts - Cynllunio a chreu corpws mewn cyd-destunau Ieithoedd lleiafrifoledig: The National Corpus of Contemporary Welsh - Corpws Cenedlaethol Cymraeg Cyfoes. Basingstoke: Palgrave Macmillan.
- Adolphs, S. and Knight, D. eds. 2020. The Routledge handbook of English language and digital humanities. Routledge Handbooks in English Language Studies. Abingdon: Routledge.
- Knight, D. 2011. Multimodality and active listenership: a corpus approach. Corpus and discourse. London: Bloomsbury.
Conferences
- Khallaf, N. et al. 2023. Open-source thesaurus development for under-resourced languages: a Welsh case study. Presented at: LDK 2023 – 4th Conference on Language, Data and Knowledge, Vienna, Austria, 12-15 September 2023.
- Ezeani, I., El-Haj, M., Morris, J. and Knight, D. 2022. Introducing the Welsh text summarisation dataset and baseline systems. Presented at: 13th ELRA Language Resources and Evaluation Conference (LREC 2022), Marseille, France, 20-25 June 2022.
- El-Haj, M., Ezeani, I., Morris, J. and Knight, D. 2022. Creation of an evaluation corpus and baseline evaluation scores for Welsh text summarisation. Presented at: 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France, 20 June 2022.
- Clos, J., McClaughlin, E., Barnard, P., Nichele, E., Knight, D., McAuley, D. and Adolphs, S. 2022. PriPA: a tool for privacy-preserving analytics of linguistic data. Presented at: Legal and Ethical Issues in Human Language Technologies 2022, Marseille, France, 24 June 2022.
- Morris, J., Ezeani, I., Gruffydd, I., Young, K., Davies, L., El-Haj, M. and Knight, D. 2022. Welsh automatic text summarisation. Presented at: Wales Academic Symposium on Language Technologies 2022, Bangor, Wales, 28 January 2022. Language and Technology in Wales, Vol. 2. Bangor: Canolfan Bedwyr.
- Muralidaran, V., Spasic, I. and Knight, D. 2020. A cognitive approach to parsing with neural networks. Presented at: International Conference on Statistical Language and Speech Processing (SLSP), Cardiff, UK, 14–16 Oct 2020. Statistical Language and Speech Processing, Vol. 12379. Springer Verlag, pp. 71-84. (10.1007/978-3-030-59430-5_6)
- Ezeani, I., Piao, S., Neale, S., Rayson, P. and Knight, D. 2019. Leveraging pre-trained embeddings for Welsh Taggers. Presented at: 4th Workshop on Representation Learning for NLP, Florence, Italy, July 2019. ACL Anthology: Proceedings of the 4th Workshop on Representation Learning for NLP, Vol. W19-43. Association for Computational Linguistics. (10.18653/v1/W19-4332)
- Spasic, I., Owen, D., Knight, D. and Artemiou, A. 2019. Unsupervised multi-word term recognition in Welsh. Presented at: Celtic Language Technology Workshop 2019, Dublin, Ireland, 19 August 2019. In: Lynn, T. et al. eds. Proceedings of the Celtic Language Technology Workshop. European Association for Machine Translation.
- Piao, S., Rayson, P., Knight, D. and Watkins, G. 2018. Towards a Welsh semantic annotation system. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan, 7 - 12 May 2018.
- Neale, S., Donnelly, K., Watkins, G. and Knight, D. 2018. Leveraging lexical resources and constraint grammar for rule-based part-of-speech tagging in Welsh. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan, 7 - 12 May 2018.
- Neale, S. et al. 2017. The CorCenCC crowdsourcing app: a bespoke tool for the user-driven creation of the national corpus of contemporary Welsh. Presented at: The 9th International Corpus Linguistics Conference, Birmingham, UK, 24-28 July 2017.
- Knight, D. et al. 2016. Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages. Presented at: LREC 2016, Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Portorož, Slovenia, 23-28 May 2016.
- Knight, D., Tennent, P., Adolphs, S. and Carter, R. 2010. Developing heterogeneous corpora using the Digital Replay System (DRS). Presented at: Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, 18 May 2010. In: Kipp, M. et al. eds. Proceedings of the LREC 2010 (Language Resources Evaluation Conference) Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, Malta. European Language Resources Association, pp. 16-21.
- Brundell, P. et al. 2008. The experience of using Digital Replay System for social science research. Presented at: 4th International Conference on e-Social Science (ICeSS), Manchester, UK, 18-20 June 2008. Proceedings of the 4th International Conference on e-Social Science (ICeSS), Manchester, 18-20 June 2008. ICeSS, pp. 1-10.
- Knight, D. and Tennent, P. 2008. Introducing DRS (The Digital Replay System): A tool for the future of corpus linguistic research and analysis. Presented at: Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakesh, Morocco, 26 May - 1 June 2008. In: Calzolari, N. et al. eds. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 26-31.
- Knight, D., Adolphs, S., Tennent, P. and Carter, R. 2008. The Nottingham Multi-Modal Corpus: a demonstration. Presented at: 6th Language Resources and Evaluation Conference (LREC), Marrakesh, Morocco, 28-30 May 2008. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrès, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 1-7.
- Brundell, P. et al. 2008. Digital Replay System (DRS): a tool for interaction analysis. Presented at: ICLS2008: International Perspectives in the Learning Sciences Cre8ing a learning world, Utrecht, The Netherlands, 23-28 June 2008.
- Knight, D., Bayoumi, S., Mills, S., Crabtree, A., Adolphs, S., Pridmore, T. and Carter, R. 2006. Beyond the text: construction and analysis of multi-modal linguistic corpora. Presented at: 2nd International Conference on e-Social Science, Manchester, UK, 28-30 June 2006. Proceedings of the 2nd International Conference on e-Social Science, Manchester, 28 - 30 June 2006. ICeSS.
Monographs
- Adolphs, S. et al. 2023. Communicating health threats: Linguistic evidence for effective public health messaging during the Covid-19 pandemic. University of Nottingham.
- McClaughlin, E. et al. 2021. Privacy preserving corpus linguistics: investigating the trajectories of public health messaging online. University of Nottingham.
- McClaughlin, E. et al. 2021. Public health messaging by political leaders: a corpus linguistic analysis of COVID-19 speeches delivered by Boris Johnson. University of Nottingham. Available at: https://doi.org/10.17639/3fgb-fn44
- McClaughlin, E. et al. 2021. Using online news comments to gather fast feedback on issues with public health messaging: The Guardian as a case study. Project Report. [Online]. University of Nottingham. Available at: https://nottingham-repository.worktribe.com/output/5717332
- Knight, D., Morris, S., Fitzpatrick, T., Rayson, P., Spasić, I. and Môn Thomas, E. 2020. The national corpus of contemporary Welsh: project report | Y corpws cenedlaethol Cymraeg cyfoes: adroddiad y prosiect. Project Report. CorCenCC.
Thesis
- Knight, D. 2009. A multi-modal corpus approach to the analysis of backchanneling behaviour. PhD Thesis, University of Nottingham.
Research
Research interests
My research interests lie in the areas of corpus linguistics, discourse analysis, lexico-grammar, digital interaction, non-verbal communication and the socio-linguistic contexts of communication. The main contribution of my work has been to pioneer the development of a new research area in applied linguistics: multimodal corpus-based discourse analysis. This has included the introduction of a novel methodological approach to the analysis of the relationships between language and gesture-in-use based on large-scale real-life records of interaction (corpora).
My current research interests build on this work by piloting the use of wearable technologies as a means of capturing language, gesture and embodied actions in naturally occurring interaction (‘in the wild’). I am also working on developing methodological and technical frameworks for crowdsourcing data collection for corpus compilation.
In September 2015, I secured funding from the ESRC and AHRC to lead (as Principal Investigator) a £1.8 million inter-disciplinary and multi-institutional project that will run from March 2016 to August 2019. The project, which is entitled ‘CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction’, will create a large scale, open source corpus of contemporary Welsh language.
The creation of CorCenCC is community-driven with impact being generated through a user-informed design, harnessing opportunities afforded by mobile technologies, specifically crowdsourcing (via an app) and community collaboration.
Academic partners (CIs) on this project include colleagues from Cardiff, Swansea, Lancaster and Bangor Universities. Other contributors and collaborators include computer programmers, Welsh language experts and a range of external stakeholders including the Welsh Government, National Assembly for Wales, Gwasg y Lolfa, Welsh for Adults, Welsh Joint Education Committee and University of Wales Dictionary of the Welsh Language. More detail can be found on our website.
Postgraduate students
I would be happy to supervise students in the following areas:
- Corpus Linguistics
- Digital interaction (‘E-language’)
- Language use in context
- Non-verbal communication
- Discourse analysis
I previously supervised:
- Shanru Yang, who completed her PhD between 2011 and 2014 at Newcastle University. Her thesis was entitled: Investigating discourse markers in Chinese college EFL teacher talk: A multi-layered analytical approach. This was a co-supervision with Steve Walsh.
- Rezan Mohammed Alharbi, who completed her PhD in 2016 at Newcastle University. Her thesis was entitled: Acquisition of Lexical Collocations: A corpus-assisted contrastive analysis and translation approach. This was a co-supervision with Mei Lin.
I am currently supervising:
- Emily Powell (co-supervision with Chris Heffer, ENCAP)
- Vigneshwaran Muralidaran (co-supervision with Irena Spasic, COMSC)
- Jennifer Jordan-Grote
- David Griffin (co-supervision with Chris Heffer, ENCAP)
Recent projects
- CI on NUCASE (Newcastle University Corpus of Academic Spoken English) – in collaboration with Cambridge University Press, funding also received from the British Council (2011-2014). Work carried out at Newcastle University.
- Research Fellow on Crowd Sourcing: A Toolkit-based Approach (2010-2011). RCUK Grant EP/G065802/1 Horizon Digital Economy Research. Work carried out at The University of Nottingham.
- Research Associate on DReSS II (Understanding Digital Records for eSocial Science) (2008-2011). ESRC Grant No. RES-149-25-1067. Work carried out at The University of Nottingham.
- Research Assistant on DReSS I (Understanding Digital Records for eSocial Science) (2005-2008). ESRC Grant No. RES-149-25-0035. RA on Headtalk (2005-2006). ESRC Grant No. RES-149-25-1016. Work carried out at The University of Nottingham.
Other projects I have been involved in include work with Cambridge University Press (CUP) on the English Profile (EP) Project. From 2009 to 2012 I was also involved in the construction of CANELC, the Cambridge and Nottingham e-Language Corpus (working with CUP and staff from the University of Nottingham), the first large-scale corpus of digital discourse.
Funding
- 2017: £2000 received (as PI) from the British Council in support of a launch event for the CorCenCC project (held on 28th February 2017).
- 2016-19: PI on the £1.8 million ESRC and AHRC funded CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction).
- 2016: £1600 received for the internally funded CUROP project entitled ‘Analysis on non-verbal communication in construction industry interactions’. I was CI on this project (with Mike Handford).
- 2014: £3850 received from the Newcastle University Faculty Research Fund for a project entitled ‘Crowdsourcing data collection for corpus compilation: Scoping methods for the future’ (with Patrick Olivier).
- 2013: £3900 received from the Newcastle University Faculty Bid Preparation Fund for Corpws Cenedlaethol Cymraeg (CorCenCS) to support the development of the bid application.
- 2013: £17,500 funding received from the British Council Aptis Research Grants for a project entitled ‘Characterising interactional competence in higher education small group talk’. I am a Co-I on this project with Steve Walsh (PI) and Paul Seedhouse.
- 2012: £3920 received from the Newcastle University Faculty Research Fund for a pilot project entitled ‘Gesture and talk “in the wild”’ (with Professor Olivier).
Biography
- 2015: Certificate in Advanced Studies in Academic Practice, Newcastle University
- 2004 – 2009: PhD in Applied Linguistics, University of Nottingham
- Thesis title: A multi-modal corpus approach to the analysis of backchanneling behaviour
- Funding: ESRC +3 award holder
- 2003 – 2004: MA in Applied Linguistics, University of Nottingham
- 2000 – 2003: BA in English Studies, University of Nottingham
Professional memberships
Editorial positions and other external activity
- Elected to stand as the General Secretary for BAAL, the British Association for Applied Linguistics (2013 - present). BAAL is a professional association based in the UK (with an international professional membership of just under 1000 members), which provides a forum for people interested in language and applied linguistics. Some responsibilities of this post include: communicating messages from outside bodies/individuals to the Chair, the EC or the membership (e.g. notice of meetings, research opportunities, etc.); responding where necessary or raising issues with the EC/Chair; and writing letters on behalf of BAAL to various bodies.
- Former Meetings Secretary for BAAL (2010-2013), where I was responsible for coordinating the organisation of the Association's annual conference. Prior to that, I was the Postgraduate Development and Liaison Officer for BAAL (2007-2009).
- Co-organiser of the IVACS (Inter-Varietal and Applied Corpus Studies) 2006 and IVACS 2014 conferences.
- Editor (with Professor Svenja Adolphs) of the Routledge Handbook of English Language and the Digital Humanities [under contract].
- Reviews Editor for the Yearbook of Corpus Linguistics and Pragmatics (Springer Verlag).
- Reviewer for International Journal of Corpus Linguistics (IJCL), Journal of Pragmatics, Context and Discourse, Corpora Journal and the BAAL annual book prize.
Membership of professional bodies and learned societies
- Associate Fellow of the Higher Education Academy (AFHEA), 2013 – present.
- Member, BAAL (British Association for Applied Linguistics).
- EC member, CRiLLS (Centre for Research in Linguistics and Language Sciences), 2011 – present
- Member, CRAL (Centre for Research in Applied Linguistics), 2006 – present
- Member, IVACS (Inter-Varietal Applied Corpus Studies), 2004 – present
- Member, AILA (International Association of Applied Linguistics), 2004 – present
- Member, Language Teaching and Technology; Language Learning and Teaching and iLaB (ICT) research clusters in ECLS, 2012 – present
Previous academic positions
- 2015 – present: Senior Lecturer in Applied Linguistics, Cardiff University
- 2014 – 2015: Senior Lecturer in Applied Linguistics, Newcastle University
- 2011 – 2014: Lecturer in Applied Linguistics, Newcastle University
- 2006 – 2011: Research Assistant (then Associate, then Fellow), The University of Nottingham
Committees and reviewing
- Member of the editorial board of Applied Linguistics (journal, 2021+)
- Ambassador for the Data Innovation Research Institute (DIRI) at Cardiff University. In this role I lead a special interest group (SIG) that facilitates deep interdisciplinary collaboration across the University in the field of data science (2018+).
- Member of the editorial board of Elements in Corpus Linguistics (book series) published by Cambridge University Press.
- Lead organiser and Chair of the online BAAL 2020 conference. Over 400 members of the association registered to take part in this conference.
- Lead organiser of the International Corpus Linguistics Conference (CL2019), a world-leading 5-day conference for academics working in this discipline (2018-2019).
- Member of the ESRC’s Centres for Doctoral Training (CDT) Peer Review College (2016+)
- Honorary Visiting Fellow at the Centre for Research in Applied Linguistics (CRAL), University of Nottingham (May-July 2018, during Research Leave)
- Visiting Researcher in the Department of English Language and Applied Linguistics, Swansea University (April–July 2018, during Research Leave)
- General Secretary of BAAL, the British Association for Applied Linguistics (2013 - 2018); Meetings Secretary for BAAL (2010-2013); Postgraduate Development and Liaison Officer for BAAL (2007-2009).
- Co-organiser of the IVACS (Inter-Varietal and Applied Corpus Studies) 2006 and IVACS 2014 conferences.
- Editor (with Professor Svenja Adolphs) of the Routledge Handbook of English Language and the Digital Humanities [under contract].
- Reviews Editor for the Yearbook of Corpus Linguistics and Pragmatics, 2012-2015 (Springer Verlag).
- Member of the editorial board of the journal Discourse, Context and Media
- Reviewer for International Journal of Corpus Linguistics (IJCL), Journal of Pragmatics, Context and Discourse, Corpora Journal and the BAAL annual book prize.
- Programme committee member: Workshop on Big Data and Natural Language Processing held at IEEE Big Data, December 2016.
- Programme committee member: 9th International Corpus Linguistics Conference, July 2017, University of Birmingham; joint workshop on Challenges in the Management of Large Corpora + Big Data and Natural Language Processing, July 2017, University of Birmingham.
- Member of the Editorial Advisory Board for the Journal of Corpus Linguistics and Pragmatics (Springer Verlag).
- Member of the advisory board for Language, Texts and Society (LTS), a journal produced at the University of Nottingham.
- Member of the Advisory Board for CLiC, a corpus tool for the analysis of literary texts, led by Professor Mahlberg, University of Birmingham (funded by the AHRC).
Supervision areas
- Corpus linguistics
- Corpus pragmatics
- Language use in context
- Non-verbal communication
- Discourse analysis
- Digital interaction ('E-language')
Current supervision
Past projects
In addition to the students listed above, I have also supervised the RAs involved in work on the CorCenCC, IVO and FreeTxt projects and co-supervised (at 50%) the following PhD students to completion:
- Vigneshwaran Muralidaran (with Irena Spasic, COMSC)
- Debora Cabral Lima (with Christopher Heffer, ENCAP)
- David Griffin (with Christopher Heffer, ENCAP)
- Emily Powell (with Christopher Heffer, ENCAP)
- Kate Barber (with Amanda Potts, ENCAP)