Academic Profile

I am a Research Fellow at the Data Science Institute (DSI) and an Adjunct Lecturer at the School of Computer Science at the University of Galway. I work in the Unit for Natural Language Processing (UNLP) under the supervision of Dr Paul Buitelaar, where my main research focuses on injecting terminology and knowledge graphs into neural machine translation architectures. More recently, I have also been following work on dialogue systems and natural language generation with multi-modal data. I am primarily funded by the nationwide Insight SFI Research Centre for Data Analytics, which is also hosted within DSI.

Interests: • Natural Language Processing • Machine Translation • Dialogue Systems • Knowledge Graphs

Activities

Co-supervision of PhD and Master's Students

2017-
  • Tunde Ajayi (2022-/PhD/with Paul Buitelaar) on Empathetic Dialogue Systems
  • Ali Hatami (2021-/PhD/with Paul Buitelaar) on Multi-Modal (Speech/Video) Machine Translation
  • Dhairya Dalal (2020-/PhD/with Paul Buitelaar) on Causal Reasoning
  • Nivranshu Pasricha (2019-/PhD/with Paul Buitelaar) on Natural Language Generation from Structured Data
  • Shardul Suryawanshi (2019-/PhD/with Paul Buitelaar) on Offensive Content Detection
  • Rajdeep Sarkar (2019-/PhD/with John P. McCrae) on incorporating Semantic Technologies into Dialogue Systems
  • Jennifer Farrell (2020/MSc) on Detecting Dementia from Linguistic Characteristics using Natural Language Processing Techniques
  • Bharathi Raja Asoka Chakravarthi (2016-2020/PhD/with John P. McCrae) on Machine Translation for Under-Resourced Languages
  • Fionn Delahunty (2019/MSc/with Robert Johansson) on Passive Diagnosis of Mental Health Disorders

Teaching - as part of the MSc in Computer Science @ University of Galway - Data Analytics and AI.

2016-2022
  • Advanced Natural Language Processing (2018/2019-2021/2022)
  • Introduction to Natural Language Processing (2016/2017, 2017/2018)

Event Organisation

  • Organising Committee ESSLLI 2022
  • Workshop on Multilingualism at the intersection of Knowledge Bases and Machine Translation (MomenT):
    @LREC 2018 (proceedings), @MT-Summit 2019 (proceedings)
  • Local organiser of Language, Data and Knowledge (LDK) 2017, Galway, Ireland
  • Local organiser ESSLLI 2011

Program Committee

Invited Talks

  • Presentation to the MLT department at DFKI, Saarbrücken, Germany on Utilising of Terminological Expressions in Knowledge Bases and Knowledge Graphs for Neural Machine Translation. 2021. Online.
  • Workshop #1 on NER in the Context of MT (CEF Automated Translation) on Knowledge Graphs and NMT. 2020. Online.
  • Meet Central Europe (MCE 2018) on The Neural Age of Machine Translation. 2018. Budapest, Hungary.
  • Translation Technology Terminology Conference (TTT-2014) on Statistical Machine Translation and Terminology. 2014. Bled, Slovenia.
  • Translation Technology Terminology Conference (TTT-2013) on Statistical Machine Translation. 2013. Zadar, Croatia.

Projects

Passive Prediction - The project focuses on combining proven psychological diagnostic methods with AI. Passive Prediction is a privacy-aware mental health diagnostic sensor that analyses our written forms of expression to catch early indicators of mental health issues (Funder: Enterprise Ireland; role: PI, 2021-)
Huawei Project - The project focuses on Knowledge Graphs for Intent Classification in Goal-Oriented Dialog Systems. (Funder: Industry Partner/Science Foundation Ireland; role: Project Management; Scientific Leads/PIs: Paul Buitelaar/John McCrae, 2020-2022)
DANTE Project - The project focuses on terminology identification in the pharmaceutical domain and the creation of multilingual terminological resources using neural machine translation approaches. (Funder: Industry Partner/Science Foundation Ireland; role: PI, 2021-2021)
NURS - The project focused on neural machine translation (NMT) for under-resourced scenarios, i.e. languages or technical domains, using sequence-to-sequence models (Funder: Industry Partner/Science Foundation Ireland; role: PI, 2018-2019)
MixedEmotions - The project aimed to develop innovative multilingual multi-modal Big Data analytics applications (Funder: EU FP7; role: PhD student, 2015-2017)
LIDER - The project aimed to provide the basis of a Linguistic Linked Data cloud for content analytics tasks (Funder: EU FP7; role: PhD student, 2013-2015)
EuroSentiment - The project aimed to develop a shared data pool for multilingual sentiment analysis systems (Funder: EU FP7; role: PhD student, 2012-2014)
Monnet - The project provided a semantics-based solution for accessing information across language barriers (Funder: EU FP7; role: PhD student, 2011-2013)

Demos

Marvin - a conversational chatbot with major depressive disorder detection.
GrumpyBot - a sequence-to-sequence deep learning chatbot using linked data.
IRIS - an SMT/NMT system translating from English to (less-resourced) Irish and vice versa.
ASISTENT - an SMT/NMT system translating between English and morphologically rich South Slavic languages.
Insight META System - an SMT system accessible through an API request, supporting all official European Union languages.
OTTO - an SMT system for multilingual enhancement of ontologies. (CURRENTLY OFFLINE)
TeTra - a system for extracting and translating specific vocabulary. (CURRENTLY OFFLINE)

Resources

BitterCorpus – Bilingual IT Terminology Annotated Corpus - annotated corpora for the evaluation of monolingual and bilingual domain-specific term extraction (with HLT FBK).
PE²rr - PostEdited and ERRor annotated corpus, covering machine translations, their post-edited versions and error annotations of the performed edit operations.
Polylingual WordNet - extends WordNet to 23 languages by automatic translation and is released both as OntoLex JSON-LD and in the Global WordNet LMF format. This resource is available for re-use under the Creative Commons Attribution 4.0 License.

Publications