WordNet Gloss Translation for Under-resourced Languages using Multilingual Neural Machine Translation
Authors:
Bharathi Raja, Mihael Arcan, John McCrae
Publication Type:
Refereed Conference Meeting Proceeding
Abstract:
In this paper, we translate the glosses in the English WordNet based on the expand approach for improving and generating wordnets with the help of multilingual neural machine translation. Neural Machine Translation (NMT) has recently been applied to many tasks in natural language processing, leading to state-of-the-art performance. However, the performance of NMT often suffers from low resource scenarios where large corpora cannot be obtained. Using training data from closely related language have proven to be invaluable for improving performance. In this paper, we describe how we trained multilingual NMT from closely related language utilizing phonetic transcription for Dravidian languages. We report the evaluation result of the generated wordnets sense in terms of precision. By comparing to the recently proposed approach, we show improvement in terms of precision.
Conference Name:
Second Workshop on Multilingualism at the intersection of Knowledge Bases and Machine Translation co-located with Machine Translation Summit
(MT-Summit 2019), Dublin, Ireland
Proceedings:
European Association for Machine Translation
Digital Object Identifer (DOI):
10.xx
Publication Date:
19/08/2019
Volume:
Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation
Conference Location:
Ireland
Research Group:
Linked Data
Institution:
National University of Ireland, Galway (NUIG)
Project Acknowledges:
European Lexicographic Infrastructure
Open access repository:
Yes