Morphological features of the Irish universal dependency treebank
Lynn, Teresa, Foster, Jennifer (ORCID: 0000-0002-7789-4853) and Dras, Mark (2017) Morphological features of the Irish universal dependency treebank.
In: 15th International Workshop on Treebanks and Linguistic Theories (TLT15), 20-21 Jan 2017, Bloomington, IN, USA.
The Universal Dependencies Project (Nivre, [9]; Nivre et al., [10]) is an
ongoing effort towards creating a set of harmonised dependency treebanks
that are annotated and structured according to universal guidelines. This paper reports on the addition of morphological features to the Irish Universal
Dependencies Treebank (IUDT). Our feature set subscribes to the feature inventory of the UD Project and has been mapped from Irish morpho-syntactic
tags – the output of a Finite State Morphological Analyser for Irish (Uí Dhonnchadha and van Genabith [16]). Irish, a Celtic language, has some relatively unusual morphological features that require language-specific labels
not covered by the universal feature set. In this paper, we summarise the
Irish-specific features that we have added to this set by explaining the linguistic properties that they each describe. We also report on the first parsing
experiments using the IUDT by assessing the effect that the inclusion of morphological features has on parsing accuracy.
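The tag-to-feature mapping described above can be pictured with a minimal sketch. The input tag names, the lookup table, and the function `to_ud_feats` are hypothetical and are not taken from the paper or from the analyser's actual tagset; the sketch only assumes that each analyser tag corresponds to at most one UD `Feature=Value` pair and that the result is serialised in the CoNLL-U FEATS style (pairs joined by `|`, sorted by feature name, `_` when empty).

```python
# Hypothetical lookup table: analyser-style tags -> UD (feature, value) pairs.
# The left-hand tag names are illustrative assumptions, not the real tagset.
TAG_TO_FEATURE = {
    "Masc": ("Gender", "Masc"),
    "Fem": ("Gender", "Fem"),
    "Sg": ("Number", "Sing"),
    "Pl": ("Number", "Plur"),
    "Gen": ("Case", "Gen"),
    "Len": ("Form", "Len"),  # lenition, a language-specific value (assumed label)
    "Ecl": ("Form", "Ecl"),  # eclipsis, a language-specific value (assumed label)
}


def to_ud_feats(tags):
    """Convert a list of analyser tags into a CoNLL-U style FEATS string."""
    pairs = {}
    for tag in tags:
        # Tags with no feature counterpart (e.g. part-of-speech tags) are skipped.
        if tag in TAG_TO_FEATURE:
            name, value = TAG_TO_FEATURE[tag]
            pairs[name] = value
    if not pairs:
        return "_"
    # FEATS entries are sorted alphabetically by feature name, ignoring case.
    ordered = sorted(pairs.items(), key=lambda item: item[0].lower())
    return "|".join(f"{name}={value}" for name, value in ordered)


if __name__ == "__main__":
    print(to_ud_feats(["Noun", "Fem", "Sg", "Gen", "Len"]))
```

A table-driven design keeps universal and language-specific labels in one place, so adding an Irish-specific value would only require a new table entry under this assumption.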
Dickinson, Markus, Hajič, Jan, Kübler, Sandra and Przepiórkowski, Adam (eds.) Proceedings of the 15th International Workshop on Treebanks and Linguistic Theories (TLT15). CEUR Workshop Proceedings 1779. CEUR-WS.
This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders: ADAPT Centre for Digital Content Technology (www.adaptcentre.ie) at Dublin City University, funded under the SFI Research Centres Programme (Grant 13/RC/2106) and co-funded under the European Regional Development Fund.
ID Code: 23609
Deposited On: 31 Jul 2019 11:24 by Thomas Murtagh. Last Modified 27 May 2020 11:00