Lynn, Teresa and Foster, Jennifer ORCID: 0000-0002-7789-4853 (2016) Universal dependencies for Irish. In: Second Celtic Language Technology Workshop. (CLTW 2016), 4 July 2016, Paris, France.
Abstract
Les ressources linguistiques permettant aux études cross-langues de se développer sont très importantes pour les langues minoritaires telles que l’irlandais, car elles favorisent le partage des ressources
pour palier au problème du manque de données. Le projet «Universal Dependencies » (UD) a pour
but de faciliter les études cross-langues des arbres syntaxiques, des structures linguistiques et de
l’analyse syntaxique. L’objectif principal de ce projet est de former un ensemble harmonieux d’arbres
syntaxiques en utilisant un schéma d’annotations universelles. Dans cet article, nous présentons
la transformation de l’arbre de dépendance syntaxique irlandais (IDT) (Lynn, 2016) au schéma
d’annotations universelles du projet UD, suivie d’une description claire des changements structurels
nécessaires à cette conversion. Le nouvel arbre est ainsi appelé « Irish Universal Dependency
Treebank » ( IUDT ).
Language resources that enable cross-lingual studies have become increasingly valuable for lesserresourced languages such as Irish, as they allow for easier sharing of resources, thus overcoming
the problem of data scarcity. The Universal Dependencies (UD) Project1
is an initiative aimed at
cross-lingual studies of treebanks, linguistic structures and parsing. Its goal is to create a set of
multilingual harmonised treebanks that are designed according to a universal annotation scheme. In
this paper, we report on the conversion of the Irish Dependency Treebank (IDT) (Lynn, 2016) to a
UD version of the treebank which we term the Irish Universal Dependency Treebank (IUDT). We
report on the mapping of the IDT labelling scheme to the UD scheme, along with a clear description
of the structural changes required in this conversion.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | Analyse syntaxique; irlandais; langue irlandaise; arbre de dépendance syntaxique; dépendances syntaxiques universelles; conversion; étiquettes; parsing; Irish; dependency treebank; universal dependencies; mapping; labels |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT |
Copyright Information: | © 2016 The Authors |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland (SFI) through the CNGL Programme (Grant 12/CE/I2267) at Dublin City University and supported by the ADAPT Centre for Digital Content Technology, Science Foundation Ireland (SFI) Research Centres Programme (Grant 13/RC/2016) and is co-funded by the European Regional Development Fund. |
ID Code: | 23607 |
Deposited On: | 31 Jul 2019 15:12 by Thomas Murtagh . Last Modified 31 Jul 2019 15:12 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
156kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record