Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

A Part-of-Speech tagger for Irish using finite state morphology and constraint grammar disambiguation

Uí Dhonnchadha, Elaine and van Genabith, Josef (2006) A Part-of-Speech tagger for Irish using finite state morphology and constraint grammar disambiguation. In: LREC 2006, May 2006, Genoa.

Abstract
This paper describes the methodology used to develop a part-of-speech tagger for Irish, which is used to annotate a corpus of 30 million words of text with part-of-speech tags and lemmas. The tagger is evaluated using a manually disambiguated test corpus and it currently achieves 95% accuracy on unrestricted text. To our knowledge, this is the first part-of-speech tagger for Irish.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:E-Learning; finite state morphology; constraint grammar disambiguation
Subjects:Social Sciences > Educational technology
DCU Faculties and Centres:Research Institutes and Centres > National Centre for Language Technology (NCLT)
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16367
Deposited On:02 Jun 2011 08:42 by Shane Harper . Last Modified 14 Oct 2016 10:00
Documents

Full text available as:

[thumbnail of A_Part-of-Speech_Tagger_for_Irish_using_Finite_State_Morphology_and_Constraint_Grammar_Disambiguation.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
347kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record