A Part-of-Speech tagger for Irish using finite state morphology and constraint grammar disambiguation
Uí Dhonnchadha, Elaine and van Genabith, Josef
(2006)
A Part-of-Speech tagger for Irish using finite state morphology and constraint grammar disambiguation.
In: LREC 2006, May 2006, Genoa.
This paper describes the methodology used to develop a part-of-speech tagger for Irish, which is used to annotate a corpus of 30 million words of text with part-of-speech tags and lemmas. The tagger is evaluated using a manually disambiguated test corpus and it currently achieves 95% accuracy on unrestricted text. To our knowledge, this is the first part-of-speech tagger for Irish.
Metadata
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Conference
Refereed:
Yes
Uncontrolled Keywords:
E-Learning; finite state morphology; constraint grammar disambiguation