Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Parsing Arabic using treebank-based LFG resources

Tounsi, Lamia and Attia, Mohammed and van Genabith, Josef (2009) Parsing Arabic using treebank-based LFG resources. In: Lexical Functional Grammar 2009, 13-16 July 2009, Cambridge, UK.

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


In this paper we present initial results on parsing Arabic using treebank-based parsers and automatic LFG f-structure annotation methodologies. The Arabic Annotation Algorithm (A3) (Tounsi et al., 2009) exploits the rich functional annotations in the Penn Arabic Treebank (ATB) (Bies and Maamouri, 2003; Maamouri and Bies, 2004) to assign LFG f-structure equations to trees. For parsing, we modify Bikel’s (2004) parser to learn ATB functional tags and merge phrasal categories with functional tags in the training data. Functional tags in parser output trees are then "unmasked" and available to A3 to assign f-structure equations. We evaluate the resulting f-structures against the DCU250 Arabic gold standard dependency bank (Al-Raheb et al., 2006). Currently we achieve a dependency f-score of 77%.

Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Uncontrolled Keywords:lexical functional grammar; parsing; Arabic;
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in:Proceedings of the LFG09 Conference. . CSLI Publications.
Publisher:CSLI Publications
Official URL:
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:15147
Deposited On:12 Feb 2010 14:16 by DORAS Administrator. Last Modified 12 Feb 2010 14:16

Download statistics

Archive Staff Only: edit this record