Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Treebank-based acquisition of wide-coverage, probabilistic LFG resources: project overview, results and evaluation

Burke, Michael and Cahill, Aoife and O'Donovan, Ruth and van Genabith, Josef and Way, Andy (2004) Treebank-based acquisition of wide-coverage, probabilistic LFG resources: project overview, results and evaluation. In: IJCNLP-04 Workshop - The First International Joint Conference on Natural Language Processing, 21 March 2004, Sanya City, Hainan Island, China.

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


This paper presents an overview of a project to acquire wide-coverage, probabilistic Lexical-Functional Grammar (LFG) resources from treebanks. Our approach is based on an automatic annotation algorithm that annotates “raw” treebank trees with LFG f-structure information approximating to basic predicate-argument/dependency structure. From the f-structure-annotated treebank we extract probabilistic unification grammar resources. We present the annotation algorithm, the extraction of lexical information and the acquisition of wide-coverage and robust PCFG-based LFG approximations including long-distance dependency resolution. We show how the methodology can be applied to multilingual, treebank-based unification grammar acquisition. Finally we show how simple (quasi-)logical forms can be derived automatically from the f-structures generated for the treebank trees.

Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Uncontrolled Keywords:lexical-functional grammar;
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Enterprise Ireland, EI SC/2001/186, Irish Research Council for Science Engineering and Technology
ID Code:15302
Deposited On:15 Mar 2010 11:11 by DORAS Administrator. Last Modified 28 Apr 2010 11:22

Download statistics

Archive Staff Only: edit this record