Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

TTS – A Treebank Tool Suite

Cahill, Aoife orcid logoORCID: 0000-0002-3519-7726 and van Genabith, Josef orcid logoORCID: 0000-0003-1322-7944 (2002) TTS – A Treebank Tool Suite. In: The Third International Conference on Language Resources and Evaluation, May 27th--June 2nd, 2002, Las Palmas de Grand Canaria, Spain.

Abstract
Treebanks are important resources in descriptive, theoretical and computational linguistic research, development and teaching. This paper presents a treebank tool suite (TTS) for and derived from the Penn-II treebank resource (Marcus et al, 1993). The tools include treebank inspection and viewing options which support search for CF-PSG rule tokens extracted from the treebank, graphical display of complete trees containing the rule instance, display of subtrees rooted by the rule instance and display of the yield of the subtree (with or without context). The search can be further restricted by constraining the yield to contain particular strings. Rules can be ordered by frequency and the user can set frequency thresholds. To process new text, the tool suite provides a PCFG chart parser (based on the CYK algorithm) operating on CFG grammars extracted from the treebank following the method of (Charniak, 1996) as well as a HMM bi-/trigram tagger trained on the tagged version of the treebank resource. The system is implemented in Java and Perl. We employ the InterArbora module based on the Thistle display engine (LTG, 2001) as our tree grapher.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Treebanks; TTS; Treebank Tool Suite
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16175
Deposited On:08 Jun 2011 10:32 by Shane Harper . Last Modified 21 Jan 2022 16:36
Documents

Full text available as:

[thumbnail of TTS_–_A_Treebank_Tool_Suite.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
390kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record