Unsupervised generation of parallel treebanks through sub-tree alignment
Zhechev, Ventsislav (2009) Unsupervised generation of parallel treebanks through sub-tree alignment. In: MT Marathon 2009, 26-30 January 2009, Prague, Czech Republic.
Full text available as:
The need for syntactically annotated data for use in natural language processing has increased dramatically
in recent years. This is true especially for parallel treebanks, of which very few exist. The ones
that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. In this
paper we introduce an open-source system for fast and robust automatic generation of parallel treebanks.
We expect the opening of the presented platform to the scientific community to help boost research
in the field of data-oriented machine translation and lead to advancements in other fields where
parallel treebanks can be employed.
Archive Staff Only: edit this record