DCU-Paris13 systems for the SANCL 2012 shared task
Le Roux, Joseph, Foster, JenniferORCID: 0000-0002-7789-4853, Wagner, JoachimORCID: 0000-0002-8290-3849, Samad Zadeh Kaljahi, Rasoul and Bryl, Anton
(2012)
DCU-Paris13 systems for the SANCL 2012 shared task.
In: The NAACL 2012 First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL), 7-8 Jun 2012, Montreal, Quebec, Canada.
The DCU-Paris13 team submitted three systems to the SANCL 2012 shared task on parsing English web text. The first submission, the highest ranked constituency parsing system, uses a combination of PCFG-LA product grammar parsing and self-training. In the second submission, also a constituency parsing system, the n-best lists of various parsing models are combined using an approximate sentence-level product model. The third system, the highest ranked system in the dependency parsing track, uses voting over dependency arcs to combine the output of three constituency parsing systems which have been converted to dependency trees. All systems make use of a data-normalisation component, a parser accuracy predictor and a genre classifier.
Metadata
Item Type:
Conference or Workshop Item (Poster)
Event Type:
Workshop
Refereed:
No
Uncontrolled Keywords:
natural language processing; parsing; user-generated content