Automatic acquisition of named entities for rule-based machine translation
Toral, AntonioORCID: 0000-0003-2357-2960 and Way, AndyORCID: 0000-0001-5736-5930
(2011)
Automatic acquisition of named entities for rule-based machine translation.
In: the Second International Workshop on Free/Open-Source Rule-Based Machine Translation., 20-21 Jan 2011, Barcelona (Spain).
This paper proposes to enrich RBMT dictionaries with Named Entities (NEs) automatically acquired from Wikipedia. The method is applied to the Apertium English–Spanish system and its performance compared to that of Apertium with and without handtagged NEs. The system with automatic NEs outperforms the one without NEs, while results vary when
compared to a system with handtagged NEs (results are comparable for Spanish! English but slightly worst for English!Spanish). Apart from that, adding automatic NEs contributes to decreasing the amount of unknown terms by more than 10%.
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Workshop
Refereed:
Yes
Uncontrolled Keywords:
rule based machine translation; RBMT; named entities; NE