Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Automatic extraction of Arabic multiword expressions

Attia, Mohammed, Tounsi, Lamia, Pecina, Pavel, van Genabith, Josef orcid logoORCID: 0000-0003-1322-7944 and Toral, Antonio (2010) Automatic extraction of Arabic multiword expressions. In: the 7th Conference on Language Resources and Evaluation (LREC 2010)., May 2010., Valletta (Malta)..

Abstract
In this paper we investigate the automatic acquisition of Arabic Multiword Expressions (MWE). We propose three complementary approaches to extract MWEs from available data resources. The first approach relies on the correspondence asymmetries between Arabic Wikipedia titles and titles in 21 different languages. The second approach collects English MWEs from Princeton WordNet 3.0, translates the collection into Arabic using Google Translate, and utilizes different search engines to validate the output. The third uses lexical association measures to extract MWEs from a large unannotated corpus. We experimentally explore the feasibility of each approach and measure the quality and coverage of the output against gold standards.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Arabic Multiword Expressions; MWE
Subjects:Computer Science > Machine translating
Computer Science > Information retrieval
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16155
Deposited On:07 Jun 2011 13:41 by Shane Harper . Last Modified 20 Jan 2022 16:05
Documents

Full text available as:

[thumbnail of Automatic_Extraction_of_Arabic_Multiword_Expressions.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
258kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record