Acquisition des traductions de requêtes à partir de wikipédia pour la recherche d'information translingue
|Acquisition des traductions de requêtes à partir de wikipédia pour la recherche d'information translingue|
|Author(s)||Chakour H., Sadat F.|
|Published in||Vision 2020: Sustainable Growth, Economic Development, and Global Competitiveness - Proceedings of the 23rd International Business Information Management Association Conference, IBIMA 2014|
|Keyword(s)||Unknown (Extra: Competition, Computational linguistics, Information management, Search engines, Cross language information retrieval, Linguistic resources, Multiple tokens, Query translations, Transitive translation, Translation method, Weighting scheme, Wikipedia articles, Translation (languages))|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Acquisition des traductions de requêtes à partir de wikipédia pour la recherche d'information translingue is a 2014 conference paper written in French by Chakour H., Sadat F. and published in Vision 2020: Sustainable Growth, Economic Development, and Global Competitiveness - Proceedings of the 23rd International Business Information Management Association Conference, IBIMA 2014.
The multilingual encyclopedia Wikipedia has become a very useful resource for the construction and enrichment of linguistic resources, such as dictionaries and ontologies. In this study, we are interested by the exploitation of Wikipedia for query translation in Cross-Language Information Retrieval. An application is completed for the Arabic-English pair of languages. All possible translation candidates are extracted from the titles of Wikipedia articles based on the inter-links between Arabic and English; which is considered as direct translation. Furthermore, other links such as Arabic to French and French to English are exploited for a transitive translation. A slight stemming and segmentation of the query into multiple tokens can be made if no translation can be found for the entire query. Assessments monolingual and cross-lingual systems were conducted using three weighting schemes of the Lucene search engine (default, Tf-Idf and BM25). In addition, the performance of the so-called translation method was compared with those of GoogleTranslate and MyMemory.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.