Cross-language retrieval using link-based language models
|Cross-language retrieval using link-based language models|
|Author(s)||Roth B., Klakow D.|
|Published in||SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval|
|Keyword(s)||CLIR, Language modeling, LDA, Wikipedia (Extra: Bilingual tracks, CLIR, Cross-language retrieval, Language model, Language modeling, Latent Dirichlet allocation, LDA, On-machines, Term weighting, Training corpus, Translation models, Wikipedia, Computational linguistics, Information retrieval, Information theory, Natural language processing systems, Speech transmission, Translation (languages))|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Cross-language retrieval using link-based language models is a 2010 conference paper written in English by Roth B., Klakow D. and published in SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
We propose a cross-language retrieval model that is solely based on Wikipedia as a training corpus. The main contributions of our work are: 1. A translation model based on linked text in Wikipedia and a term weighting method associated with it. 2. A combination scheme to interpolate the link translation model with retrieval based on Latent Dirichlet Allocation. On the CLEF 2000 data we achieve improvement with respect to the best German-English system at the bilingual track (non-significant) and improvement against a baseline based on machine translation (significant).
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.