Djoerd Hiemstra

From WikiPapers
Jump to: navigation, search

Djoerd Hiemstra is an author.

Publications

Only those publications related to wikis are shown here.
Title Keyword(s) Published in Language DateThis property is a special property in this wiki. Abstract R C
WikiTranslate: Query translation for cross-lingual information retrieval using only wikipedia Comparable corpus
Cross-lingual information retrieval
Query translation
Wikipedia
Word sense disambiguation
Lecture Notes in Computer Science English 2009 This paper presents WikiTranslate, a system which performs query translation for cross-lingual information retrieval (CLIR) using only Wikipedia to obtain translations. Queries are mapped to Wikipedia concepts and the corresponding translations of these concepts in the target language are used to create the final query. WikiTranslate is evaluated by searching with topics formulated in Dutch, French and Spanish in an English data collection. The system achieved a performance of 67% compared to the monolingual baseline. 0 0
WikiTranslate: query translation for cross-lingual information retrieval using only Wikipedia Wikipedia
Comparable corpus
Cross-lingual information retrieval
Query translation
Word sense disambiguation
CLEF English 2009 0 0
Structured document retrieval, multimedia retrieval, and entity ranking using PF/Tijah Lecture Notes in Computer Science English 2008 CWI and University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multimedia retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and multimedia elements in the Wikipedia data, we investigated various length priors and found that biasing towards longer elements than the ones retrieved by our language modelling approach can be useful. For retrieving images in isolation, we found that their associated text is a very good source of evidence in the Wikipedia collection. For the entity ranking task, we used random walks to model multi-step relevance propagation from the articles describing entities to all related entities and further, and obtained promising results. 0 0
Evaluating structured information retrieval and multimedia retrieval using PF/Tijah Lecture Notes in Computer Science English 2007 We used a flexible XML retrieval system for evaluating structured document retrieval and multimedia retrieval tasks in the context of the INEX 2006 benchmarks. We investigated the differences between article and element retrieval for Wikipedia data as well as the influence of an elements context on its ranking. We found that article retrieval performed well on many tasks and that pinpointing the relevant passages inside an article may hurt more than it helps. We found that for finding images in isolation the associated text is a very good descriptor in the Wikipedia collection, but we were not very succesful at identifying relevant multimedia fragments consisting of a combination of text and images. 0 0