Browse wiki

Jump to: navigation, search
Learning to link with Wikipedia
Abstract This paper describes how to automatically This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles. The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%. This performance is constant whether the system is evaluated on Wikipedia articles or "real world" documents. This work has implications far beyond enriching documents with explanatory links. It can provide structured knowledge about any unstructured fragment of text. Any task that is currently addressed with bags of words—indexing, clustering, retrieval, and summarization to name a few—could use the techniques described here to draw on a vast network of concepts and semantics. a vast network of concepts and semantics.
Abstractsub This paper describes how to automatically This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles. The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%. This performance is constant whether the system is evaluated on Wikipedia articles or "real world" documents. This work has implications far beyond enriching documents with explanatory links. It can provide structured knowledge about any unstructured fragment of text. Any task that is currently addressed with bags of words—indexing, clustering, retrieval, and summarization to name a few—could use the techniques described here to draw on a vast network of concepts and semantics. a vast network of concepts and semantics.
Bibtextype misc  +
Citeulike 3849424  +
Doi 10.1145/1458082.1458150  +
Has author David N. Milne + , Ian H. Witten +
Has remote mirror http://www.cs.waikato.ac.nz/~ihw/papers/08-DNM-IHW-LearningToLinkWithWikipedia.pdf  +
Language English +
Number of citations by publication 2  +
Number of references by publication 0  +
Pages 509-518  +
Title Learning to link with Wikipedia +
Type unknown  +
Year 2008 +
Creation dateThis property is a special property in this wiki. 28 January 2012 20:46:49  +
Categories Publications without published in parameter  + , Publications without keywords parameter  + , Publications without license parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 February 2012 20:53:19  +
DateThis property is a special property in this wiki. 2008  +
hide properties that link here 
A Cross-Lingual Dictionary for English Wikipedia Concepts + , Semantic Content Filtering with Wikipedia and Ontologies + Has reference
Learning to link with Wikipedia + Title
 

 

Enter the name of the page to start browsing from.