An open-source toolkit for mining Wikipedia
Abstract The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. For developers and researchers it represents a giant multilingual database of concepts and semantic relations, a potential resource for natural language processing and many other research areas. This paper introduces the Wikipedia Miner toolkit, an open-source software system that allows researchers and developers to integrate Wikipedia's rich semantics into their own applications. The toolkit creates databases that contain summarized versions of Wikipedia's content and structure, and includes a Java API to provide access to them. Wikipedia's articles, categories and redirects are represented as classes, and can be efficiently searched, browsed, and iterated over. Advanced features include parallelized processing of Wikipedia dumps, machine-learned semantic relatedness measures and annotation features, and XML-based web services. Wikipedia Miner is intended to be a platform for sharing data mining techniques. © 2012 Elsevier B.V. All rights reserved.
Bibtextype article
Doi 10.1016/j.artint.2012.06.007
Has author Milne D., Witten I.H.
Has extra keyword Annotation, Disambiguation, Ontology Extraction, Semantic relatedness, Toolkit, Wikipedia, Miners, Natural language processing systems, Research, Semantics, Web services, Websites
Has keyword Annotation, Disambiguation, Ontology extraction, Semantic relatedness, Toolkit, Wikipedia
Issn 0004-3702
Language English
Number of citations by publication 1
Number of references by publication 0
Pages 222–239
Published in Artificial Intelligence
Title An open-source toolkit for mining Wikipedia
Type journal article
Volume 194
Year 2013
Creation date 7 November 2014 06:18:37
Categories Publications without license parameter, Publications without remote mirror parameter, Publications without archive mirror parameter, Publications without paywall mirror parameter, Journal articles, Publications without references parameter, Publications
Modification date 7 November 2014 06:18:37
Date 2013