Extracting structured information from Wikipedia articles to populate infoboxes
Abstract Roughly every third Wikipedia article contains an infobox - a table that displays important facts about the subject in attribute-value form. The schema of an infobox, i.e., the attributes that can be expressed for a concept, is defined by an infobox template. Often, authors do not specify all template attributes, resulting in incomplete infoboxes. With iPopulator, we introduce a system that automatically populates infoboxes of Wikipedia articles by extracting attribute values from the article's text. In contrast to prior work, iPopulator detects and exploits the structure of attribute values to independently extract value parts. We have tested iPopulator on the entire set of infobox templates and provide a detailed analysis of its effectiveness. For instance, we achieve an average extraction precision of 91% for 1,727 distinct infobox template attributes.
Bibtextype inproceedings
Doi 10.1145/1871437.1871698
Has author Lange D., Bohm C., Naumann F.
Has extra keyword Attribute values, Information extraction, Linked datum, Structured information, Wikipedia, Knowledge management, Information analysis
Has keyword Information extraction, Linked data, Wikipedia
Isbn 9781450300995
Language English
Number of citations by publication 0
Number of references by publication 0
Pages 1661–1664
Published in International Conference on Information and Knowledge Management, Proceedings
Title Extracting structured information from Wikipedia articles to populate infoboxes
Type conference paper
Year 2010
Creation date 7 November 2014 17:51:09
Categories Publications without license parameter, Publications without remote mirror parameter, Publications without archive mirror parameter, Publications without paywall mirror parameter, Conference papers, Publications without references parameter, Publications
Modification date 7 November 2014 17:51:09
Date 2010