Browse wiki

Jump to: navigation, search
Constructing a focused taxonomy from a document collection
Abstract We describe a new method for constructing We describe a new method for constructing custom taxonomies from document collections. It involves identifying relevant concepts and entities in text; linking them to knowledge sources like Wikipedia, DBpedia, Freebase, and any supplied taxonomies from related domains; disambiguating conflicting concept mappings; and selecting semantic relations that best group them hierarchically. An RDF model supports interoperability of these steps, and also provides a flexible way of including existing NLP tools and further knowledge sources. From 2000 news articles we construct a custom taxonomy with 10,000 concepts and 12,700 relations, similar in structure to manually created counterparts. Evaluation by 15 human judges shows the precision to be 89% and 90% for concepts and relations respectively; recall was 75% with respect to a manually generated taxonomy for the same domain.ly generated taxonomy for the same domain.
Abstractsub We describe a new method for constructing We describe a new method for constructing custom taxonomies from document collections. It involves identifying relevant concepts and entities in text; linking them to knowledge sources like Wikipedia, DBpedia, Freebase, and any supplied taxonomies from related domains; disambiguating conflicting concept mappings; and selecting semantic relations that best group them hierarchically. An RDF model supports interoperability of these steps, and also provides a flexible way of including existing NLP tools and further knowledge sources. From 2000 news articles we construct a custom taxonomy with 10,000 concepts and 12,700 relations, similar in structure to manually created counterparts. Evaluation by 15 human judges shows the precision to be 89% and 90% for concepts and relations respectively; recall was 75% with respect to a manually generated taxonomy for the same domain.ly generated taxonomy for the same domain.
Bibtextype inproceedings  +
Doi 10.1007/978-3-642-38288-8-25  +
Has author Olena Medelyan + , Manion S. + , Broekstra J. + , Divoli A. + , Huang A.-L. + , Witten I.H. +
Has extra keyword Concept mapping + , Document collection + , Knowledge sources + , News articles + , NLP tools + , RDF model + , Semantic relations + , Wikipedia + , Semantic web + , Taxonomies +
Isbn 9783642382871  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 367–381  +
Published in Lecture Notes in Computer Science +
Title Constructing a focused taxonomy from a document collection +
Type conference paper  +
Volume 7882 LNCS  +
Year 2013 +
Creation dateThis property is a special property in this wiki. 7 November 2014 09:39:25  +
Categories Publications without keywords parameter  + , Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 09:39:25  +
DateThis property is a special property in this wiki. 2013  +
hide properties that link here 
Constructing a focused taxonomy from a document collection + Title
 

 

Enter the name of the page to start browsing from.