Browse wiki

Jump to: navigation, search
Automatic taxonomy extraction in different languages using wikipedia and minimal language-specific information
Abstract Knowledge bases extracted from Wikipedia aKnowledge bases extracted from Wikipedia are particularly useful for various NLP and Semantic Web applications due to their co- verage, actuality and multilingualism. This has led to many approaches for automatic knowledge base extraction from Wikipedia. Most of these approaches rely on the English Wikipedia as it is the largest Wikipedia version. However, each Wikipedia version contains socio-cultural knowledge, i.e. knowledge with relevance for a specific culture or language. In this work, we describe a method for extracting a large set of hyponymy relations from the Wikipedia category system that can be used to acquire taxonomies in multiple languages. More specifically, we describe a set of 20 features that can be used for for Hyponymy Detection without using additional language-specific corpora. Finally, we evaluate our approach on Wikipedia in five different languages and compare the results with the WordNet taxonomy and a multilingual approach based on interwiki links of the Wikipedia.based on interwiki links of the Wikipedia.
Abstractsub Knowledge bases extracted from Wikipedia aKnowledge bases extracted from Wikipedia are particularly useful for various NLP and Semantic Web applications due to their co- verage, actuality and multilingualism. This has led to many approaches for automatic knowledge base extraction from Wikipedia. Most of these approaches rely on the English Wikipedia as it is the largest Wikipedia version. However, each Wikipedia version contains socio-cultural knowledge, i.e. knowledge with relevance for a specific culture or language. In this work, we describe a method for extracting a large set of hyponymy relations from the Wikipedia category system that can be used to acquire taxonomies in multiple languages. More specifically, we describe a set of 20 features that can be used for for Hyponymy Detection without using additional language-specific corpora. Finally, we evaluate our approach on Wikipedia in five different languages and compare the results with the WordNet taxonomy and a multilingual approach based on interwiki links of the Wikipedia.based on interwiki links of the Wikipedia.
Bibtextype inproceedings  +
Doi 10.1007/978-3-642-28604-9_4  +
Has author Dominguez Garcia R. + , Schmidt S. + , Rensing C. + , Steinmetz R. +
Has extra keyword Category systems + , Hyponymy + , Hyponymy relation + , Knowledge base + , Knowledge basis + , Multilingual approach + , Multiple languages + , Natural Language Processing + , Semantic web applications + , Wikipedia + , Wordnet + , Computational linguistics + , Knowledge based systems + , Natural language processing systems + , Taxonomies + , Text processing + , Websites +
Has keyword Hyponymy Detection + , Multilingual large-scale taxonomies + , Natural Language Processing + , Data mining +
Isbn 9783642286032  +
Issue PART 1  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 42–53  +
Published in Lecture Notes in Computer Science +
Title Automatic taxonomy extraction in different languages using wikipedia and minimal language-specific information +
Type conference paper  +
Volume 7181 LNCS  +
Year 2012 +
Creation dateThis property is a special property in this wiki. 6 November 2014 21:02:16  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 21:02:16  +
DateThis property is a special property in this wiki. 2012  +
hide properties that link here 
Automatic taxonomy extraction in different languages using wikipedia and minimal language-specific information + Title
 

 

Enter the name of the page to start browsing from.