Browse wiki

Jump to: navigation, search
Extracting Ontologies from Arabic Wikipedia: A Linguistic Approach
Abstract As one of the important aspects of semantiAs one of the important aspects of semantic web, building ontological models became a driving demand for developing a variety of semantic web applications. Through the years, much research was conducted to investigate the process of generating ontologies automatically from semi-structured knowledge sources such as Wikipedia. Different ontology building techniques were investigated, e.g., NLP tools and pattern matching, infoboxes and structured knowledge sources (Cyc and WordNet). Looking at the results of previous approaches we can see that the vast majority of employed techniques did not consider the linguistic aspect of Wikipedia. In this article, we present our solution to extract ontologies from Wikipedia using a linguistic approach based on the semantic field theory introduced by Jost Trier. Linguistic ontologies are significant in many applications for both linguists and Web researchers. We applied the proposed approach on the Arabic version of Wikipedia. The semantic relations were extracted from infoboxes, hyperlinks within infoboxes and list of categories that articles belong to. Our system successfully extracted approximately (760,000) triples from the Arabic Wikipedia. We conducted three experiments to evaluate the system output, namely: Validation Test, Crowd Evaluation and Domain Experts' evaluation. The system output achieved an average precision of 65 %.put achieved an average precision of 65 %.
Abstractsub As one of the important aspects of semantiAs one of the important aspects of semantic web, building ontological models became a driving demand for developing a variety of semantic web applications. Through the years, much research was conducted to investigate the process of generating ontologies automatically from semi-structured knowledge sources such as Wikipedia. Different ontology building techniques were investigated, e.g., NLP tools and pattern matching, infoboxes and structured knowledge sources (Cyc and WordNet). Looking at the results of previous approaches we can see that the vast majority of employed techniques did not consider the linguistic aspect of Wikipedia. In this article, we present our solution to extract ontologies from Wikipedia using a linguistic approach based on the semantic field theory introduced by Jost Trier. Linguistic ontologies are significant in many applications for both linguists and Web researchers. We applied the proposed approach on the Arabic version of Wikipedia. The semantic relations were extracted from infoboxes, hyperlinks within infoboxes and list of categories that articles belong to. Our system successfully extracted approximately (760,000) triples from the Arabic Wikipedia. We conducted three experiments to evaluate the system output, namely: Validation Test, Crowd Evaluation and Domain Experts' evaluation. The system output achieved an average precision of 65 %.put achieved an average precision of 65 %.
Bibtextype article  +
Doi 10.1007/s13369-013-0791-y  +
Has author Al-Rajebah N.I. + , Al-Khalifa H.S. +
Has keyword Linguistics + , Ontology + , Semantic field theory + , Wikipedia +
Issn 13198025  +
Issue 4  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 2749–2771  +
Published in Arabian Journal for Science and Engineering +
Title Extracting Ontologies from Arabic Wikipedia: A Linguistic Approach +
Type journal article  +
Volume 39  +
Year 2014 +
Creation dateThis property is a special property in this wiki. 6 November 2014 16:09:01  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Journal articles  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 16:09:01  +
DateThis property is a special property in this wiki. 2014  +
hide properties that link here 
Extracting Ontologies from Arabic Wikipedia: A Linguistic Approach + Title
 

 

Enter the name of the page to start browsing from.