Semi-automatic extraction and modeling of ontologies using wikipedia XML corpus

From WikiPapers
Jump to: navigation, search

Semi-automatic extraction and modeling of ontologies using wikipedia XML corpus is a 2009 conference paper written in English by De Silva L., Jayaratne L. and published in 2nd International Conference on the Applications of Digital Information and Web Technologies, ICADIWT 2009.

[edit] Abstract

This paper introduces WikiOnto: a system that assists in the extraction and modeling of topic ontologies in a semi-automatic manner using a preprocessed document corpus derived from Wikipedia. Based on the Wikipedia XML Corpus, we present a three-tiered framework for extracting topic ontologies in quick time and a modeling environment to refine these ontologies. Using Natural Language Processing (NLP) and other Machine Learning (ML) techniques along with a very rich document corpus, this system proposes a solution to a task that is generally considered extremely cumbersome. The initial results of the prototype suggest strong potential of the system to become highly successful in ontology extraction and modeling and also inspire further research on extracting ontologies from other semi-structured document corpora as well.

[edit] References

This section requires expansion. Please, help!

Cited by

Probably, this publication is cited by others, but there are no articles available for them in WikiPapers. Cited 2 time(s)