Text classification using Wikipedia knowledge
|Text classification using Wikipedia knowledge|
|Author(s)||Su C., Yanne P., Zhang Y.|
|Published in||ICIC Express Letters, Part B: Applications|
|Keyword(s)||Semi-supervised learning, Text classification, Wikipedia (Extra: Data sets, Labeled documents, Semi-supervised learning, Text classification, Text classifiers, Text document, Wikipedia, Supervised learning, Websites)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of journal articles|
In the real world, there are large amounts of unlabeled text documents, but traditional approaches usually require a lot of labeled documents, which are expensive to obtain. In this paper we propose an approach using the Wikipedia for text classification. We firstly extract the related wiki documents with the given keywords, then label the documents with the representative features selected from the related wiki documents, and finally build an SVM text classifier. Experimental results on 20-Newsgroup dataset show that the proposed method performs well and stably.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.