Chinese text filtering based on domain keywords extracted from Wikipedia
|Chinese text filtering based on domain keywords extracted from Wikipedia|
|Author(s)||Wang X., Li H., Jia Y., Jin S.|
|Published in||Lecture Notes in Electrical Engineering|
|Keyword(s)||Text filtering, User profile, Wikipedia (Extra: Chinese text, Common ground, Negative examples, Text filtering, Training documents, User profile, Wikipedia, Software engineering, World Wide Web, Information technology)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Chinese text filtering based on domain keywords extracted from Wikipedia is a 2013 conference paper written in English by Wang X., Li H., Jia Y., Jin S. and published in Lecture Notes in Electrical Engineering.
Several machine learning and information retrieval algorithms have been used for text filtering. All these methods have a common ground that they need positive and negative examples to build user profile. However, not all applications can get good training documents. In this paper, we present a Wikipedia based method to build user profile without any other training documents. The proposed method extracts keywords of a special category from Wikipedia taxonomy and computes the weights of the extracted keywords based on Wikipedia pages. Experiment results on Chinese news text dataset SogouC show that the proposed method achieves good performance.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.