Chinese text filtering based on domain keywords extracted from Wikipedia


Chinese text filtering based on domain keywords extracted from Wikipedia is a 2013 conference paper written in English by Wang X., Li H., Jia Y., and Jin S., published in Lecture Notes in Electrical Engineering.

Abstract

Several machine learning and information retrieval algorithms have been used for text filtering. All of these methods share one requirement: they need positive and negative examples to build a user profile. However, not every application can obtain good training documents. In this paper, we present a Wikipedia-based method that builds a user profile without any additional training documents. The proposed method extracts the keywords of a specific category from the Wikipedia taxonomy and computes the weights of the extracted keywords from Wikipedia pages. Experimental results on the Chinese news text dataset SogouC show that the proposed method achieves good performance.
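
The abstract does not give the extraction or weighting formulas, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a toy in-memory category graph (`SUBCATEGORIES`, `PAGES_IN_CATEGORY`, `PAGE_TEXT` are hypothetical stand-ins for Wikipedia data), takes page titles under a category as keywords, weights each keyword by the fraction of domain pages that mention it, and scores a document by the average keyword weight over its tokens. Real Chinese text would first need word segmentation; here the tokens are assumed to be already split.

```python
from collections import deque

# Hypothetical toy data standing in for the Wikipedia taxonomy and page texts.
SUBCATEGORIES = {"Sports": ["Football", "Basketball"], "Football": [], "Basketball": []}
PAGES_IN_CATEGORY = {
    "Sports": ["athlete"],
    "Football": ["goalkeeper", "penalty"],
    "Basketball": ["dunk"],
}
PAGE_TEXT = {
    "athlete": "athlete penalty training",
    "goalkeeper": "goalkeeper penalty goalkeeper save",
    "penalty": "penalty goalkeeper referee",
    "dunk": "dunk athlete basket",
}


def extract_keywords(root, max_depth=2):
    """Collect page titles under `root`, walking subcategories breadth-first."""
    keywords, seen, frontier = set(), {root}, deque([(root, 0)])
    while frontier:
        category, depth = frontier.popleft()
        keywords.update(PAGES_IN_CATEGORY.get(category, []))
        if depth < max_depth:
            for sub in SUBCATEGORIES.get(category, []):
                if sub not in seen:
                    seen.add(sub)
                    frontier.append((sub, depth + 1))
    return keywords


def keyword_weights(keywords):
    """Assumed weighting: fraction of domain pages whose text mentions the keyword."""
    pages = [set(PAGE_TEXT[k].split()) for k in keywords if k in PAGE_TEXT]
    if not pages:
        return {}
    return {kw: sum(kw in page for page in pages) / len(pages) for kw in keywords}


def score(tokens, weights):
    """Average keyword weight over the document's tokens (0.0 for an empty document)."""
    if not tokens:
        return 0.0
    return sum(weights.get(token, 0.0) for token in tokens) / len(tokens)


def accept(tokens, weights, threshold=0.2):
    """Keep a document when its score reaches the (assumed) threshold."""
    return score(tokens, weights) >= threshold


if __name__ == "__main__":
    profile = keyword_weights(extract_keywords("Sports"))
    print(accept("goalkeeper saves penalty".split(), profile))
    print(accept("stock market falls".split(), profile))
```

The point of the sketch is that the user profile (`profile`) is built from the category data alone, with no labelled positive or negative documents; the threshold and the weighting scheme are illustrative choices that would need tuning on real data.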

