Short-text domain specific key terms/phrases extraction using an n-gram model with Wikipedia

Short-text domain specific key terms/phrases extraction using an n-gram model with Wikipedia is a 2012 conference paper written in English by Qureshi M.A., O'Riordan C., and Pasi G., and published in the ACM International Conference Proceeding Series.

Abstract

Finding domain specific key terms/phrases from a given set of documents is a challenging task. A domain may be defined as an area of interest over a collection of documents which may not be explicitly defined but is implicitly observable in those documents. When considering a collection of documents related to academic research, examples of key terms/phrases may be "Information Retrieval", "Marine Biology", etc. In this paper a technique for extracting important key terms/phrases in a considered topical domain is proposed using external evidence from the titles of Wikipedia articles and the Wikipedia category graph. We performed some experiments over the document collection of Web sites of different post-graduate schools. Our preliminary evaluations show promising results for the detection of domain specific key terms/phrases from the given set of domain focused Web pages.
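The general idea of matching n-grams from a short text against Wikipedia article titles can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `max_n` parameter, and the tiny `SAMPLE_TITLES` list are all hypothetical, and the sketch leaves out the Wikipedia category graph that the paper uses to decide whether a term belongs to the domain.

```python
import re


def ngrams(tokens, n):
    """Return every contiguous run of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def extract_key_phrases(text, titles, max_n=3):
    """Return n-grams of the text that match a known title, longest n first.

    Assumption: titles is a small in-memory collection standing in for
    the full set of Wikipedia article titles.
    """
    title_set = {t.lower() for t in titles}
    # Simple lowercase alphanumeric tokenization (an illustrative choice).
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    found = []
    for n in range(max_n, 0, -1):
        for gram in ngrams(tokens, n):
            phrase = " ".join(gram)
            if phrase in title_set and phrase not in found:
                found.append(phrase)
    return found


# Hypothetical stand-in for a set of Wikipedia article titles.
SAMPLE_TITLES = ["Information retrieval", "Marine biology", "Biology"]

phrases = extract_key_phrases(
    "The school offers courses in information retrieval and marine biology.",
    SAMPLE_TITLES,
)
print(phrases)
```

Longer n-grams are checked first so that multi-word phrases are listed ahead of their single-word parts. A fuller treatment would also need a step that filters out matches unrelated to the target domain, which is the role the abstract assigns to the category graph.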

Cited by

This publication has been cited 2 times, but no articles for the citing works are available in WikiPapers.