Browse wiki

Jump to: navigation, search
A statistical approach for automatic keyphrase extraction
Abstract Due to availability of voluminous textual Due to availability of voluminous textual data either on the World Wide Web or in textual databases automatic keyphrase extraction has gained increasing popularity in recent past to summarize and characterize text documents. Consequently, a number of machine learning techniques, mostly supervised, have been proposed to mine keyphrases in an automatic way. But, the non-availability of annotated corpus for training such systems is the main hinder for their success. In this paper, we propose the design of an automatic keyphrase extraction system which uses NLP and statistical approach to mine keyphrases from unstructured text documents. The efficacy of the proposed system is established over texts crawled from Wikipedia server. On evaluation we found that the proposed method outperforms KEA which uses naïve Bayes classification technique for keyphrase extraction.cation technique for keyphrase extraction.
Abstractsub Due to availability of voluminous textual Due to availability of voluminous textual data either on the World Wide Web or in textual databases automatic keyphrase extraction has gained increasing popularity in recent past to summarize and characterize text documents. Consequently, a number of machine learning techniques, mostly supervised, have been proposed to mine keyphrases in an automatic way. But, the non-availability of annotated corpus for training such systems is the main hinder for their success. In this paper, we propose the design of an automatic keyphrase extraction system which uses NLP and statistical approach to mine keyphrases from unstructured text documents. The efficacy of the proposed system is established over texts crawled from Wikipedia server. On evaluation we found that the proposed method outperforms KEA which uses naïve Bayes classification technique for keyphrase extraction.cation technique for keyphrase extraction.
Bibtextype inproceedings  +
Has author Abulaish M. + , Jahiruddin + , Dey L. +
Has extra keyword Bayes classification + , Information extraction + , Keyphrase extraction + , Machine learning techniques + , NAtural language processing + , Statistical approach + , Text document + , Text mining + , Textual data + , Textual database + , Wikipedia + , Artificial intelligence + , Learning algorithms + , Learning systems + , Natural language processing systems + , World Wide Web + , Data mining +
Has keyword Information extraction + , Keyphrase extraction + , Natural Language Processing + , Text mining +
Isbn 9780972741286  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 1100–1112  +
Published in Proceedings of the 5th Indian International Conference on Artificial Intelligence, IICAI 2011 +
Title A statistical approach for automatic keyphrase extraction +
Type conference paper  +
Year 2011 +
Creation dateThis property is a special property in this wiki. 6 November 2014 16:39:56  +
Categories Publications without license parameter  + , Publications without DOI parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 16:39:56  +
DateThis property is a special property in this wiki. 2011  +
hide properties that link here 
A statistical approach for automatic keyphrase extraction + Title
 

 

Enter the name of the page to start browsing from.