Browse wiki

Jump to: navigation, search
Automatic subject metadata generation for scientific documents using wikipedia and genetic algorithms
Abstract Topical annotation of documents with keyphTopical annotation of documents with keyphrases is a proven method for revealing the subject of scientific and research documents. However, scientific documents that are manually annotated with keyphrases are in the minority. This paper describes a machine learning-based automatic keyphrase annotation method for scientific documents, which utilizes Wikipedia as a thesaurus for candidate selection from documents' content and deploys genetic algorithms to learn a model for ranking and filtering the most probable keyphrases. Reported experimental results show that the performance of our method, evaluated in terms of inter-consistency with human annotators, is on a par with that achieved by humans and outperforms rival supervised methods. and outperforms rival supervised methods.
Abstractsub Topical annotation of documents with keyphTopical annotation of documents with keyphrases is a proven method for revealing the subject of scientific and research documents. However, scientific documents that are manually annotated with keyphrases are in the minority. This paper describes a machine learning-based automatic keyphrase annotation method for scientific documents, which utilizes Wikipedia as a thesaurus for candidate selection from documents' content and deploys genetic algorithms to learn a model for ranking and filtering the most probable keyphrases. Reported experimental results show that the performance of our method, evaluated in terms of inter-consistency with human annotators, is on a par with that achieved by humans and outperforms rival supervised methods. and outperforms rival supervised methods.
Bibtextype inproceedings  +
Doi 10.1007/978-3-642-33876-2_6  +
Has author Joorabchi A. + , Mahdi A.E. +
Has extra keyword Annotation methods + , Candidate selection + , Key-phrase + , Metadata generation + , Scientific documents + , Text mining + , Wikipedia + , Data mining + , Digital libraries + , Genetic algorithms + , Knowledge engineering + , Knowledge management + , Websites + , Metadata +
Has keyword Genetic algorithms + , Keyphrase annotation + , Keyphrase indexing + , Scientific digital libraries + , Subject metadata + , Text mining + , Wikipedia +
Isbn 9783642338755  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 32–41  +
Published in Lecture Notes in Computer Science +
Title Automatic subject metadata generation for scientific documents using wikipedia and genetic algorithms +
Type conference paper  +
Volume 7603 LNAI  +
Year 2012 +
Creation dateThis property is a special property in this wiki. 7 November 2014 04:01:49  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 04:01:49  +
DateThis property is a special property in this wiki. 2012  +
hide properties that link here 
Automatic subject metadata generation for scientific documents using wikipedia and genetic algorithms + Title
 

 

Enter the name of the page to start browsing from.