Browse wiki

Jump to: navigation, search
DoSO: A document self-organizer
Abstract In this paper, we propose a Document Self In this paper, we propose a Document Self Organizer (DoSO), an extension of the classic Self Organizing Map (SOM) model, in order to deal more efficiently with a document clustering task. Starting from a document representation model, based on important "concepts" exploiting Wikipedia knowledge, that we have previously developed in order to overcome some of the shortcomings of the Bag-of-Words (BOW) model, we demonstrate how SOM's performance can be boosted by using themost important concepts of the document collection to explicitly initialize the neurons. We also show how a hierarchical approach can be utilized in the SOMmodel and how this can lead to amore comprehensive final clustering result with hierarchical descriptive labels attached to neurons and clusters. Experiments show that the proposed model (DoSO) yields promising results both in terms of extrinsic and SOM evaluation measures. of extrinsic and SOM evaluation measures.
Abstractsub In this paper, we propose a Document Self In this paper, we propose a Document Self Organizer (DoSO), an extension of the classic Self Organizing Map (SOM) model, in order to deal more efficiently with a document clustering task. Starting from a document representation model, based on important "concepts" exploiting Wikipedia knowledge, that we have previously developed in order to overcome some of the shortcomings of the Bag-of-Words (BOW) model, we demonstrate how SOM's performance can be boosted by using themost important concepts of the document collection to explicitly initialize the neurons. We also show how a hierarchical approach can be utilized in the SOMmodel and how this can lead to amore comprehensive final clustering result with hierarchical descriptive labels attached to neurons and clusters. Experiments show that the proposed model (DoSO) yields promising results both in terms of extrinsic and SOM evaluation measures. of extrinsic and SOM evaluation measures.
Bibtextype article  +
Doi 10.1007/s10844-012-0204-9  +
Has author Gerasimos Spanakis + , Georgios Siolas + , Andreas Stafylopatis +
Has extra keyword Bag of words + , Clustering results + , Document Clustering + , Document collection + , Document Representation + , Evaluation measures + , Hierarchical approach + , SOM + , Wikipedia + , Clustering algorithms + , Knowledge representation + , Labels + , Self organizing maps + , Websites + , Information retrieval +
Has keyword Document clustering + , Document representation + , SOM + , Wikipedia +
Issn 9259902  +
Issue 3  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 577–610  +
Published in Journal of Intelligent Information Systems +
Title DoSO: A document self-organizer +
Type journal article  +
Volume 39  +
Year 2012 +
Creation dateThis property is a special property in this wiki. 7 November 2014 12:14:45  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Journal articles  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 12:14:45  +
DateThis property is a special property in this wiki. 2012  +
hide properties that link here 
DoSO: A document self-organizer + Title
 

 

Enter the name of the page to start browsing from.