Browse wiki

Jump to: navigation, search
A self organizing document map algorithm for large scale hyperlinked data inspired by neuronal migration
Abstract Web document clustering is one of the reseWeb document clustering is one of the research topics that is being pursued continuously due to the large variety of applications. Since Web documents usually have variety and diversity in terms of domains, content and quality, one of the technical difficulties is to find a reasonable number and size of clusters. In this research, we pay attention to SOMs (Self Organizing Maps) because of their capability of visualized clustering that helps users to investigate characteristics of data in detail. The SOM is widely known as a "scalable" algorithm because of its capability to handle large numbers of records. However, it is effective only when the vectors are small and dense. Although several research efforts on making the SOM scalable have been conducted, technical issues on scalability and performance for sparse high-dimensional data such as hyperlinked documents still remain. In this paper, we introduce MIGSOM, an SOM algorithm inspired by a recent discovery on neuronal migration. The two major advantages of MIGSOM are its scalability for sparse high-dimensional data and its clustering visualization functionality. In this paper, we describe the algorithm and implementation, and show the practicality of the algorithm by applying MIGSOM to a huge scale real data set: Wikipedia's hyperlink data.real data set: Wikipedia's hyperlink data.
Abstractsub Web document clustering is one of the reseWeb document clustering is one of the research topics that is being pursued continuously due to the large variety of applications. Since Web documents usually have variety and diversity in terms of domains, content and quality, one of the technical difficulties is to find a reasonable number and size of clusters. In this research, we pay attention to SOMs (Self Organizing Maps) because of their capability of visualized clustering that helps users to investigate characteristics of data in detail. The SOM is widely known as a "scalable" algorithm because of its capability to handle large numbers of records. However, it is effective only when the vectors are small and dense. Although several research efforts on making the SOM scalable have been conducted, technical issues on scalability and performance for sparse high-dimensional data such as hyperlinked documents still remain. In this paper, we introduce MIGSOM, an SOM algorithm inspired by a recent discovery on neuronal migration. The two major advantages of MIGSOM are its scalability for sparse high-dimensional data and its clustering visualization functionality. In this paper, we describe the algorithm and implementation, and show the practicality of the algorithm by applying MIGSOM to a huge scale real data set: Wikipedia's hyperlink data.real data set: Wikipedia's hyperlink data.
Bibtextype inproceedings  +
Doi 10.1145/1963192.1963241  +
Has author Kotaro Nakayama + , Yutaka Matsuo +
Has extra keyword Algorithm and implementation + , Clustering + , Dataset + , High dimensional data + , Hyperlinks + , Link analysis + , MAP algorithms + , Neuronal migration + , Number and size + , Research efforts + , Research topics + , Self organizing + , SOM + , SOM algorithms + , Technical difficulties + , Web document + , Web document clustering + , Wikipedia + , Algorithms + , Data visualization + , Hypertext systems + , Research + , Scalability + , Self organizing maps + , Visualisation + , World Wide Web + , Cluster analysis +
Has keyword Clustering + , Link analysis + , SOM + , Visualisation + , Wikipedia +
Isbn 9781450305181  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 95–96  +
Published in Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011 +
Title A self organizing document map algorithm for large scale hyperlinked data inspired by neuronal migration +
Type conference paper  +
Year 2011 +
Creation dateThis property is a special property in this wiki. 6 November 2014 16:37:47  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 16:37:47  +
DateThis property is a special property in this wiki. 2011  +
hide properties that link here 
A self organizing document map algorithm for large scale hyperlinked data inspired by neuronal migration + Title
 

 

Enter the name of the page to start browsing from.