Classifying Wikipedia Articles Using Network Motif Counts and Ratios
932 bytes added
17:17, November 26, 2012
|keywords=Wikipedia Quality, Edit Networks
|abstract=Because the production of Wikipedia articles is a collaborative process, the edit network around a article can tell us something about the quality of that article. Articles that have received little attention will have sparse networks; at the other end of the spectrum, articles that are Wikipedia battle grounds will have very crowded networks. In this paper we evaluate the idea of characterizing edit networks as a vector of motif counts that can be used in clustering and classification. Our objective is not immediately to develop a powerful classifier but to assess what is the signal in network motifs. We show that this motif count vector representation is effective for classifying articles on the Wikipedia quality scale. We further show that ratios of motif counts can effectively overcome normalization problems when comparing networks of radically different sizes.
← Older edit
Retrieved from "
Follow us on Twitter!
wiki-research mailing list