Classifying Wikipedia Articles Using Network Motif Counts and Ratios

From WikiPapers
Jump to: navigation, search

Classifying Wikipedia Articles Using Network Motif Counts and Ratios is a 2012 conference paper written in English by Guangyu Wu, Martin Harrigan, Pádraig Cuningham and published in WikiSym.

[edit] Abstract

Because the production of Wikipedia articles is a collaborative process, the edit network around a article can tell us something about the quality of that article. Articles that have received little attention will have sparse networks; at the other end of the spectrum, articles that are Wikipedia battle grounds will have very crowded networks. In this paper we evaluate the idea of characterizing edit networks as a vector of motif counts that can be used in clustering and classification. Our objective is not immediately to develop a powerful classifier but to assess what is the signal in network motifs. We show that this motif count vector representation is effective for classifying articles on the Wikipedia quality scale. We further show that ratios of motif counts can effectively overcome normalization problems when comparing networks of radically different sizes.

[edit] References

This section requires expansion. Please, help!

Cited by

Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.

Discussion

No comments yet. Be first!