Browse wiki

Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata
Abstract Blatantly unproductive edits undermine theBlatantly unproductive edits undermine the quality of the collaboratively-edited encyclopedia, Wikipedia. They not only disseminate dishonest and offensive content, but force editors to waste time undoing such acts of vandalism. Language-processing has been applied to combat these malicious edits, but as with email spam, these filters are evadable and computationally complex. Meanwhile, recent research has shown spatial and temporal features effective in mitigating email spam, while being lightweight and robust. In this paper, we leverage the spatio-temporal properties of revision metadata to detect vandalism on Wikipedia. An administrative form of reversion called rollback enables the tagging of malicious edits, which are contrasted with nonoffending edits in numerous dimensions. Crucially, none of these features require inspection of the article or revision text. Ultimately, a classifier is produced which flags vandalism at performance comparable to the natural-language efforts we intend to complement (85% accuracy at 50% recall). The classifier is scalable (processing 100+ edits a second) and has been used to locate over 5,000 manually-confirmed incidents of vandalism outside our labeled set.ents of vandalism outside our labeled set.
Abstractsub Blatantly unproductive edits undermine theBlatantly unproductive edits undermine the quality of the collaboratively-edited encyclopedia, Wikipedia. They not only disseminate dishonest and offensive content, but force editors to waste time undoing such acts of vandalism. Language-processing has been applied to combat these malicious edits, but as with email spam, these filters are evadable and computationally complex. Meanwhile, recent research has shown spatial and temporal features effective in mitigating email spam, while being lightweight and robust. In this paper, we leverage the spatio-temporal properties of revision metadata to detect vandalism on Wikipedia. An administrative form of reversion called rollback enables the tagging of malicious edits, which are contrasted with nonoffending edits in numerous dimensions. Crucially, none of these features require inspection of the article or revision text. Ultimately, a classifier is produced which flags vandalism at performance comparable to the natural-language efforts we intend to complement (85% accuracy at 50% recall). The classifier is scalable (processing 100+ edits a second) and has been used to locate over 5,000 manually-confirmed incidents of vandalism outside our labeled set.ents of vandalism outside our labeled set.
Bibtextype inproceedings  +
Doi 10.1145/1752046.1752050  +
Has author Andrew G. West + , Sampath Kannan + , Insup Lee +
Has extra keyword Vandalism +
Has keyword Wikipedia + , Spatio-temporal reputation + , Vandalism + , Collaboration software + , Content-based access control +
Has reference Assigning Trust to Wikipedia Content + , A content-driven reputation system for the Wikipedia + , Temporal analysis of the wikigraph + , Internet encyclopaedias go head to head + , Automatic Vandalism Detection in Wikipedia + , Creating, destroying, and restoring value in Wikipedia + , Automatic vandalism detection in wikipedia: towards a machine learning approach + , Talk Before You Type: Coordination in Wikipedia + , Assessing the quality of Wikipedia articles with lifecycle based metrics +
Has remote mirror http://repository.upenn.edu/cis_reports/917/  +
Has slides http://www.andrew-g-west.com/docs/wiki_eurosec_slides.pdf  +
Isbn 978-1-4503-0059-9  +
Language English +
Number of citations by publication 3  +
Number of references by publication 9  +
Pages 22-28  +
Peer-reviewed Yes  +
Published in EUROSEC +
Related dataset Wikipedia Vandalism Corpus (Andrew G. West) +
Related tool STiki +
Title Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata +
Type conference paper  +
Year 2010 +
Creation dateThis property is a special property in this wiki. 23 January 2012 20:24:27  +
Categories Publications without license parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 June 2013 04:56:30  +
DateThis property is a special property in this wiki. April 2010  +
hide properties that link here 
Circadian patterns of Wikipedia editorial activity: A demographic analysis + , Dynamics of Conflicts in Wikipedia + , Edit wars in Wikipedia + Has reference
Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata + Title
 

 

Enter the name of the page to start browsing from.