Automatic Vandalism Detection in Wikipedia
Abstract We present results of a new approach to detect destructive article revisions, so-called vandalism, in Wikipedia. Vandalism detection is a one-class classification problem, where vandalism edits are the target to be identified among all revisions. Interestingly, vandalism detection has not been addressed in the Information Retrieval literature by now. In this paper we discuss the characteristics of vandalism as humans recognize it and develop features to render vandalism detection as a machine learning task. We compiled a large number of vandalism edits in a corpus, which allows for the comparison of existing and new detection approaches. Using logistic regression we achieve 83% precision at 77% recall with our model. Compared to the rule-based methods that are currently applied in Wikipedia, our approach increases the F-Measure performance by 49% while being faster at the same time.
Bibtextype misc  +
Citeulike 3812771  +
Doi 10.1007/978-3-540-78646-7_75  +
Has author Robert Gerling +
Has remote mirror http://www.uni-weimar.de/medien/webis/publications/downloads/theses/gerling_2008.pdf  +
Language German +
Number of citations by publication 4  +
Number of references by publication 0  +
Pages 663-668  +
Published in Bauhaus-University Weimar +
Title Automatic Vandalism Detection in Wikipedia +
Type Diploma  +
Year 2008 +
Creation date 29 January 2012 11:24:04  +
Categories Publications without keywords parameter  + , Publications without license parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Publications without references parameter  + , Publications  +
Modification date 20 September 2014 12:26:04  +
Date 2008  +
Properties that link here
Crowdsourcing a Wikipedia Vandalism Corpus + , Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata + , Dynamics of Conflicts in Wikipedia + , Wikipedia Vandalism Detection Through Machine Learning: Feature Review and New Proposals + Has reference
Automatic Vandalism Detection in Wikipedia + Title
 

 
