Browse wiki

Jump to: navigation, search
"Got You!": Automatic vandalism detection in wikipedia with web-based shallow syntactic-semantic modeling
Abstract Discriminating vandalism edits from non-vaDiscriminating vandalism edits from non-vandalism edits in Wikipedia is a challenging task, as ill-intentioned edits can include a variety of content and be expressed in many different forms and styles. Previous studies are limited to rule-based methods and learning based on lexical features, lacking in linguistic analysis. In this paper, we propose a novel Web-based shallow syntacticsemantic modeling method, which utilizes Web search results as resource and trains topic-specific n-tag and syntactic n-gram language models to detect vandalism. By combining basic task-specific and lexical features, we have achieved high F-measures using logistic boosting and logistic model trees classifiers, surpassing the results reported by major Wikipedia vandalism detection systems.jor Wikipedia vandalism detection systems.
Abstractsub Discriminating vandalism edits from non-vaDiscriminating vandalism edits from non-vandalism edits in Wikipedia is a challenging task, as ill-intentioned edits can include a variety of content and be expressed in many different forms and styles. Previous studies are limited to rule-based methods and learning based on lexical features, lacking in linguistic analysis. In this paper, we propose a novel Web-based shallow syntacticsemantic modeling method, which utilizes Web search results as resource and trains topic-specific n-tag and syntactic n-gram language models to detect vandalism. By combining basic task-specific and lexical features, we have achieved high F-measures using logistic boosting and logistic model trees classifiers, surpassing the results reported by major Wikipedia vandalism detection systems.jor Wikipedia vandalism detection systems.
Bibtextype inproceedings  +
Has author Wang W.Y. + , McKeown K.R. +
Has extra keyword Detection system + , Lexical features + , Linguistic analysis + , Logistic models + , Modeling method + , N-gram language models + , Rule-based method + , Web searches + , Wikipedia + , Computational linguistics + , Semantic web + , Semantics + , Syntactics + , User interfaces + , Websites +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 1146–1154  +
Published in Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference +
Title "Got You!": Automatic vandalism detection in wikipedia with web-based shallow syntactic-semantic modeling +
Type conference paper  +
Volume 2  +
Year 2010 +
Creation dateThis property is a special property in this wiki. 6 November 2014 16:08:19  +
Categories Publications without keywords parameter  + , Publications without license parameter  + , Publications without DOI parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 16:08:19  +
DateThis property is a special property in this wiki. 2010  +
hide properties that link here 
"Got You!": Automatic vandalism detection in wikipedia with web-based shallow syntactic-semantic modeling + Title
 

 

Enter the name of the page to start browsing from.