Browse wiki

Jump to: navigation, search
A corpus-based study of edit categories in featured and non-featured wikipedia articles
Abstract In this paper, we present a study of the cIn this paper, we present a study of the collaborative writing process in Wikipedia. Our work is based on a corpus of 1,995 edits obtained from 891 article revisions in the English Wikipedia. We propose a 21-category classification scheme for edits based on Faigley and Witte's (1981) model. Example edit categories include spelling error corrections and vandalism. In a manual multi-label annotation study with 3 annotators, we obtain an inter-annotator agreement of α = 0.67. We further analyze the distribution of edit categories for distinct stages in the revision history of 10 featured and 10 non-featured articles. Our results show that the information content in featured articles tends to become more stable after their promotion. On the opposite, this is not true for non-featured articles. We make the resulting corpus and the annotation guidelines freely available.he annotation guidelines freely available.
Abstractsub In this paper, we present a study of the cIn this paper, we present a study of the collaborative writing process in Wikipedia. Our work is based on a corpus of 1,995 edits obtained from 891 article revisions in the English Wikipedia. We propose a 21-category classification scheme for edits based on Faigley and Witte's (1981) model. Example edit categories include spelling error corrections and vandalism. In a manual multi-label annotation study with 3 annotators, we obtain an inter-annotator agreement of α = 0.67. We further analyze the distribution of edit categories for distinct stages in the revision history of 10 featured and 10 non-featured articles. Our results show that the information content in featured articles tends to become more stable after their promotion. On the opposite, this is not true for non-featured articles. We make the resulting corpus and the annotation guidelines freely available.he annotation guidelines freely available.
Bibtextype inproceedings  +
Has author Daxenberger J. + , Iryna Gurevych +
Has extra keyword Classification scheme + , Collaborative authoring + , Information contents + , Multi-label annotation + , Quality assessment + , Spelling errors + , Wikipedia + , Wikipedia articles + , Computational linguistics + , Websites +
Has keyword Collaborative authoring + , Quality assessment + , Revision history + , Wikipedia +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 711–726  +
Published in 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers +
Title A corpus-based study of edit categories in featured and non-featured wikipedia articles +
Type conference paper  +
Year 2012 +
Creation dateThis property is a special property in this wiki. 6 November 2014 16:11:44  +
Categories Publications without license parameter  + , Publications without DOI parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 16:11:44  +
DateThis property is a special property in this wiki. 2012  +
hide properties that link here 
A corpus-based study of edit categories in featured and non-featured wikipedia articles + Title
 

 

Enter the name of the page to start browsing from.