János Kertész

From WikiPapers
Jump to: navigation, search

János Kertész is an author.

Publications

Only those publications related to wikis are shown here.
Title Keyword(s) Published in Language DateThis property is a special property in this wiki. Abstract R C
Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data PLoS ONE English 2013 Use of socially generated "big data" to access information about collective states of the minds in human societies has become a new paradigm in the emerging field of computational social science. A natural application of this would be the prediction of the society's reaction to a new product in the sense of popularity and adoption rate. However, bridging the gap between "real time monitoring" and "early predicting" remains a big challenge. Here we report on an endeavor to build a minimalistic predictive model for the financial success of movies based on collective activity data of online users. We show that the popularity of a movie can be predicted much before its release by measuring and analyzing the activity level of editors and viewers of the corresponding entry to the movie in Wikipedia, the well-known online encyclopedia. 0 0
Dynamics of Conflicts in Wikipedia PLoS ONE English June 2012 In this work we study the dynamical features of editorial wars in Wikipedia (WP). Based on our previously established algorithm, we build up samples of controversial and peaceful articles and analyze the temporal characteristics of the activity in these samples. On short time scales, we show that there is a clear correspondence between conflict and burstiness of activity patterns, and that memory effects play an important role in controversies. On long time scales, we identify three distinct developmental patterns for the overall behavior of the articles. We are able to distinguish cases eventually leading to consensus from those cases where a compromise is far from achievable. Finally, we analyze discussion networks and conclude that edit wars are mainly fought by few editors only. 44 1
A practical approach to language complexity: a Wikipedia case study Submitted to PLoS ONE English 2012 In this paper we present statistical analysis of English texts from Wikipedia (WP). We try to address the issue of language complexity empirically by comparing samples of the main English WP (Main) and the simple English WP (Simple). Simple is supposed to use a more simplified language with a limited vocabulary, and editors are explicitly requested to follow this guideline, yet in practice the vocabulary richness of both samples are at the same level. However, detailed analysis of longer units (n-grams rather than words alone) shows that the language of Simple is indeed less complex than that of Main. Comparing the two language varieties by the Gunning readability index supports this conclusion. We also report on the topical dependence of language complexity, e.g. that the language is more advanced in conceptual articles compared to person-based (biographical) and object-based articles. Finally, we investigate the relation between conflict and language complexity by analysing the content of the talk pages associated to controversial and peacefully developing articles, concluding that controversy has the effect of reducing language complexity. 0 0
Circadian patterns of Wikipedia editorial activity: A demographic analysis Wikipedia
Editorial activity
Editors demography
Circadian patterns
PLoS ONE English 2012 Wikipedia (WP) as a collaborative, dynamical system of humans is an appropriate subject of social studies. Each single action of the members of this society, i.e. editors, is well recorded and accessible. Using the cumulative data of 34 Wikipedias in different languages, we try to characterize and find the universalities and differences in temporal activity patterns of editors. Based on this data, we estimate the geographical distribution of editors for each WP in the globe. Furthermore we also clarify the differences among different groups of WPs, which originate in the variance of cultural and social features of the communities of editors. 10 2
Dynamics of conflicts in Wikipedia Wikipedia; editorial activity; editors demography; circadian patterns To appear in PLoS ONE English 2012 In this work we study the dynamical features of editorial wars in Wikipedia (WP). Based on our previously established algorithm, we build up samples of controversial and peaceful articles and analyze the temporal characteristics of the activity in these samples. On short time scales, we show that there is a clear correspondence between conflict and burstiness of activity patterns, and that memory effects play an important role in controversies. On long time scales, we identify three distinct developmental patterns for the overall behavior of the articles. We are able to distinguish cases eventually leading to consensus from those cases where a compromise is far from achievable. Finally, we analyze discussion networks and conclude that edit wars are mainly fought by few editors only. 0 1
Early Prediction of Movie Box Office Success based on Wikipedia Activity Big Data Prediction
Big Data
English 2012 Use of socially generated "big data" to access information about collective states of the minds in human societies becomes a new paradigm in the emerging field of computational social science. One of the natural application of this would be prediction of the society's reaction to a new product in the sense of popularity and adoption rate. However, bridging between "real time monitoring" and "early predicting" remains as a big challenge. Here, we report on an endeavor to build a minimalistic predictive model for the financial success of movies based on collective activity data of online users. We show that the popularity of a movie could be predicted well in advance by measuring and analyzing the activity level of editors and viewers of the corresponding entry to the movie in Wikipedia, the well-known online encyclopedia. 0 0
Characterization and prediction of Wikipedia edit wars Wikipedia
Collaboration
Conflict
Classification
WebSci Conference English 2011 We present a new, eficient method for automatically detecting conict cases and test it on five diferent language Wikipedias. We discuss how the number of edits, reverts, the length of discussions deviate in such pages from those following the general workow. 4 2
Edit wars in Wikipedia IEEE Third International Conference on Social Computing English 2011 We present a new, efficient method for automatically detecting severe conflicts `edit wars' in Wikipedia and evaluate this method on six different language WPs. We discuss how the number of edits, reverts, the length of discussions, the burstiness of edits and reverts deviate in such pages from those following the general workflow, and argue that earlier work has significantly over-estimated the contentiousness of the Wikipedia editing process. 9 2