List of conference papers
This is a list of conference papers available in WikiPapers. Currently, there are 1123 conference papers.
Conference papers
| Title | Author(s) | Keyword(s) | Published in | Language | Date | Abstract | R | C |
|---|---|---|---|---|---|---|---|---|
| Art History on Wikipedia, a Macroscopic Observation | Doron Goldfarb Max Arends Josef Froschauer Dieter Merkl |
ArXiv | English | 20 April 2013 | How are articles about art historical actors interlinked within Wikipedia? Led by this question, we seek an overview of the link structure of a domain-specific subset of Wikipedia articles. We use an established domain-specific person name authority, the Getty Union List of Artist Names (ULAN), in order to externally identify relevant actors. Besides containing consistent biographical person data, this database also provides associative relationships between its person records, serving as a reference link structure for comparison. As a first step, we use mappings between the ULAN and English DBpedia provided by the Virtual International Authority File (VIAF). This way, we are able to identify 18,002 relevant person articles. Examining the link structure between these resources reveals interesting insight about the high level structure of art historical knowledge as it is represented on Wikipedia. | 0 | 1 | |
| WikiPapers, una recopilación colaborativa de literatura sobre wikis usando MediaWiki y su extensión semántica | Emilio J. Rodríguez-Posada Juan Manuel Dodero-Beardo |
IV Jornadas Predoctorales de la ESI | Spanish | December 2012 | Researchers' interest in wikis, especially Wikipedia, has grown in recent years. The first edition of WikiSym, a symposium on wikis, was held in 2005, and since then numerous congresses, workshops, conferences and competitions in this area have appeared. The study of wikis is an emerging and prolific field. There have been several attempts, though with little success, to compile all the literature on wikis. In this article we present WikiPapers, a collaborative project to compile all the literature on wikis. As of November 2012, more than 1,700 publications and their metadata have been collected, along with documentation on related tools and datasets. | 9 | 0 | |
| Capturing malicious bots using a beneficial bot and wiki | Takashi Yamanoue Kentaro Oda Koichi Shimozono |
Information security Vandal bot Wiki |
SIGUCCS | English | October 2012 | Locating malicious bots in a large network is problematic because its internal firewalls and NAT routers unintentionally contribute to hiding bots' host addresses and malicious packets. However, eliminating firewalls and NAT routers merely to locate bots is generally not acceptable. In this paper, we propose an easy to deploy, easy to manage network security controlling system for locating a malicious host behind the internal secure gateways. This network security controlling system consists of a remote security device and a command server. Each of the remote security devices is installed as a transparent link (implemented as an L2 switch) between the subnet and its gateway, to detect a host which is compromised with a malicious bot in a target subnet, while minimizing the impact of deployment. The security devices are remotely controlled by 'polling' the command server in order to eliminate the NAT traversal problem and to be firewall friendly. Since the remote security device is transparent, remotely controlled and robust to security gateways, we regard it as a beneficial bot. We adopt a web server with wiki software as the command server in order to take advantage of its customizability, ease of use and ease of deployment. | 4 | 1 |
| A M2M system using Arduino, Android and Wiki Software | Takashi Yamanoue Kentaro Oda Koichi Shimozono |
Wiki Social network |
IIAI ESKM | English | September 2012 | A Machine-to-Machine (M2M) system, which uses Arduino, Android, and Wiki software, is discussed. This system consists of mobile terminals and web sites with wiki software. A mobile terminal of the system consists of an Android terminal and an Arduino board with sensors and actuators. The mobile terminal reads data from the sensors in the Arduino board and sends the data to a wiki page. The mobile terminal also reads commands on the wiki page and controls the actuators of the Arduino board. In addition, a wiki page can have a program that reads the page and outputs information such as a graph. This system realizes an open communication forum not only for people but also for machines. | 4 | 0 |
| Citation needed: The dynamics of referencing in Wikipedia | Chih-Chun Chen Camille Roth |
Wikipedia Collaborative system Authority |
WikiSym | English | August 2012 | The extent to which a Wikipedia article refers to external sources to substantiate its content can be seen as a measure of its externally invoked authority. We introduce a protocol for characterising the referencing process in the context of general article editing. With a sample of relatively mature articles, we show that referencing does not occur regularly through an article’s lifetime but is associated with periods of more substantial editing, when the article has reached a certain level of maturity (in terms of the number of times it has been revised and its length). References also tend to be contributed by editors who have contributed more frequently and more substantially to an article, suggesting that a subset of more qualified or committed editors may exist for each article. | 0 | 0 |
| Classifying Wikipedia Articles Using Network Motif Counts and Ratios | Guangyu Wu Martin Harrigan Pádraig Cunningham |
Quality Edit Networks |
WikiSym | English | August 2012 | Because the production of Wikipedia articles is a collaborative process, the edit network around an article can tell us something about the quality of that article. Articles that have received little attention will have sparse networks; at the other end of the spectrum, articles that are Wikipedia battle grounds will have very crowded networks. In this paper we evaluate the idea of characterizing edit networks as a vector of motif counts that can be used in clustering and classification. Our objective is not immediately to develop a powerful classifier but to assess what is the signal in network motifs. We show that this motif count vector representation is effective for classifying articles on the Wikipedia quality scale. We further show that ratios of motif counts can effectively overcome normalization problems when comparing networks of radically different sizes. | 0 | 0 |
| Deletion Discussions in Wikipedia: Decision Factors and Outcomes | Jodi Schneider Alexander Passant Stefan Decker |
Collaboration and conflict Decision-making Wikipedia Articles for Deletion Factors analysis Online argumentation Values Novices |
WikiSym | English | August 2012 | Deletion of articles is a common process in Wikipedia, in order to ensure the overall quality of the encyclopedia. Yet, there is a need to better understand the procedures in order to promote the best decisions without unnecessary community work. In this paper, we study deletion in Wikipedia, drawing from factor analysis, and taking an in-depth, content-analysis-based approach. We address three research questions: First, what factors contribute to the decision about whether to delete a given article? Second, when multiple factors are given, what is the relative importance of those factors? Third, what are the outcomes of deletion discussions, both for articles and for the community? We find that multiple factors contribute to the assessment of an article, and we discuss their relative frequency. Further, we show how the assessment timeline focuses attention on improving borderline articles that have the potential to meet Wikipedia’s content inclusion policies, and we highlight the role of novice contributors in this improvement process. | 0 | 0 |
| Design for Free Learning - a Case Study on Supporting a Service Design Course | Teresa Consiglio Gerrit C. van der Veer |
Experience report Open source Cultural diversity E-learning Service design Learner centered design |
WikiSym | | August 2012 | In this experience report, we provide a case study on the use of information and communication technology (ICT) in higher education, developing an open source interactive learning environment to support a blended course. Our aim is to improve the quality of adult distance learning, ultimately involving peers worldwide, by developing learning environments as flexible as possible regardless of the culture and context of use, of individual learning style and age of the learners. Our example concerns a course of Service Design where the teacher was physically present only intermittently for part of the course while in the remaining time students worked in teams using our online learning environment. We developed a structure where students are guided through discovery learning and mutual teaching. We will show how we started from the students’ authentic goals and how we supported them by a simple structure of pacing the discovery process and merging theoretical understanding with practice in real life. Based on these first empirical results, practical guidelines have been developed regarding improvements on the structure provided for the learning material and on the interaction facilities for students, teachers and instructional designers. | 0 | 0 |
| Drawing a Data-Driven Portrait of Wikipedia Editors | Robert West Ingmar Weber Carlos Castillo |
Wikipedia Editors Web usage Expertise |
WikiSym | English | August 2012 | While there has been a substantial amount of research into the editorial and organizational processes within Wikipedia, little is known about how Wikipedia editors (Wikipedians) relate to the online world in general. We attempt to shed light on this issue by using aggregated log data from Yahoo!’s browser toolbar in order to analyze Wikipedians’ editing behavior in the context of their online lives beyond Wikipedia. We broadly characterize editors by investigating how their online behavior differs from that of other users; e.g., we find that Wikipedia editors search more, read more news, play more games, and, perhaps surprisingly, are more immersed in popular culture. Then we inspect how editors’ general interests relate to the articles to which they contribute; e.g., we confirm the intuition that editors are more familiar with their active domains than average users. Finally, we analyze the data from a temporal perspective; e.g., we demonstrate that a user’s interest in the edited topic peaks immediately before the edit. Our results are relevant as they illuminate novel aspects of what has become many Web users’ prevalent source of information. | 0 | 0 |
| Etiquette in Wikipedia: Weening New Editors into Productive Ones | Ryan Faulkner Steven Walling Maryana Pinchuk |
Wikipedia Huggle Newcomers Vandalism Wiki Retention |
WikiSym | English | August 2012 | Currently, the greatest challenge faced by the Wikipedia community involves reversing the decline of active editors on the site – in other words, ensuring that the encyclopedia’s contributors remain sufficiently numerous to fill the roles that keep it relevant. Due to the natural drop-off of old contributors, newcomers must constantly be socialized, trained and retained. However recent research has shown the Wikipedia community is failing to retain a large proportion of productive new contributors and implicates Wikipedia’s semi-automated quality control mechanisms and their interactions with these newcomers as an exacerbating factor. This paper evaluates the effectiveness of minor changes to the normative warning messages sent to newcomers from one of the most prolific of these quality control tools (Huggle) in preserving their rate of contribution. The experimental results suggest that substantial gains in newcomer participation can be attained through inexpensive changes to the wording of the first normative message that new contributors receive. | 0 | 0 |
| How Long Do Wikipedia Editors Keep Active? | Dell Zhang Karl Prior Mark Levene |
Social Media User Modelling Behaviour Mining Survival Analysis |
WikiSym | English | August 2012 | In this paper, we use the technique of survival analysis to investigate how long Wikipedia editors remain active in editing. Our results show that although the survival function of occasional editors roughly follows a lognormal distribution, the survival function of customary editors can be better described by a Weibull distribution (with the median lifetime of about 53 days). Furthermore, for customary editors, there are two critical phases (0-2 weeks and 8-20 weeks) when the hazard rate of becoming inactive increases. Finally, customary editors who are more active in editing are likely to keep active in editing for longer time. | 0 | 0 |
| Identifying controversial articles in Wikipedia: A comparative study | Hoda Sepehri Rad Denilson Barbosa |
Wikipedia Controversy Disagreement Comparison Monotonicity |
WikiSym | English | August 2012 | Wikipedia articles are the result of the collaborative editing of a diverse group of anonymous volunteer editors, who are passionate and knowledgeable about specific topics. One can argue that this plurality of perspectives leads to broader coverage of the topic, thus benefitting the reader. On the other hand, differences among editors on polarizing topics can lead to controversial or questionable content, where facts and arguments are presented and discussed to support a particular point of view. Controversial articles are manually tagged by Wikipedia editors, and span many interesting and popular topics, such as religion, history, and politics, to name a few. Recent works have been proposed on automatically identifying controversy within unmarked articles. However, to date, no systematic comparison of these efforts has been made. This is in part because the various methods are evaluated using different criteria and on different sets of articles by different authors, making it hard for anyone to verify the efficacy and compare all alternatives. We provide a first attempt at bridging this gap. We compare five different methods for modelling and identifying controversy, and discuss some of the unique difficulties and opportunities inherent to the way Wikipedia is produced. | 0 | 0 |
| In Search of the Ur-Wikipedia: Universality, Similarity, and Translation in the Wikipedia Inter-Language Link Network | Morten Warncke-Wang Anuradha Uduwage Zhenhua Dong John Riedl |
Wikipedia Tobler's Law First Law of Geography Multilingual |
WikiSym | English | August 2012 | Wikipedia has become one of the primary encyclopaedic information repositories on the World Wide Web. It started in 2001 with a single edition in the English language and has since expanded to more than 20 million articles in 283 languages. Criss-crossing between the Wikipedias is an interlanguage link network, connecting the articles of one edition of Wikipedia to another. We describe characteristics of articles covered by nearly all Wikipedias and those covered by only a single language edition, we use the network to understand how we can judge the similarity between Wikipedias based on concept coverage, and we investigate the flow of translation between a selection of the larger Wikipedias. Our findings indicate that the relationships between Wikipedia editions follow Tobler's first law of geography: similarity decreases with increasing distance. The number of articles in a Wikipedia edition is found to be the strongest predictor of similarity, while language similarity also appears to have an influence. The English Wikipedia edition is by far the primary source of translations. We discuss the impact of these results for Wikipedia as well as user-generated content communities in general. | 0 | 0 |
| Manypedia: Comparing Language Points of View of Wikipedia Communities | Paolo Massa Federico Scrinzi |
Wikipedia Cross-cultural comparison Linguistic Point of View Language Automatic translation Web tool Open source |
WikiSym | English | August 2012 | The 4 million articles of the English Wikipedia have been written in a collaborative fashion by more than 16 million volunteer editors. On each article, the community of editors strive to reach a neutral point of view, representing all significant views fairly, proportionately, and without biases. However, beside the English one, there are more than 280 editions of Wikipedia in different languages and their relatively isolated communities of editors are not forced by the platform to discuss and negotiate their points of view. So the empirical question is: do communities on different language Wikipedias develop their own diverse Linguistic Points of View (LPOV)? To answer this question we created and released as open source Manypedia, a web tool whose aim is to facilitate cross-cultural analysis of Wikipedia language communities by providing an easy way to compare automatically translated versions of their different representations of the same topic. | 0 | 0 |
| Mutual Evaluation of Editors and Texts for Assessing Quality of Wikipedia Articles | Yu Suzuki Masatoshi Yoshikawa |
Wikipedia Quality Peer review Edit history Link analysis |
WikiSym | English | August 2012 | In this paper, we propose a method to identify good quality Wikipedia articles by mutually evaluating editors and texts. A major approach for assessing article quality is a text survival ratio based approach. In this approach, when a text survives beyond multiple edits, the text is assessed as good quality. This approach assumes that poor quality texts are deleted by editors with high probability. However, many vandals delete good quality texts frequently, so the survival ratios of good quality texts are improperly decreased by vandals. As a result, many good quality texts are unfairly assessed as poor quality. In our method, we consider editor quality when calculating text quality, and decrease the impact on text quality of vandals who have low quality. Using this improvement, the accuracy of the text quality should be improved. However, an inherent problem of this idea is that the editor qualities are calculated from the text qualities. To solve this problem, we mutually calculate the editor and text qualities until they converge. We carried out an experimental evaluation, and we confirmed that the proposed method could accurately assess the text qualities. | 0 | 0 |
| Natural Language Processing for MediaWiki: The Semantic Assistants Approach | Bahar Sateli René Witte |
WikiSym | English | August 2012 | We present a novel architecture for the integration of Natural Language Processing (NLP) capabilities into wiki systems. The vision is that of a new generation of wikis that can help developing their own primary content and organize their structure by using state-of-the-art technologies from the NLP and Semantic Computing domains. The motivation for this integration is to enable wiki users – novice or expert – to benefit from modern text mining techniques directly within their wiki environment. We implemented these ideas based on MediaWiki and present a number of real-world application case studies that illustrate the practicability and effectiveness of this approach. | 0 | 0 | |
| On the Accuracy of Urban Crowd-Sourcing for Maintaining Large-Scale Geospatial Databases | Afra Mashhadi Giovanni Quattrone Licia Capra Peter Mooney |
Human Factors Measurement Reliability |
WikiSym | English | August 2012 | The world is in the midst of an immense population shift from rural areas to cities. Urban elements, such as businesses, Points-of-Interest (POIs), transportation, and housing are continuously changing, and collecting and maintaining accurate information about these elements within spatial databases has become an incredibly onerous task. A solution made possible by the uptake of social media is crowd-sourcing, where user-generated content can be cultivated into meaningful and informative collections, as exemplified by sites like Wikipedia. This form of user-contributed content is no longer confined to the Web: equipped with powerful mobile devices, citizens have become cartographers too, volunteering geographic information (e.g., POIs) as exemplified by sites like OpenStreetMap. In this paper, we investigate the extent to which crowd-sourcing can be relied upon to build and maintain an accurate map of the changing world, by means of a thorough analysis and comparison between traditional web-based crowd-sourcing (as in Wikipedia) and urban crowd-sourcing (as in OpenStreetMap). | 17 | 0 |
| Psychological processes underlying Wikipedia representations of natural and manmade disasters | Michela Ferron Paolo Massa |
Collective memory Traumatic event Man-made disasters Natural disasters LIWC Automated content analysis techniques |
WikiSym | English | August 2012 | Collective memories are precious resources for the society, because they help strengthening emotional bonding between community members, maintaining groups cohesion, and directing future behavior. Studying how people form their collective memories of emotional upheavals is important in order to better understand people's reactions and the consequences on their psychological health. Previous research investigated the effects of single traumatizing events, but few of them tried to compare different types of traumatic events like natural and man-made disasters. In this paper, interpreting Wikipedia as a collective memory place, we compare articles about natural and human-made disasters employing automated natural language techniques, in order to highlight the different psychological processes underlying users' sensemaking activities. | 0 | 0 |
| Staying in the Loop: Structure and Dynamics of Wikipedia's Breaking News Collaborations | Brian Keegan Darren Gergle Noshir Contractor |
Wikipedia High-tempo collaboration Network analysis Breaking news Collaboration Multigraph |
WikiSym | English | August 2012 | Despite the fact that Wikipedia articles about current events are more popular and attract more contributions than typical articles, canonical studies of Wikipedia have only analyzed articles about pre-existing information. We expect the co-authoring of articles about breaking news incidents to exhibit high-tempo coordination dynamics which are not found in articles about historical events and information. Using 1.03 million revisions made by 158,384 users to 3,233 English Wikipedia articles about disasters, catastrophes, and conflicts since 1990, we construct “article trajectories” of editor interactions as they coauthor an article. Examining a subset of this corpus, our analysis demonstrates that articles about current events exhibit structures and dynamics distinct from those observed among articles about non-breaking events. These findings have implications for how collective intelligence systems can be leveraged to process and make sense of complex information. | 0 | 0 |
| Towards Content-driven Reputation for Collaborative Code Repositories | Andrew G. West Insup Lee |
WikiTrust Wiki Code repository SVN Reputation Trust management Content persistence Code quality |
WikiSym | English | August 2012 | As evidenced by SourceForge and GitHub, code repositories now integrate Web 2.0 functionality that enables global participation with minimal barriers-to-entry. To prevent detrimental contributions enabled by crowdsourcing, reputation is one proposed solution. Fortunately this is an issue that has been addressed in analogous version control systems such as the *wiki* for natural language content. The WikiTrust algorithm ("content-driven reputation"), while developed and evaluated in wiki environments operates under a possibly shared collaborative assumption: actions that "survive" subsequent edits are reflective of good authorship. In this paper we examine WikiTrust's ability to measure author quality in collaborative code development. We first define a mapping from repositories to wiki environments and use it to evaluate a production SVN repository with 92,000 updates. Analysis is particularly attentive to reputation loss events and attempts to establish ground truth using commit comments and bug tracking. A proof-of-concept evaluation suggests the technique is promising (about two-thirds of reputation loss is justified) with false positives identifying areas for future refinement. Equally as important, these false positives exemplify differences in content evolution and the cooperative process between wikis and code repositories. | 0 | 0 |
| Wikipedia Customization through Web Augmentation Techniques | Oscar Díaz Cristóbal Arellano Gorka Puente |
Web Augmentation Wiki DSL |
WikiSym | English | August 2012 | Wikipedia is a successful example of collaborative knowledge construction. This can be synergistically complemented with personal knowledge construction whereby individuals are supported in their sharing, experimenting and building of information in a more private setting, without the scrutiny of the whole community. Ideally, both approaches should be seamlessly integrated so that wikipedians can easily transit from the public sphere to the private sphere, and vice versa. To this end, we introduce WikiLayer, a plugin for Wikipedia that permits wikipedians to locally supplement Wikipedia articles with their own content (i.e., a layer). Layering additional content is achieved locally by seamlessly interspersing Wikipedia content with custom content. WikiLayer is driven by three main wiki principles: affordability (i.e., if you know how to edit articles, you know how to layer), organic growth (i.e., layers evolve in synchrony with the underlying articles) and shareability (i.e., layers can be shared in confidence through the wikipedian’s social network, e.g., Facebook). The paper provides motivating scenarios for readers, contributors and editors. WikiLayer is available for download at http://webaugmentation.org/wikilayer.xpi. | 0 | 0 |
| Writing up rather than writing down: Becoming Wikipedia Literate | Heather Ford R. Stuart Geiger |
Literacy Wikipedia New literacies Educational technology Ethnography |
WikiSym | English | August 2012 | Editing Wikipedia is certainly not as simple as learning the MediaWiki syntax and knowing where the “edit” bar is, but how do we conceptualize the cultural and organizational understandings that make an effective contributor? We draw on work of literacy practitioner and theorist Richard Darville to advocate a multi-faceted theory of literacy that sheds light on what new knowledges and organizational forms are required to improve participation in Wikipedia’s communities. We outline what Darville refers to as the “background knowledges” required to be an empowered, literate member and apply this to the Wikipedia community. Using a series of examples drawn from interviews with new editors and qualitative studies of controversies in Wikipedia, we identify and outline several different literacy asymmetries. | 0 | 0 |
| A Casual Network Security Monitoring System using a Portable Sensor Device and Wiki Software | Takashi Yamanoue Kentaro Oda Koichi Shimozono |
Wiki Information security |
SAINT | English | July 2012 | A casual network security monitoring system is proposed in this paper. The system is easy to deploy without reconfiguring the central network infrastructure, the firewall, and the intrusion detector system (IDS) of an organization. A virus-infected host, which is hidden by the network address translator (NAT) of a sub LAN, can be identified easily by using this monitoring system with the IDS. This monitoring system consists of a portable sensor device and a web site with wiki software. The portable sensor device, which is located on a target LAN that may have virus-infected hosts, is remote-controlled by a network manager's commands. The commands and the results are written on a wiki page. | 3 | 1 |
| Reverts Revisited: Accurate Revert Detection in Wikipedia | Fabian Flöck Denny Vrandečić Elena Simperl |
Wikipedia Revert detection Editing behavior User modeling Collaboration systems Community-driven content creation Social dynamics |
Hypertext and Social Media 2012 | English | June 2012 | Wikipedia is commonly used as a proving ground for research in collaborative systems. This is likely due to its popularity and scale, but also to the fact that large amounts of data about its formation and evolution are freely available to inform and validate theories and models of online collaboration. As part of the development of such approaches, revert detection is often performed as an important pre-processing step in tasks as diverse as the extraction of implicit networks of editors, the analysis of edit or editor features and the removal of noise when analyzing the emergence of the content of an article. The current state of the art in revert detection is based on a rather naïve approach, which identifies revision duplicates based on MD5 hash values. This is an efficient, but not very precise technique that forms the basis for the majority of research based on revert relations in Wikipedia. In this paper we prove that this method has a number of important drawbacks - it only detects a limited number of reverts, while simultaneously misclassifying too many edits as reverts, and not distinguishing between complete and partial reverts. This is very likely to hamper the accurate interpretation of the findings of revert-related research. We introduce an improved algorithm for the detection of reverts, based on word tokens added or deleted, that addresses these drawbacks. We report on the results of a user study and other tests demonstrating the considerable gains in accuracy and coverage by our method, and argue for a positive trade-off, in certain research scenarios, between these improvements and our algorithm’s increased runtime. | 13 | 0 |
| Online open neuroimaging mass meta-analysis | Finn Årup Nielsen Matthew J. Kempton Steven C. R. Williams |
Wiki MediaWiki Meta-analysis Neuroimaging |
SePublica 2012 | English | May 2012 | We describe a system for meta-analysis where a wiki stores numerical data in a simple format and a web service performs the numerical computation. We initially apply the system on multiple meta-analyses of structural neuroimaging data results. The described system allows for mass meta-analysis, e.g., meta-analysis across multiple brain regions and multiple mental disorders. | 2 | 0 |
| Spamming for Science: Active Measurement in Web 2.0 Abuse Research | Andrew G. West Pedram Hayati Vidyasagar Potdar Insup Lee |
WECSR | English | March 2012 | Spam and other electronic abuses have long been a focus of computer security research. However, recent work in the domain has emphasized an *economic analysis* of these operations in the hope of understanding and disrupting the profit model of attackers. Such studies do not lend themselves to passive measurement techniques. Instead, researchers have become middle-men or active participants in spam behaviors; methodologies that lie at an interesting juncture of legal, ethical, and human subject (e.g., IRB) guidelines. In this work two such experiments serve as case studies: One testing a novel link spam model on Wikipedia and another using blackhat software to target blog comments and forums. Discussion concentrates on the experimental design process, especially as influenced by human-subject policy. Case studies are used to frame related work in the area, and scrutiny reveals the computer science community requires greater consistency in evaluating research of this nature. | 0 | 0 | |
| A Breakdown of Quality Flaws in Wikipedia | Maik Anderka Benno Stein |
Quality Flaws Information quality Wikipedia User-generated Content Analysis |
2nd Joint WICOW/AIRWeb Workshop on Web Quality (WebQuality 12) | English | 2012 | The online encyclopedia Wikipedia is a successful example of the increasing popularity of user generated content on the Web. Despite its success, Wikipedia is often criticized for containing low-quality information, which is mainly attributed to its core policy of being open for editing by everyone. The identification of low-quality information is an important task since Wikipedia has become the primary source of knowledge for a huge number of people around the world. Previous research on quality assessment in Wikipedia either investigates only small samples of articles, or else focuses on single quality aspects, like accuracy or formality. This paper targets the investigation of quality flaws, and presents the first complete breakdown of Wikipedia's quality flaw structure. We conduct an extensive exploratory analysis, which reveals (1) the quality flaws that actually exist, (2) the distribution of flaws in Wikipedia, and (3) the extent of flawed content. An important finding is that more than one in four English Wikipedia articles contains at least one quality flaw, 70% of which concern article verifiability. | 0 | 0 |
| A Cross-Lingual Dictionary for English Wikipedia Concepts | Valentin I. Spitkovsky Angel X. Chang |
Information retrieval Entity linking Wikipedia |
Proceedings of the Eighth International Conference on Language Resources and Evaluation | English | 2012 | We present a resource for automatically associating strings of text with English Wikipedia concepts. Our machinery is bi-directional, in the sense that it uses the same fundamental probabilistic methods to map strings to empirical distributions over Wikipedia articles as it does to map article URLs to distributions over short, language-independent strings of natural language text. For maximal interoperability, we release our resource as a set of flat line-based text files, lexicographically sorted and encoded with UTF-8. These files capture joint probability distributions underlying concepts (we use the terms article, concept and Wikipedia URL interchangeably) and associated snippets of text, as well as other features that can come in handy when working with Wikipedia articles and related information. | 5 | 0 |
| A Wikipedia-based corpus reference tool | Jason Ginsburg | Corpus Language teaching Wikipedia |
HCCE | English | 2012 | This paper describes a dictionary-like reference tool that is designed to help users find information that is similar to what one would find in a dictionary when looking up a word, except that this information is extracted automatically from large corpora. For a particular vocabulary item, a user can view frequency information, part-of-speech distribution, word-forms, definitions, example paragraphs and collocations. All of this information is extracted automatically from corpora and most of this information is extracted from Wikipedia. Since Wikipedia is a massive corpus covering a diverse range of general topics, this information is probably very representative of how target words are used in general. This project has applications for English language teachers and learners, as well as for language researchers. | 0 | 0 |
| Analysis of discussion contributions in translated Wikipedia articles | Ari Hautasaari Toru Ishida |
Talk page Translation Wikipedia |
English | 2012 | 0 | 0 | ||
| Biographical Social Networks on Wikipedia: A cross-cultural study of links that made history | Pablo Aragón Andreas Kaltenbrunner David Laniado Yana Volkovich |
Wikipedia Social network analysis Cross language studies |
WikiSym | English | 2012 | It is arguable whether history is made by great men and women or vice versa, but undoubtably social connections shape history. Analysing Wikipedia, a global collective memory place, we aim to understand how social links are recorded across cultures. Starting with the set of biographies in the English Wikipedia we focus on the networks of links between these biographical articles on the 15 largest language Wikipedias. We detect the most central characters in these networks and point out culture-related peculiarities. Furthermore, we reveal remarkable similarities between distinct groups of language Wikipedias and highlight the shared knowledge about connections between persons across cultures. | 0 | 0 |
| Bootstrapping wikis: developing critical mass in a fledgling community by seeding content | Jacob Solomon Rick Wash |
Bootstrapping Critical mass Online contribution |
Computer-Supported Cooperative Work | English | 2012 | 0 | 0 | |
| Breaking news on wikipedia: dynamics, structures, and roles in high-tempo collaboration | Brian C. Keegan | Breaking news Co-authorship Current events Network analysis Social network Social role Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | 0 | 0 | |
| Building a standpoints web to support decision-making in wikipedia | Jodi Schneider | Collaboration Decision rationale Deliberation Online argumentation Sensemaking Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | 0 | 0 | |
| Classroom Wikipedia participation effects on future intentions to contribute | Cliff Lampe Jonathan Obar Elif Ozkaya Paul Zube Alcides Velasquez |
Cscl Efficacy Participation Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | One of the biggest challenges faced by social media sites like Wikipedia is how to motivate users to contribute content. Research continues to demonstrate that only a small percentage of users contribute to user-generated content sites. In this study we assess the results of a Wikimedia Foundation initiative, which had graduate and undergraduate students from 22 U.S. universities contribute content to Wikipedia articles as part of their coursework. 185 students were asked about their participation in the initiative and their intention to participate on Wikipedia in the future. Results suggest that intentions to continue contributing are influenced by the initial attitude towards the class, and the degree to which students perceived they were writing for a global audience. | 7 | 0 |
| Conflict, criticism, or confidence: an empirical examination of the gender gap in wikipedia contributions | Benjamin Collier Julia Bear |
Confidence Conflict Criticism Gender Survey Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | A recent survey of contributors to Wikipedia found that less than 15% of contributors are women. This gender contribution gap has received significant attention from both researchers and the media. A panel of researchers and practitioners has offered several insights and opinions as to why a gender gap exists in contributions despite gender anonymity online. The gender research literature suggests that the difference in contribution rates could be due to three factors: (1) the high levels of conflict in discussions, (2) dislike of critical environments, and (3) lack of confidence in editing other contributors' work. This paper examines these hypotheses regarding the existence of the gender gap in contribution by using data from an international survey of 176,192 readers, contributors, and former contributors to Wikipedia, including measures of demographics, education, motivation, and participation. Implications for improving the design and culture of online communities to be more gender inclusive are discussed. | 0 | 0 |
| DBpedia for NLP: A Multilingual Cross-domain Knowledge Base | Pablo N. Mendes Max Jakob Christian Bizer |
International Conference on Language Resources and Evaluation | English | 2012 | 0 | 0 | ||
| Developing a university Wikipedia | Douglas Edmonson | PHP QR code Database IOS IPhone Information Lookup |
English | 2012 | 0 | 0 | ||
| Do editors or articles drive collaboration?: multilevel statistical network analysis of wikipedia coauthorship | Brian Keegan Darren Gergle Noshir Contractor |
Co-authorship Collaboration Ergm Exponential random graph model Network analysis Socio-material Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | 0 | 0 | |
| Emotions and dialogue in a peer-production community: the case of Wikipedia | David Laniado Carlos Castillo Andreas Kaltenbrunner Mayo Fuster Morell |
Wikipedia Talk page Emotions Gender gap |
WikiSym | English | 2012 | This paper presents a large-scale analysis of emotions in conversations among Wikipedia editors. Our focus is on the emotions expressed by editors in talk pages, measured by using the Affective Norms for English Words (ANEW). We find evidence that to a large extent women tend to participate in discussions with a more positive tone, and that administrators are more positive than non-administrators. Surprisingly, female non-administrators tend to behave like administrators in many aspects. We observe that replies are on average more positive than the comments they reply to, preventing many discussions from spiralling down into conflict. We also find evidence of emotional homophily: editors having similar emotional styles are more likely to interact with each other. Our findings offer novel insights into the emotional dimension of interactions in peer-production communities, and contribute to debates on issues such as the flattening of editor growth and the gender gap. | 0 | 0 |
| FlawFinder: A Modular System for Predicting Quality Flaws in Wikipedia | Oliver Ferschke Iryna Gurevych Marc Rittberger |
PAN | English | 2012 | With over 23 million articles in 285 languages, Wikipedia is the largest free knowledge base on the web. Due to its open nature, everybody is allowed to access and edit the contents of this huge encyclopedia. As a downside of this open access policy, quality assessment of the content becomes a critical issue and is hardly manageable without computational assistance. In this paper, we present FlawFinder, a modular system for automatically predicting quality flaws in unseen Wikipedia articles. It competed in the inaugural edition of the Quality Flaw Prediction Task at the PAN Challenge 2012 and achieved the best precision of all systems and the second place in terms of recall and F1-score. | 0 | 0 | |
| How the web can help Wikipedia: a study on information complementation of Wikipedia by the web | Damien Eklou Yasuhito Asano Masatoshi Yoshikawa |
Complementary information retrieval Information aggregation Topic modeling |
ICUIMC | English | 2012 | 0 | 0 | |
| Learning from history: predicting reverted work at the word level in wikipedia | Jeffrey Rzeszotarski Aniket Kittur |
Applied machine learning Reverted work Wikipedia |
Computer-Supported Cooperative Work | English | 2012 | 0 | 0 | |
| Massively Distributed Authorship of Academic Papers | Collaboration Writing Crowdsourcing Scholarship |
CHI | English | 2012 | Wiki-like or crowdsourcing models of collaboration can provide a number of benefits to academic work. These techniques may engage expertise from different disciplines, and potentially increase productivity. This paper presents a model of massively distributed collaborative authorship of academic papers. This model, developed by a collective of thirty authors, identifies key tools and techniques that would be necessary or useful to the writing process. The process of collaboratively writing this paper was used to discover, negotiate, and document issues in massively authored scholarship. Our work provides the first extensive discussion of the experiential aspects of large-scale collaborative research. | 0 | 0 | |
| Negotiating Cultural Values in Social Media: A Case Study from Wikipedia | Jonathan T. Morgan Robert M. Mason Karine Nahon |
Hci Computer-Supported Cooperative Work Cmc Online collaboration Wikipedia Wiki Culture Values Controversy Social media |
HICSS | English | 2012 | 0 | 0 | |
| Network Centrality and Contributions to Online Public Good--The Case of Chinese Wikipedia | Chong (Alex) Wang Xiaoquan (Michael) Zhang |
Wikipedia Network Position Centrality Role Natural Experiment |
HICSS | English | 2012 | 0 | 0 | |
| Omnipedia: Bridging the Wikipedia Language Gap | Patti Bao Brent Hecht Samuel Carton Mahmood Quaderi Michael Horn Darren Gergle |
Wikipedia Multilingual Hyperlingual Language barrier User generated content Text mining |
International Conference on Human Factors in Computing Systems | English | 2012 | We present Omnipedia, a system that allows Wikipedia readers to gain insight from up to 25 language editions of Wikipedia simultaneously. Omnipedia highlights the similarities and differences that exist among Wikipedia language editions, and makes salient information that is unique to each language as well as that which is shared more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with a multilingual Wikipedia experience. These include visualizing content in a language-neutral way and aligning data in the face of diverse information organization strategies. We present a study of Omnipedia that characterizes how people interact with information using a multilingual lens. We found that users actively sought information exclusive to unfamiliar language editions and strategically compared how language editions defined concepts. Finally, we briefly discuss how Omnipedia generalizes to other domains facing language barriers. | 0 | 0 |
| On the Evolution of Quality Flaws and the Effectiveness of Cleanup Tags in the English Wikipedia | Maik Anderka Benno Stein Matthias Busse |
Wikipedia Cleanup Tags Quality Flaws Information quality Quality Flaw Evolution |
Wikipedia Academy | English | 2012 | The improvement of information quality is a major task for the free online encyclopedia Wikipedia. Recent studies targeted the analysis and detection of specific quality flaws in Wikipedia articles. To date, quality flaws have been exclusively investigated in current Wikipedia articles, based on a snapshot representing the state of Wikipedia at a certain time. This paper goes further, and provides the first comprehensive breakdown of the evolution of quality flaws in Wikipedia. We utilize cleanup tags to analyze the quality flaws that have been tagged by the Wikipedia community in the English Wikipedia, from its launch in 2001 until 2011. This leads to interesting findings regarding (1) the development of Wikipedia's quality flaw structure and (2) the usage and the effectiveness of cleanup tags. Specifically, we show that inline tags are more effective than tag boxes, and provide statistics about the considerable volume of rare and non-specific cleanup tags. We expect that this work will support the Wikipedia community in making quality assurance activities more efficient. | 0 | 0 |
| On the Use of PU Learning for Quality Flaw Prediction in Wikipedia | Edgardo Ferretti Donato Hernández Fusilier Rafael Guzmán Cabrera Manuel Montes y Gómez Marcelo Errecalde Paolo Rosso |
PAN | English | 2012 | In this article we describe a new approach to assess Quality Flaw Prediction in Wikipedia. The partially supervised method studied, called PU Learning, has been successfully applied in classification tasks with traditional corpora like Reuters-21578 or 20-Newsgroups. To the best of our knowledge, this is the first time that it is applied in this domain. Throughout this paper, we describe how the original PU Learning approach was evaluated for assessing quality flaws and the modifications introduced to get a quality flaws predictor which obtained the best F1 scores in the task “Quality Flaw Prediction in Wikipedia” of the PAN challenge. | 0 | 0 | |
| Overview of the 1st International Competition on Quality Flaw Prediction in Wikipedia | Maik Anderka Benno Stein |
Information quality Wikipedia Quality Flaw Prediction |
CLEF | English | 2012 | The paper overviews the task "Quality Flaw Prediction in Wikipedia" of the PAN'12 competition. An evaluation corpus is introduced which comprises 1,592,226 English Wikipedia articles, of which 208,228 have been tagged to contain one of ten important quality flaws. Moreover, the performance of three quality flaw classifiers is evaluated. | 0 | 0 |
| Planteome annotation wiki: a semantic application for the community curation of plant genotypes and phenotypes | Justin Preece Justin Elser Pankaj Jaiswal |
SWAT4LS | English | 2012 | 0 | 0 | ||
| Predicting Quality Flaws in User-generated Content: The Case of Wikipedia | Maik Anderka Benno Stein Nedim Lipka |
User-generated Content Analysis Information quality Wikipedia Quality Flaw Prediction One-class Classification |
35th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2012) | English | 2012 | The detection and improvement of low-quality information is a key concern in Web applications that are based on user-generated content; a popular example is the online encyclopedia Wikipedia. Existing research on quality assessment of user-generated content deals with the classification as to whether the content is high-quality or low-quality. This paper goes one step further: it targets the prediction of quality flaws, this way providing specific indications in which respects low-quality content needs improvement. The prediction is based on user-defined cleanup tags, which are commonly used in many Web applications to tag content that has some shortcomings. We apply this approach to the English Wikipedia, which is the largest and most popular user-generated knowledge source on the Web. We present an automatic mining approach to identify the existing cleanup tags, which provides us with a training corpus of labeled Wikipedia articles. We argue that common binary or multiclass classification approaches are ineffective for the prediction of quality flaws and hence cast quality flaw prediction as a one-class classification problem. We develop a quality flaw model and employ a dedicated machine learning approach to predict Wikipedia's most important quality flaws. Since in the Wikipedia setting the acquisition of significant test data is intricate, we analyze the effects of a biased sample selection. In this regard we illustrate the classifier effectiveness as a function of the flaw distribution in order to cope with the unknown (real-world) flaw-specific class imbalances. The flaw prediction performance is evaluated with 10,000 Wikipedia articles that have been tagged with the ten most frequent quality flaws: provided test data with little noise, four flaws can be detected with a precision close to 1. | 0 | 0 |
| Qualitative assessment of wiki-based learning processes | Antonio Balderas Manuel Palomo-Duarte Juan Manuel Dodero-Beardo Iván Ruiz-Rube |
SPDECE | English | 2012 | 0 | 0 | ||
| SWiPE: Searching Wikipedia By Example | Maurizio Atzori Carlo Zaniolo |
Structured query interface Visual query language Semi-structured data querying |
English | 2012 | 0 | 0 | ||
| Supporting collaboration in Wikipedia between language communities | Ranjitha Gurunath Kulkarni Gaurav Trivedi Tushar Suresh Miaomiao Wen Zeyu Zheng Carolyn Rose |
Computer-Supported Cooperative Work Cross-lingual document similarity Wikipedia |
English | 2012 | 0 | 0 | ||
| There is No Deadline - Time Evolution of Wikipedia Discussions | Andreas Kaltenbrunner David Laniado |
Wikipedia Online discussion Temporal patterns H-index |
WikiSym | English | 2012 | Wikipedia articles are by definition never finished: at any moment their content can be edited, or discussed in the associated talk pages. In this study we analyse the evolution of these discussions to unveil patterns of collective participation along the temporal dimension, and to shed light on the process of content creation on different topics. At a micro-scale, we investigate peaks in the discussion activity and we observe a non-trivial relationship with edit activity. At a larger scale, we introduce a measure to account for how fast discussions grow in complexity, and we find speeds that span three orders of magnitude for different articles. Our analysis should help the community in tasks such as early detection of controversies and assessment of discussion maturity. | 0 | 0 |
| WIKISCORE - A Collaborative Environment For Music Transcription And Publishing | José João A. G. Dias de Almeida Nuno Ramos Carvalho José Nuno Oliveira |
Music transcription ABC Wiki Collaborative work Music publishing |
Social Shaping of Digital Publishing: Exploring the Interplay Between Culture and Technology - Proceedings of the 16th International Conference on Electronic Publishing | English | 2012 | Music sources are most commonly shared in music scores scanned or printed on paper sheets. These artifacts are rich in information, but since they are images it is hard to re-use and share their content in today's digital world. There are modern languages that can be used to transcribe music sheets, but this is still a time consuming task, because of the complexity involved in the process and the typical huge size of the original documents. WIKI SCORE is a collaborative environment where several people work together to transcribe music sheets to a shared medium, using the notation. This eases the process of transcribing huge documents, and stores the document in a well known notation, that can be used later on to publish the whole content in several formats, such as a PDF document, images or audio files for example. | 2 | 0 |
| Wikidata: a new platform for collaborative data collection | Denny Vrandečić | Semantic web Wikipedia Linked data DBpedia |
International conference companion on World Wide Web | English | 2012 | This year, Wikimedia starts to build a new platform for the collaborative acquisition and maintenance of structured data: Wikidata. Wikidata's prime purpose is to be used within the other Wikimedia projects, like Wikipedia, to provide well-maintained, high-quality data. The nature and requirements of the Wikimedia projects require to develop a few novel, or at least unusual features for Wikidata: Wikidata will be a secondary database, i.e. instead of containing facts it will contain references for facts. It will be fully internationalized. It will contain inconsistent and contradictory facts, in order to represent the diversity of knowledge about a given entity. | 0 | 0 |
| Is Wikipedia Inefficient? Modelling Effort and Participation in Wikipedia | Kevin Crowston Nicolas Jullien Felipe Ortega |
Data Envelopment Analysis Efficiency Wikipedia |
HICSS 2013 | English | 17 November 2011 | Concerns have been raised about the decreased ability of Wikipedia to recruit editors and in to harness the effort of contributors to create new articles and imp | 0 | 0 |
| Autonomous Link Spam Detection in Purely Collaborative Environments | Andrew G. West Avantika Agrawal Phillip Baker Brittney Exline Insup Lee |
Wikipedia Collaboration Collaborative security Information security Spam Spam mitigation Reputation Spatio-temporal features Machine learning Intelligent routing |
WikiSym | English | October 2011 | Collaborative models (e.g., wikis) are an increasingly prevalent Web technology. However, the open-access that defines such systems can also be utilized for nefarious purposes. In particular, this paper examines the use of collaborative functionality to add inappropriate hyperlinks to destinations outside the host environment (i.e., link spam). The collaborative encyclopedia, Wikipedia, is the basis for our analysis. Recent research has exposed vulnerabilities in Wikipedia's link spam mitigation, finding that human editors are latent and dwindling in quantity. To this end, we propose and develop an autonomous classifier for link additions. Such a system presents unique challenges. For example, low barriers-to-entry invite a diversity of spam types, not just those with economic motivations. Moreover, issues can arise with how a link is presented (regardless of the destination). In this work, a spam corpus is extracted from over 235,000 link additions to English Wikipedia. From this, 40+ features are codified and analyzed. These indicators are computed using "wiki" metadata, landing site analysis, and external data sources. The resulting classifier attains 64% recall at 0.5% false-positives (ROC-AUC=0.97). Such performance could enable egregious link additions to be blocked automatically with low false-positive rates, while prioritizing the remainder for human inspection. Finally, a live Wikipedia implementation of the technique has been developed. | 0 | 0 |
| What Wikipedia Deletes: Characterizing Dangerous Collaborative Content | Andrew G. West Insup Lee |
Wikipedia User generated content Collaboration Redaction Content removal Copyright Information security |
WikiSym | English | October 2011 | Collaborative environments, such as Wikipedia, often have low barriers-to-entry in order to encourage participation. This accessibility is frequently abused (e.g., vandalism and spam). However, certain inappropriate behaviors are more threatening than others. In this work, we study contributions which are not simply "undone" -- but *deleted* from revision histories and public view. Such treatment is generally reserved for edits which: (1) present a legal liability to the host (e.g., copyright issues, defamation), or (2) present privacy threats to individuals (i.e., contact information). Herein, we analyze one year of Wikipedia's public deletion log and use brute-force strategies to learn about privately handled redactions. This permits insight about the prevalence of deletion, the reasons that induce it, and the extent of end-user exposure to dangerous content. While Wikipedia's approach is generally quite reactive, we find that copyright issues prove most problematic of those behaviors studied. | 0 | 1 |
| Entre o agrupamento e a comunidade virtual: colaboração e conflitos na edição das biografias dos jogadores “Adriano” e “Ronaldo” na Wikipédia em português | Carlos Frederico de Brito d’Andréa | Colaboração Wikipedia Edição Agrupamento Comunidade virtual |
XXXIV Congresso Brasileiro de Ciências da Comunicação | Portuguese | September 2011 | 9 | 0 | |
| Link Spamming Wikipedia for Profit | Andrew G. West Jian Chang Krishna Venkatasubramanian Oleg Sokolsky Insup Lee |
Web 2.0 spam Spam Wikipedia Wiki Collaborative security Attack model Measurement study Spam economics |
CEAS '11: Proc. of the 8th Annual Collaboration, Electronic Messaging, Anti-Abuse, and Spam Conference | English | September 2011 | Collaborative functionality is an increasingly prevalent web technology. To encourage participation, these systems usually have low barriers-to-entry and permissive privileges. Unsurprisingly, ill-intentioned users try to leverage these characteristics for nefarious purposes. In this work, a particular abuse is examined -- link spamming -- the addition of promotional or otherwise inappropriate hyperlinks. Our analysis focuses on the wiki model and the collaborative encyclopedia, Wikipedia, in particular. A principal goal of spammers is to maximize *exposure*, the quantity of people who view a link. Creating and analyzing the first Wikipedia link spam corpus, we find that existing spam strategies perform quite poorly in this regard. The status quo spamming model relies on link persistence to accumulate exposures, a strategy that fails given the diligence of the Wikipedia community. Instead, we propose a model that exploits the latency inherent in human anti-spam enforcement. Statistical estimation suggests our novel model would produce significantly more link exposures than status quo techniques. More critically, the strategy could prove economically viable for perpetrators, incentivizing its exploitation. To this end, we address mitigation strategies. | 0 | 0 |
| Multilingual Vandalism Detection using Language-Independent & Ex Post Facto Evidence | Andrew G. West Insup Lee |
PAN-CLEF | English | September 2011 | There is much literature on Wikipedia vandalism detection. However, this writing addresses two facets given little treatment to date. First, prior efforts emphasize zero-delay detection, classifying edits the moment they are made. If classification can be delayed (e.g., compiling offline distributions), it is possible to leverage ex post facto evidence. This work describes/evaluates several features of this type, which we find to be overwhelmingly strong vandalism indicators. Second, English Wikipedia has been the primary test-bed for research. Yet, Wikipedia has 200+ language editions and use of localized features impairs portability. This work implements an extensive set of language-independent indicators and evaluates them using three corpora (German, English, Spanish). The work then extends to include language-specific signals. Quantifying their performance benefit, we find that such features can moderately increase classifier accuracy, but significant effort and language fluency are required to capture this utility. Aside from these novel aspects, this effort also broadly addresses the task, implementing 65 total features. Evaluation produces 0.840 PR-AUC on the zero-delay task and 0.906 PR-AUC with ex post facto evidence (averaging languages). Performance matches the state-of-the-art (English), sets novel baselines (German, Spanish), and is validated by a first-place finish over the 2011 PAN-CLEF test set. | 0 | 0 | |
| An Introductory Historical Contextualization of Online Creation Communities for the Building of Digital Commons: The Emergence of a Free Culture Movement | Mayo Fuster Morell | Proceedings of the 6th Open Knowledge Conference | English | June 2011 | Online Creation Communities (OCCs) are a set of individuals that communicate, interact and collaborate; in several forms and degrees of participation which are eco-systemically integrated; mainly via a platform of participation on the Internet, on which they depend; and aiming at knowledge-making and sharing. The paper will first provide an historical contextualization of OCCs. Then, it will show how the development of OCCs is fuelled by, and contributes to, the rise of a free culture movement defending and advocating the creation of digital commons, and provide an empirically grounded definition of free culture movement. The empirical analysis is based on content analysis of 80 interviews with free culture practitioners, promoters and activists with an international background or rooted in Europe, USA and Latino-America and the content analysis of two seminar discussions. The data collection was developed from 2008 to 2010. | 0 | 0 | |
| Towards a diversity-minded Wikipedia | Fabian Flöck Denny Vrandečić Elena Simperl |
Wikipedia Diversity Community-driven content creation Social dynamics Opinion mining Sentiment analysis |
WebSci Conference | English | June 2011 | Wikipedia is a top-ten Web site providing a free encyclopedia created by an open community of volunteer contributors. As investigated in various studies over the past years, contributors have different backgrounds, mindsets and biases; however, the effects - positive and negative - of this diversity on the quality of the Wikipedia content, and on the sustainability of the overall project are yet only partially understood. In this paper we discuss these effects through an analysis of existing scholarly literature in the area and identify directions for future research and development; we also present an approach for diversity-minded content management within Wikipedia that combines techniques from semantic technologies, data and text mining and quantitative social dynamics analysis to create greater awareness of diversity-related issues within the Wikipedia community, give readers access to indicators and metrics to understand biases and their impact on the quality of Wikipedia articles, and support editors in achieving balanced versions of these articles that leverage the wealth of knowledge and perspectives inherent to large-scale collaboration. | 24 | 1 |
| Wikipedia & Research: The innovative character of Wikipedia research and the new challenges (and opportunities) associated with it | Mayo Fuster Morell | Proceedings of the 6th Open Knowledge Conference | English | June 2011 | The workshop will focus on addressing the stage of Wikipedia research and in general common-based peer production (less focused on the content than on the methodologies and research process itself) and the innovations, problems and new insights regarding (action) research on common-based peer production. | 0 | 0 | |
| A Characterization of Wikipedia Content Based on Motifs in the Edit Graph | Guangyu Wu Martin Harrigan Pádraig Cunningham |
SMUC '11: Proceedings of the 3rd international workshop on Search and mining user-generated contents | English | February 2011 | Good Wikipedia articles are authoritative sources due to the collaboration of a number of knowledgeable contributors. This is the many eyes idea. The edit network associated with a Wikipedia article can tell us something about its quality or authoritativeness. In this paper we explore the hypothesis that the characteristics of this edit network are predictive of the quality of the corresponding article's content. We characterize the edit network using a profile of network motifs and we show that this network motif profile is predictive of the Wikipedia quality classes assigned to articles by Wikipedia editors. We further show that the network motif profile can identify outlier articles particularly in the 'Featured Article' class, the highest Wikipedia quality class. | 8 | 0 | |
| A DSL for corporate wiki initialization | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | English | 2011 | 0 | 0 | |||
| A Distributed Wiki System Based on Peer-to-Peer File Sharing Principles | Alexander Craig Alan Davoust Babak Esfandiari |
Wiki P2P Graph queries Trust |
WI-IAT | English | 2011 | 0 | 0 | |
| A Research for the Centrality of Article Edit Collective in Wikipedia | Dongjie Zhao Haitao Yang Jian Jiang Deyi Li Haisu Zhang |
Wikipedia Article edit interaction network Centrality Networked data mining Collective intelligence |
ICM | English | 2011 | 0 | 0 | |
| A capstone wiki knowledge base: A case study of an online tool designed to promote life-long learning through engineering literature research | Issues in Science and Technology Librarianship | English | 2011 | 0 | 0 | |||
| A cloud-based semantic wiki for user training in healthcare process management | Studies in Health Technology and Informatics | English | 2011 | 0 | 0 | |||
| A collaborative, wiki-based organic chemistry project incorporating free chemistry software on the Web | Journal of Chemical Education | English | 2011 | 0 | 0 | |||
| A comparison of four association engines in divergent thinking support systems on wikipedia | Kobkrit Viriyayudhakorn Susumu Kunifuji Mizuhito Ogawa |
KICSS | English | 2011 | 0 | 0 | ||
| A generalized method for word sense disambiguation based on wikipedia | Chenliang Li Aixin Sun Anwitaman Datta |
Context pruning Wikipedia Word sense disambiguation |
ECIR | English | 2011 | 0 | 0 | |
| A lexicon for processing archaic language: the case of XIXth century Slovene | Tomaž Erjavec Christoph Ringlstetter Maja Žorga Annette Gotscharek |
WoLeR 2011: International Workshop on Lexical Resources | English | 2011 | The paper presents a lexicon to support computational processing of historical Slovene texts. Historical Slovene texts are being increasingly digitised and made available on the internet but are still underutilised as no language technology support is offered for their processing. Appropriate tools and resources would enable full-text searching with modern-day lemmas, modernisation of archaic language to make it more accessible to today's readers, and automatic OCR correction. We discuss the lexicon needed to support tokenisation, modernisation, lemmatisation and part-of-speech tagging of historical texts. The process of lexicon acquisition relies on a proof-read corpus, a large lexicon of contemporary Slovene, and tools to map historical forms to their contemporary equivalents via a set of rewrite rules, and to provide an editing environment for lexicon construction. The lexicon, currently work in progress, will be made publicly available; it should help not only in making digital libraries more accessible but also provide a quantitative basis for linguistic explorations of historical Slovene texts and a prototype electronic dictionary of archaic Slovene. | 1 | 0 | |
| A link-based visual search engine for Wikipedia | David N. Milne Ian H. Witten |
Exploratory search Information retrieval Information visualization Semantic relatedness Wikipedia |
JCDL | English | 2011 | 0 | 0 | |
| A meta-reflective wiki for collaborative design | Li Zhu Ivan Vaghi Barbara Rita Barricelli |
Hive-Mind Space model MikiWiki Boundary objects Co-evolution End-user development Habitable environment Meta-design Mikinugget Wiki |
WikiSym | English | 2011 | 0 | 0 | |
| A semantic wiki based on spatial hypertext | Journal of Universal Computer Science | English | 2011 | 0 | 0 | |||
| A semantic wiki for user training in ePrescribing processes | D. Papakonstantinou F. Malamateniou G. Vassilacopoulos |
Cloud computing EPrescribing Semantic wiki User training |
PETRA | English | 2011 | 0 | 0 | |
| An Empirical Study on Application of Wiki-Based Collaborative Lesson-Preparing | Yingjie Ren Chaohua Gong |
Lesson-preparing Collaborative lesson-preparing Wiki Eduwiki Teachers Application Knowledge management Evaluation |
ICM | English | 2011 | 0 | 0 | |
| An annotation scheme for automated bias detection in Wikipedia | Livnat Herzig Alex Nunes Batia Snir |
LAW V | English | 2011 | 0 | 0 | ||
| An experience using a spatial hypertext Wiki | Carlos Solis Nour Ali |
ShyWiki Wiki Knowledge management Spatial hypertext |
HT | English | 2011 | 0 | 0 | |
| An exploratory study of navigating wikipedia semantically: model and application | I-Chin Wu Yi-Sheng Lin Che-Hung Liu |
SNA-based summary Navigation Normalized google distance Semantically-based Wikipedia |
OCSC | English | 2011 | 0 | 0 | |
| Analysis of social learning network for wiki in moodle E-Learning | Proceedings - 4th International Conference on Interaction Sciences: IT, Human and Digital Content, ICIS 2011 | English | 2011 | 0 | 0 | |||
| Analysis on Multilingual Discussion for Wikipedia Translation | Linsi Xia Naomi Yamashita Toru Ishida |
Wikipedia Translation Multilingual communication Machine translation Multilingual Liquid Threads |
CULTURE-COMPUTING | English | 2011 | 0 | 0 | |
| Annotating social acts: authority claims and alignment moves in Wikipedia talk pages | Emily M. Bender Jonathan T. Morgan Meghan Oxley Mark Zachry Brian Hutchinson Alex Marin Bin Zhang Mari Ostendorf |
LSM | English | 2011 | 0 | 0 | ||
| Annotating software documentation in semantic wikis | Klaas Andries de Graaf | Semantic annotation Semantic wiki Software documentation Software engineering knowledge |
ESAIR | English | 2011 | 0 | 0 | |
| Annotations on access controls in wikis: a proposal | Chikashi Fuchimoto Masayoshi Aritsugi |
Annotations on access controls Collaboration |
IiWAS | English | 2011 | 0 | 0 | |
| Application of Bradford's law and Lotka's law to web metrics study on the Wiki website | Journal of Educational Media and Library Science | English | 2011 | 0 | 0 | |||
| Assessing collaboration in a wiki: The reliability of university students' peer assessment | Internet and Higher Education | English | 2011 | 0 | 0 | |||
| Assessments in large- and small-scale wiki collaborative learning environments: recommendations for educators and wiki designers | Portia Pusey Gabriele Meiselwitz |
Assessment Wiki Wiki learning Wiki learning environment |
OCSC | English | 2011 | 0 | 0 | |
| Automatic gazetteer generation from wikipedia | Alessio Bosca Luca Dini |
NLP4DL'09/AT4DL | English | 2011 | 0 | 0 | ||
| Automatically assigning Wikipedia articles to macro-categories | Jacopo Farina Riccardo Tasso David Laniado |
Wikipedia Category graph Topic coverage |
Hypertext | English | 2011 | The online encyclopedia Wikipedia offers millions of articles which are organized in a hierarchical category structure, created and updated by users. In this paper we present a technique which leverages this rich and disordered graph to assign each article to one or more topics. We modify an existing approach, based on the shortest paths between categories, in order to account for the direction of the hierarchy. | 0 | 0 |
| Autopedia: automatic domain-independent Wikipedia article generation | Conglei Yao Xu Jia Sicong Shou Shicong Feng Feng Zhou Hongyan Liu |
Wikipedia Article generation Domain independent Template selection |
World Wide Web | English | 2011 | 0 | 0 | |
| Blurring boundaries: Two groups of girls collaborate on a wiki | Journal of Adolescent and Adult Literacy | English | 2011 | 0 | 0 | |||
| Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction | Patrick Schone Tim Allison Chris Giannella Craig Pfeifer |
Multilingual relation extraction Wikipedia |
ICTAI | English | 2011 | 0 | 0 | |
| Building a geographical ontology by using Wikipedia | Quoc Hung-Ngo Son Doan Werner Winiwarter |
Geographical ontology Ontology building |
IiWAS | English | 2011 | 0 | 0 | |
| Building a signed network from interactions in Wikipedia | Silviu Maniu Bogdan Cautis Talel Abdessalem |
Online community Signed networks Social applications Web of trust Wikipedia |
DBSocial | English | 2011 | 0 | 1 | |
| Calculating Wikipedia Article Similarity Using Machine Translation Evaluation Metrics | Maike Erdmann Andrew Finch Kotaro Nakayama Eiichiro Sumita Takahiro Hara Shojiro Nishio |
Cross-language Document Similarity Bilingual Dictionary Construction Data mining |
WAINA | English | 2011 | 0 | 0 | |
| Casting a web of trust over Wikipedia: an interaction-based approach | Silviu Maniu Talel Abdessalem Bogdan Cautis |
Wikipedia Online community Signed networks Social applications Web of trust |
World Wide Web | English | 2011 | 0 | 0 | |
| Characterization and prediction of Wikipedia edit wars | Róbert Sumi Taha Yasseri András Rung András Kornai János Kertész |
Wikipedia Collaboration Conflict Classification |
WebSci Conference | English | 2011 | We present a new, efficient method for automatically detecting conflict cases and test it on five different language Wikipedias. We discuss how the number of edits, reverts, the length of discussions deviate in such pages from those following the general workflow. | 4 | 2 |
| Characterizing Wikipedia pages using edit network motif profiles | Guangyu Wu Martin Harrigan Pádraig Cunningham |
Authoritativeness Network motifs Wikipedia |
SMUC | English | 2011 | Good Wikipedia articles are authoritative sources due to the collaboration of a number of knowledgeable contributors. This is the many eyes idea. The edit network associated with a Wikipedia article can tell us something about its quality or authoritativeness. In this paper we explore the hypothesis that the characteristics of this edit network are predictive of the quality of the corresponding article's content. We characterize the edit network using a profile of network motifs and we show that this network motif profile is predictive of the Wikipedia quality classes assigned to articles by Wikipedia editors. We further show that the network motif profile can identify outlier articles particularly in the 'Featured Article' class, the highest Wikipedia quality class. | 0 | 0 |
| Classifying Wikipedia entities into fine-grained classes | Maksim Tkatchenko Alexander Ulanov Andrey Simanovsky |
ICDEW | English | 2011 | 0 | 0 | ||
| ClassroomWiki: a collaborative Wiki for institutional use | Rupali Sawant Apoorv Singhal Priyank Nigam Utkarsh Shah |
ICWET | English | 2011 | 0 | 0 | ||
| Co-authorship 2.0: patterns of collaboration in Wikipedia | David Laniado Riccardo Tasso |
Wikipedia Collaboration network Online production Social network analysis |
Hypertext | English | 2011 | The study of collaboration patterns in wikis can help shed light on the process of content creation by online communities. To turn a wiki's revision history into a collaboration network, we propose an algorithm that identifies as authors of a page the users who provided most of its relevant content, measured in terms of quantity and of acceptance by the community. The scalability of this approach allows us to study the English Wikipedia community as a co-authorship network. We find evidence of the presence of a nucleus of very active contributors, who seem to spread over the whole wiki, and to interact preferentially with inexperienced users. The fundamental role played by this elite is witnessed by the growing centrality of sociometric stars in the network. Isolating the community active around a category, it is possible to study its specific dynamics and most influential authors. | 0 | 1 |
| CoSyne: a framework for multilingual content synchronization of wikis | Christof Monz Vivi Nastase Matteo Negri Angela Fahrni Yashar Mehdad Michael Strube |
Multilinguality Recognizing textual entailment Translation Wiki |
WikiSym | English | 2011 | 0 | 0 | |
| Collaborative learning using wiki web sites for computer science undergraduate education: A case study | IEEE Transactions on Education | English | 2011 | 0 | 0 | |||
| Collaborative learning with a wiki: Differences in perceived usefulness in two contexts of use | Journal of Computer Assisted Learning | English | 2011 | 0 | 0 | |||
| Collaborative sensemaking during admin permission granting in wikipedia | Katie Derthick Patrick Tsao Travis Kriplean Alan Borning Mark Zachry David W. McDonald |
Collaboration software Contributor systems Sensemaking System administration |
OCSC | English | 2011 | 0 | 0 | |
| Collaborative video editing for Wikipedia | Michael Dale | WikiSym | English | 2011 | 0 | 0 | ||
| Collective memory building in Wikipedia: The case of North African uprisings | Michela Ferron Paolo Massa |
Wikipedia Web 2.0 Collective memory Revolution Traumatic event Egypt North Africa |
WikiSym | English | 2011 | Since December 2010, a series of protests and uprisings have shocked North African countries such as Tunisia, Egypt, Libya, Syria, Yemen and more. In this paper, focusing mainly on the Egyptian revolution, we provide evidence of the intense edit activity that occurred during these uprisings on the related Wikipedia pages. Thousands of people provided their contribution on the content pages and discussed improvements and disagreements on the associated talk pages as the traumatic events unfolded. We propose to interpret this phenomenon as a process of collective memory building and argue how on Wikipedia this can be studied empirically and quantitatively in real time. We explore and suggest possible directions for future research on collective memory formation of traumatic and controversial events in Wikipedia. | 14 | 0 |
| Comparison of wiki-based process modeling systems | ACM International Conference Proceeding Series | English | 2011 | 0 | 0 | |||
| Conceptual Indexing of Documents Using Wikipedia | Carlo Abi Chahine Nathalie Chaignaud Jean-Philippe Kotowicz Jean-Pierre Pecuchet |
Document indexing Keyword and Topic Extraction Directed Acyclic Graph Wikipedia |
WI-IAT | English | 2011 | 0 | 0 | |
| Constructing a semantic wiki-based teamwork platform for collaborative e-research | 2011 International Conference on Electrical and Control Engineering, ICECE 2011 - Proceedings | English | 2011 | 0 | 0 | |||
| Cooperative or collaborative literacy practices: Mapping metadiscourse in a business students' Wiki group project | Australasian Journal of Educational Technology | English | 2011 | 0 | 0 | |||
| Credibility Assessment Using Wikipedia for Messages on Social Network Services | Yu Suzuki Akiyo Nadamoto |
Credibility Social Network Service Wikipedia |
DASC | English | 2011 | 0 | 0 | |
| Cross lingual text classification by mining multilingual topics from wikipedia | Xiaochuan Ni Jian T. Sun Jian Hu Zheng Chen |
Cross lingual text classification Multilingual Topic modeling Universal-topics Wikipedia |
WSDM | English | 2011 | 0 | 0 | |
| DBpedia Spotlight: Shedding Light on the Web of Documents | Pablo N. Mendes Max Jakob Andrés García-Silva Christian Bizer |
Text Annotation Linked data DBpedia Named Entity Disambiguation |
International Conference on Semantic Systems | English | 2011 | 0 | 0 | |
| Design and implementation of the Sweble Wikitext parser: unlocking the structured data of Wikipedia | Hannes Dohrn Dirk Riehle |
AST PEG Sweble WYSIWYG Wikipedia Abstract syntax tree Parsing expression grammar Wiki Parser |
WikiSym | English | 2011 | 0 | 0 | |
| Detection of Text Quality Flaws as a One-class Classification Problem | Maik Anderka Benno Stein Nedim Lipka |
Information quality Wikipedia Quality Flaw Prediction One-class Classification |
20th ACM Conference on Information and Knowledge Management (CIKM 11) | English | 2011 | For Web applications that are based on user generated content the detection of text quality flaws is a key concern. Our research contributes to automatic quality flaw detection. In particular, we propose to cast the detection of text quality flaws as a one-class classification problem: we are given only positive examples (= texts containing a particular quality flaw) and decide whether or not an unseen text suffers from this flaw. We argue that common binary or multiclass classification approaches are ineffective here, and we underpin our approach by a real-world application: we employ a dedicated one-class learning approach to determine whether a given Wikipedia article suffers from certain quality flaws. Since in the Wikipedia setting the acquisition of sensible test data is quite intricate, we analyze the effects of a biased sample selection. In addition, we illustrate the classifier effectiveness as a function of the flaw distribution in order to cope with the unknown (real-world) flaw-specific class imbalances. Altogether, provided test data with little noise, four out of ten important quality flaws in Wikipedia can be detected with a precision close to 1. | 0 | 0 |
| Discovering context: classifying tweets through a semantic transform based on wikipedia | Yegin Genc Yasuaki Sakamoto Jeffrey V. Nickerson |
Cognition Context Latent semantic analysis Semantics Text classification Wikipedia |
FAC | English | 2011 | 0 | 0 | |
| Discussion about Translation in Wikipedia | Ari Hautasaari Toru Ishida |
Wikipedia Translation Talk page |
CULTURE-COMPUTING | English | 2011 | 0 | 0 | |
| Document Indexing and Retrieval Using Wikipedia | Carlo Abi Chahine Nathalie Chaignaud Jean-Philippe Kotowicz Jean-Pierre Pecuchet |
Information retrieval Document indexing Wikipedia |
SITIS | English | 2011 | 0 | 0 | |
| Document Topic Extraction Based on Wikipedia Category | Jiali Yun Liping Jing Jian Yu Houkuan Huang Ying Zhang |
Topic Extraction Document Representation Wikipedia Category Semantic Relatedness |
CSO | English | 2011 | 0 | 0 | |
| Does collaboration occur when children are learning with the support of a wiki? | Turkish Online Journal of Educational Technology | English | 2011 | 0 | 0 | |||
| Don't bite the newbies: how reverts affect the quantity and quality of Wikipedia work | Aaron Halfaker Aniket Kittur John Riedl |
WikiWork Wikipedia Experience Motivation Productivity Quality Revert |
WikiSym | English | 2011 | Reverts are important to maintaining the quality of Wikipedia. They fix mistakes, repair vandalism, and help enforce policy. However, reverts can also be damaging, especially to the aspiring editor whose work they destroy. In this research we analyze 400,000 Wikipedia revisions to understand the effect that reverts had on editors. We seek to understand the extent to which they demotivate users, reducing the workforce of contributors, versus the extent to which they help users improve as encyclopedia editors. Overall we find that reverts are powerfully demotivating, but that their net influence is that more quality work is done in Wikipedia as a result of reverts than is lost by chasing editors away. However, we identify key conditions – most specifically new editors being reverted by much more experienced editors – under which reverts are particularly damaging. We propose that reducing the damage from reverts might be one effective path for Wikipedia to solve the newcomer retention problem. | 0 | 0 |
| Don't leave me alone: effectiveness of a framed wiki-based learning activity | Nikolaos Tselios Panagiota Altanopoulou Vassilis Komis |
Activity design Collaborative learning Learning outcome Project based learning Web 2.0 Wiki |
WikiSym | English | 2011 | 0 | 0 | |
| Edit wars in Wikipedia | Róbert Sumi Taha Yasseri András Rung András Kornai János Kertész |
IEEE Third International Conference on Social Computing | English | 2011 | We present a new, efficient method for automatically detecting severe conflicts 'edit wars' in Wikipedia and evaluate this method on six different language WPs. We discuss how the number of edits, reverts, the length of discussions, the burstiness of edits and reverts deviate in such pages from those following the general workflow, and argue that earlier work has significantly over-estimated the contentiousness of the Wikipedia editing process. | 9 | 2 | |
| Editing knowledge resources: the wiki way | Francesco Ronzano Andrea Marchetti Maurizio Tesconi |
Collaborative editing web applications Knowledge resources Web and social knowledge management Wiki paradigm |
CIKM | English | 2011 | 0 | 0 | |
| Effectively mining wikipedia for clustering multilingual documents | N. Kiran Kumar G. S. K. Santosh Vasudeva Varma |
Document representation Multilingual document clustering Wikipedia |
NLDB | English | 2011 | 0 | 0 | |
| Effectiveness of a Framed Wiki-Based Learning Activity in the Context of HCI Education | Nikolaos Tselios Panagiota Altanopoulou Christos Katsanos |
Web 2.0 Wiki Activity design Hci education Project based learning Collaborative learning Learning outcome |
PCI | English | 2011 | 0 | 0 | |
| Embedding the HeaRT rule engine into a semantic wiki | Studies in Computational Intelligence | English | 2011 | 0 | 0 | |||
| Emphasising assessment 'as' learning by assessing wiki writing assignments collaboratively and publicly online | Australasian Journal of Educational Technology | English | 2011 | 0 | 0 | |||
| Empirical Study on Application of Wiki Based Collaborative Lesson-Preparing | Yingjie Ren Chaohua Gong |
Collaborative lesson-preparing Wiki Eduwiki Teachers Application Knowledge management Evaluation |
ICM | English | 2011 | 0 | 0 | |
| Enable Wikis for seamless hypervideo integration | Niels Seidel | CSCL Hypervideo Wiki |
ECCE | English | 2011 | 0 | 0 | |
| Enabling type/condition-specified entity/fact retrieval using semantic knowledge extracted from wikipedia | Sofia J. Athenikos Xia Lin |
Entity/fact/relation extraction Entity/fact/relation retrieval Faceted search Semantic search Semantic web Wikipedia |
SMER | English | 2011 | 0 | 0 | |
| Encyclopedic knowledge patterns from wikipedia links | Andrea Giovanni Nuzzolese Aldo Gangemi Valentina Presutti Paolo Ciancarini |
ISWC | English | 2011 | 0 | 0 | ||
| Erasing "Property Lines": A Collaborative Notion of Authorship and Textual Ownership on a Fan Wiki | Computers and Composition | English | 2011 | 0 | 0 | |||
| Evaluating significance of historical entities based on tempo-spatial impacts analysis using Wikipedia link structure | Yuku Takahashi Hiroaki Ohshima Mitsuo Yamamoto Hirotoshi Iwasaki Satoshi Oyama Katsumi Tanaka |
Historical entities Historical entity importance Pagerank Wikipedia structure analysis |
HT | English | 2011 | 0 | 0 | |
| Examining the "leftness" property of Wikipedia categories | Karl Gyllstrom Marie-Francine Moens |
Categories Wikipedia |
CIKM | English | 2011 | 0 | 0 | |
| Experience with software support for managing student-authored wiki textbooks | ASEE Annual Conference and Exposition, Conference Proceedings | English | 2011 | 0 | 0 | |||
| Experiences with Semantic Wikis for Architectural Knowledge Management | Remco C. de Boer Hans van Vliet |
Architectural knowledge management Semantic wiki Experience report |
WICSA | English | 2011 | In this paper, we reflect on our experiences with using semantic wikis for architectural knowledge management in two different contexts: e-government and distributed software development. Whereas our applications of semantic wikis in e-government focus on organizing and structuring architectural knowledge for reuse, the applications in distributed software development focus on searching and querying architectural knowledge. Yet, the emerging research challenges - alignment of knowledge models, knowledge versioning, change acknowledgements - are very similar. | 0 | 0 |
| Explaining the sustainability of digital ecosystems based on the wiki model through critical-mass theory | IEEE Transactions on Industrial Electronics | English | 2011 | 0 | 0 | |||
| Exploring Wikipedia with HMpara | David N. Milne Ian H. Witten |
Exploratory search Information visualization Wikipedia |
JCDL | English | 2011 | 0 | 0 | |
| Exploring linguistic points of view of Wikipedia | Paolo Massa Federico Scrinzi |
Wikipedia Linguistic point of view Neutral point of view Cross-cultural Language Comparison Open source Web |
WikiSym | English | 2011 | The 3 million articles of the English Wikipedia have been written since 2001 by more than 14 million volunteers. On each article, the community of editors strive to reach a neutral point of view, representing all significant views fairly, proportionately, and without bias. However, besides the English one, there are more than 270 Wikipedias in different languages and their relatively isolated communities of editors are not forced by the platform to discuss and negotiate their points of view. So the empirical question is: do communities on different language editions of Wikipedia develop their own diverse Linguistic Points of View (LPOV)? To answer this question we created Manypedia, a web tool whose goal is to ease cross-cultural comparisons of Wikipedia language communities by analyzing their different representations of the same topic. | 0 | 1 |
| Exploring students' perceptions of integrating Wiki technology and peer feedback into English writing courses | English Teaching | English | 2011 | 0 | 0 | |||
| Exploring underproduction in Wikipedia | Andreea D. Gorbatai | Collective production Social goods Underproduction |
WikiSym | English | 2011 | 0 | 0 | |
| Exploring wiki: measuring the quality of social media using ant colony metaphor | Soumya Banerjee Nashwa El-Bendary Hameed Al-Qaheri |
Wikipedia Ant colony optimization Bio-inspired Content management Quality measurement Reputation paradigm Wiki |
MEDES | English | 2011 | 0 | 0 | |
| Exploring wikipedia's category graph for query classification | Milad Alemzadeh Richard Khoury Fakhri Karray |
Category labeling Natural Language Processing Query classification Wikipedia |
AIS | English | 2011 | 0 | 0 | |
| Extracción de Corpus Paralelos de la Wikipedia basada en la Obtención de Alineamientos Bilingües a Nivel de Frase | Joan Albert Silvestre-Cerdà Mercedes García-Martínez Alberto Barrón-Cedeño Jorge Civera Paolo Rosso |
Comparable corpora Parallel sentences extraction Statistical machine translation |
Proceedings of the Workshop on Iberian Cross-Language Natural Language Processing Tasks (ICL 2011) | Spanish | 2011 | This paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We have used word-level alignment models from IBM in order to obtain phrase-level bilingual alignments between document pairs. We have manually annotated a set of test English-Spanish comparable documents in order to evaluate the model. The obtained results are encouraging. | 4 | 0 |
| Extracting events from wikipedia as RDF triples linked to widespread semantic web datasets | Carlo Aliprandi Francesco Ronzano Andrea Marchetti Maurizio Tesconi Salvatore Minutoli |
Knowledge Extraction Knowledge representation Natural Language Processing Semantic web Semantics |
OCSC | English | 2011 | 0 | 0 | |
| Feedback mechanisms and their impact on motivation to contribute to wikis in higher education | Athanasios Mazarakis Clemens van Dinther |
Course wiki Experiment Feedback Motivation |
WikiSym | English | 2011 | 0 | 0 | |
| Focus and element length for book and wikipedia retrieval | Jaap Kamps Marijn Koolen |
INEX | English | 2011 | 0 | 0 | ||
| Gender differences in Wikipedia editing | Judd Antin Raymond Yee Coye Cheshire Oded Nov |
Wikipedia Gender Participation |
WikiSym | English | 2011 | As Wikipedia has become an indispensable source of online information, concerns about who writes, edits, and maintains it have come to the forefront. In particular, the 2010 UNU-MERIT survey found evidence of a significant gender skew: fewer than 13% of Wikipedia contributors are women. However, the number of contributors is just one way to examine gender differences in contribution. In this paper we take a more fine-grained perspective by examining how much and what types of Wiki-work men and women tend to do. First, we find that the so-called “Gender Gap” in number of editors may not be as wide as prior studies have suggested. Second, although more than 80% of editors in our sample were men, among the bottom 75% of editors by activity level, we find that men and women made similar numbers of revisions. However, among the most active Wikipedians men tended to make many more revisions than women. Finally, we find that the most active women in our sample tended to make larger revisions than the most active men. We conclude by discussing directions for future research. | 0 | 0 |
| Gist of a Thread in Social Network Services Based on Credibility of Wikipedia | Akiyo Nadamoto Yu Suzuki Takeshi Abekawa |
HICSS | English | 2011 | 0 | 0 | ||
| Graph-based named entity linking with wikipedia | Ben Hachey Will Radford James R. Curran |
Entity resolution Integration Text mining Web intelligence Wikipedia |
WISE | English | 2011 | 0 | 0 | |
| GreenWiki: a tool to support users' assessment of the quality of Wikipedia articles | Daniel Hasan Dalip Raquel Lara Santos Diogo Rennó Oliveira Valéria Freitas Amaral Marcos André Gonçalves Raquel Oliveira Prates Raquel C.M. Minardi Jussara Marques de Almeida |
Human factors Measurement Quality assessment Quality metrics User issues Wikipedia |
JCDL | English | 2011 | In this work, we present GreenWiki, which is a wiki with a panel of quality indicators to assist the reader of a Wikipedia article in assessing its quality. | 4 | 0 |
| Handling flammable materials: Wikipedia biographies of living persons as contentious objects | Elisabeth Joyce Brian Butler Jacqueline Pike |
Wikipedia Contentious objects Policy Socio-technical systems |
IConference | English | 2011 | 0 | 0 | |
| Harvesting Wikipedia Knowledge to Identify Topics in Ongoing Natural Language Dialogs | Alexa Breuing Ulli Waltinger Ipke Wachsmuth |
Wikipedia Information retrieval Human-Agent Interaction Topic Identification |
WI-IAT | English | 2011 | 0 | 0 | |
| Hot off the Wiki: Dynamics, Practices, and Structures in Wikipedia’s Coverage of the Tōhoku Catastrophes | Brian Keegan Darren Gergle Noshir Contractor |
Wikipedia Breaking news Current events Network analysis Bipartite network Emergent group High tempo Collaboration |
WikiSym | English | 2011 | Wikipedia editors are uniquely motivated to collaborate around current and breaking news events. However, the speed, urgency, and intensity with which these collaborations unfold also impose a substantial burden on editors’ abilities to effectively coordinate tasks and process information. We analyze the patterns of activity on Wikipedia following the 2011 Tōhoku earthquake and tsunami to understand the dynamics of editor attention and participation, novel practices employed to collaborate on these articles, and the resulting coauthorship structures which emerge between editors and articles. Our findings have implications for supporting future coverage of breaking news articles, theorizing about motivations to participate in online community, and illuminating Wikipedia’s potential role in storing cultural memories of catastrophe. | 0 | 0 |
| How to Reason by HeaRT in a Semantic Knowledge-Based Wiki | Weronika T. Adrian Szymon Bobek Grzegorz J. Nalepa Krzysztof Kaczor Krzysztof Kluza |
Knowledge-Based Systems Knowledge representation Reasoning Semantic Wikis Knowledge management |
ICTAI | English | 2011 | 0 | 0 | |
| ICKEwiki: Requirements and concepts for an enterprise wiki for SMEs | WikiSym | English | 2011 | 0 | 0 | |||
| Identifying parallel documents from a large bilingual collection of texts: application to parallel article extraction in Wikipedia | Alexandre Patry Philippe Langlais |
BUCC | English | 2011 | 0 | 0 | ||
| Identifying shared leadership in Wikipedia | Haiyi Zhu Robert E. Kraut Yi C. Wang Aniket Kittur |
Proceedings of the 2011 annual conference on Human factors in computing systems | English | 2011 | In this paper, we introduce a method to measure shared leadership in Wikipedia as a step in developing a new model of online leadership. We show that editors with varying degrees of engagement and from peripheral as well as central roles all act like leaders, but that core and peripheral editors show different profiles of leadership behavior. Specifically, we developed machine learning models to automatically identify four types of leadership behaviors from 4 million messages sent between Wikipedia editors. We found strong evidence of shared leadership in Wikipedia, with editors in peripheral roles producing a large proportion of leadership behaviors. | 4 | 1 | |
| Identifying verbal collocations in wikipedia articles | István Nagy T. Veronika Vincze |
Light verb constructions Multiword expressions Verb-particle constructions Verbal collocations Wikipedia |
TSD | English | 2011 | 0 | 0 | |
| Implementation of a wiki-based information and communication system for academia Europaea | Proceedings of the International Conference on Information Technology Interfaces, ITI | English | 2011 | 0 | 0 | |||
| Influence of several years use of wiki on academic motivation improvement | International Journal of Engineering Education | English | 2011 | 0 | 0 | |||
| Information literacy across the disciplines: Using the science information literacy wiki as a collaborative tool in agriculture, natural resources, and science librarianship | Journal of Agricultural and Food Information | English | 2011 | 0 | 0 | |||
| Integrated process and control design by the normal vector approach. Application to the Tennessee-Eastman process (http://wiki.stce.rwth-aachen.de/content/research/index.html) | Computer Aided Chemical Engineering | English | 2011 | 0 | 0 | |||
| Integrating Twitter into Wiki to support informal awareness | Xuan Zhao Wenpeng Xiao Changyan Chi Min Yang |
Twitter Wiki Cross-channel workspace Informal awareness |
Computer-Supported Cooperative Work | English | 2011 | 0 | 0 | |
| Intelligent humanoid robot with japanese Wikipedia ontology and robot action ontology | Shotaro Kobayashi Susumu Tamagawa Takeshi Morita Takahira Yamaguchi |
International Conference on Human-robot Interaction | English | 2011 | 0 | 0 | ||
| Know we: A semantic wiki for knowledge engineering | Applied Intelligence | English | 2011 | 0 | 0 | |||
| Knowledge construction and knowledge sharing: A wiki-based approach | Montero-Fleta Begoña Pérez-Sabater Carmen |
Procedia - Social and Behavioral Sciences | English | 2011 | 0 | 0 | ||
| Knowledge sharing in wiki communities: An empirical study | Online Information Review | English | 2011 | 0 | 0 | |||
| Language independent identification of parallel sentences using Wikipedia | Rohit G. Bharadwaj Vasudeva Varma |
Wikipedia Language independent Parallel sentences |
World Wide Web | English | 2011 | 0 | 0 | |
| Language of vandalism: improving Wikipedia vandalism detection via stylometric analysis | Manoj Harpalani Michael Hart Sandesh Singh Rob Johnson Yejin Choi |
HLT | English | 2011 | 0 | 0 | ||
| Language-independent context aware query translation using Wikipedia | Rohit G. Bharadwaj Vasudeva Varma |
BUCC | English | 2011 | 0 | 0 | ||
| Large formal wikis: issues and solutions | Jesse Alama Kasper Brink Lionel Mamane Josef Urban |
MKM | English | 2011 | 0 | 0 | ||
| Large-scale question classification in cQA by leveraging Wikipedia semantic knowledge | Li Cai Guangyou Zhou Kang Liu Jun Zhao |
Large-scale classification Question retrieval Translation model Wikipedia |
CIKM | English | 2011 | 0 | 0 | |
| Learning to simplify sentences using Wikipedia | William Coster David Kauchak |
MTTG | English | 2011 | 0 | 0 | ||
| Learning-Oriented Assessment of Wiki Contributions: How to Assess Wiki Contributions in a Higher Education Learning Setting | Emilio J. Rodríguez-Posada Juan Manuel Dodero-Beardo Manuel Palomo-Duarte Inmaculada Medina-Bulo |
Computer-supported collaborative learning Wiki E-Learning assessment |
International Conference on Computer Supported Education | English | 2011 | Computer-Supported Collaborative Learning based on wikis offers new ways of collaboration and encourages participation. When the number of contributions from students increases, traditional assessment procedures of e-learning settings suffer from scalability problems. In a wiki-based learning experience, some automatic tools are required to support the assessment of such great amounts of data. We have studied readily available analysis tools for the MediaWiki platform that have complementary input, work modes and output. We comment on our experience in two Higher Education courses, one using HistoryFlow and another using StatMediaWiki, and discuss the advantages and drawbacks of each system. | 0 | 0 |
| Lessons from the classroom: successful techniques for teaching wikis using Wikipedia | Frank Schulenburg LiAnna Davis Max Klein |
Wikipedia Assignment Coursework Talk page Public policy initiative Student Teaching tool University |
WikiSym | English | 2011 | 0 | 0 | |
| Leveraging Wikipedia concept and category information to enhance contextual advertising | Zongda Wu Guandong Xu Rong Pan Yanchun Zhang Zhiwen Hu Jianfeng Lu |
Contextual advertising Similarity measure Wikipedia |
CIKM | English | 2011 | 0 | 0 | |
| Linguistically informed mining lexical semantic relations from wikipedia structure | Maciej Piasecki Agnieszka Indyka-Piasecka Roman Kurc |
ACIIDS | English | 2011 | 0 | 0 | ||
| Link spamming Wikipedia for profit | Andrew G. West Jian Chang Krishna Venkatasubramanian Oleg Sokolsky Insup Lee |
Wikipedia Attack model Collaborative security Spam Measurement study Spam economics Web 2.0 spam Wiki |
CEAS | English | 2011 | Collaborative functionality is an increasingly prevalent web technology. To encourage participation, these systems usually have low barriers-to-entry and permissive privileges. Unsurprisingly, ill-intentioned users try to leverage these characteristics for nefarious purposes. In this work, a particular abuse is examined -- link spamming -- the addition of promotional or otherwise inappropriate hyperlinks. Our analysis focuses on the wiki model and the collaborative encyclopedia, Wikipedia, in particular. A principal goal of spammers is to maximize *exposure*, the quantity of people who view a link. Creating and analyzing the first Wikipedia link spam corpus, we find that existing spam strategies perform quite poorly in this regard. The status quo spamming model relies on link persistence to accumulate exposures, a strategy that fails given the diligence of the Wikipedia community. Instead, we propose a model that exploits the latency inherent in human anti-spam enforcement. Statistical estimation suggests our novel model would produce significantly more link exposures than status quo techniques. More critically, the strategy could prove economically viable for perpetrators, incentivizing its exploitation. To this end, we address mitigation strategies. | 0 | 0 |
| Local and global algorithms for disambiguation to Wikipedia | Lev Ratinov Dan Roth Doug Downey Mike Anderson |
HLT | English | 2011 | 0 | 0 | ||
| Loki-Semantic wiki with logical knowledge representation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | English | 2011 | 0 | 0 | |||
| Mail2Wiki: low-cost sharing and early curation from email to wikis | Ben Hanrahan Guillaume Bouchard Gregorio Convertino Thiebaud Weksteen Nicholas Kong Cedric Archambeau Ed H. Chi |
Corporate wikis Design Organization |
C&T | English | 2011 | 0 | 0 | |
| Mail2Wiki: posting and curating Wiki content from email | Benjamin V. Hanrahan Thiebaud Weksteen Nicholas Kong Gregorio Convertino Guillaume Bouchard Cedric Archambeau Ed H. Chi |
Email plugin Enterprise Wikis Organizing Sharing |
IUI | English | 2011 | 0 | 0 | |
| Matching the Affordances of Wikis to Collaborative Learning: A Case Study of IT Project Students | Andreas U. Kuswara Debbie Richards |
HICSS | English | 2011 | 0 | 0 | ||
| Measuring Hyperlink Distances: Wikipedia Case Study | Rodrigo Rodrigues Paim Daniel Ratton Figueiredo |
Hyperlink analysis Hyperlink distance Document similarity Wikipedia |
WebSci Conference | English | 2011 | Hyperlinks are a fundamental aspect of the Web, as they play a major role in accomplishing important functions such as document clustering and document ranking. Despite various facets of hyperlink analysis, in this work we consider a novel aspect of hyperlinks, namely their distance. How far in terms of contextual similarity will a hyperlink take you? We consider classical distance functions that capture the similarity between documents as well as propose a new distance function, an IDF-based generalization of Jaccard distance. We characterize the distance distribution of hyperlinks considering Wikipedia as a case study. Our results indicate that hyperlink distances are strongly skewed, with the majority of hyperlinks exhibiting very long distances. | 0 | 0 |
| Measuring Semantic Relatedness Using Wikipedia Revision Information in a Signed Network | Wen-Teng Yang Hung-Yu Kao |
Semantic relatedness HITS Wikipedia |
TAAI | English | 2011 | 0 | 0 | |
| Measuring Similarities between Technical Terms Based on Wikipedia | Myunggwon Hwang Do-Heon Jeong Seungwoo Lee Hanmin Jung |
Similarity Measure Technical Terms Wikipedia InterLink Wikipedia Category |
ITHINGSCPSCOM | English | 2011 | 0 | 0 | |
| Mentoring in Wikipedia: a clash of cultures | David R. Musicant Yuqing Ren James A. Johnson John Riedl |
Wikipedia Mentoring User retention |
WikiSym | English | 2011 | 0 | 0 | |
| MikiWiki: a meta wiki architecture and prototype based on the hive-mind space model | Li Zhu Ivan Vaghi Barbara Rita Barricelli |
HMS model MikiWiki Boundary objects Co-evolution End user development Habitable environment Meta-design Mikinugget Wiki |
IS-EUD | English | 2011 | 0 | 0 | |
| Mining Fuzzy Domain Ontology Based on Concept Vector from Wikipedia Category Network | Cheng-Yu Lu Shou-Wei Ho Jen-Ming Chung Fu-Yuan Hsu Hahn-Ming Lee Jan-Ming Ho |
Expert-finding Reviewer Classification Domain Ontology Concept Vector Data mining |
WI-IAT | English | 2011 | 0 | 0 | |
| Mobile wikipedia: a case study of information service design for chinese teenagers | Jia Zhou P. L. Patrick Rau Christoph Rohmer Jie Zhou Christophe Ghalayini Felix Roerig |
Chinese teenagers Information service Mobile phone User centered design Wikipedia |
UAHCI | English | 2011 | 0 | 0 | |
| Multilingual Ontology Matching based on Wiktionary Data Accessible via SPARQL Endpoint | Feiyu Lin Andrew Krizhanovsky |
Proceedings of the 13th Russian Conference on Digital Libraries RCDL’2011 | English | 2011 | Interoperability is a feature required by the Semantic Web. It is provided by the ontology matching methods and algorithms. But now ontologies are presented not only in English, but in other languages as well. It is important to use an automatic translation for obtaining correct matching pairs in multilingual ontology matching. The translation into many languages could be based on the Google Translate API, the Wiktionary database, etc. From the point of view of the balance of presence of many languages, of manually crafted translations, of a huge size of a dictionary, the most promising resource is the Wiktionary. It is a collaborative project working on the same principles as the Wikipedia. The parser of the Wiktionary was developed and the machine-readable dictionary was designed. The data of the machine-readable Wiktionary are stored in a relational database, but with the help of D2R server the database is presented as an RDF store. Thus, it is possible to get lexicographic information (definitions, translations, synonyms) from web service using SPARQL requests. In the case study, the problem entity is a task of multilingual ontology matching based on Wiktionary data accessible via SPARQL endpoint. Ontology matching results obtained using Wiktionary were compared with results based on Google Translate API. | 5 | 0 | |
| Multilingual document clustering using wikipedia as external knowledge | N. Kiran Kumar K. G. S. Santosh Vasudeva Varma |
Document representation Multilingual document clustering Wikipedia |
IRFC | English | 2011 | 0 | 0 | |
| Multipedia: enriching DBpedia with multimedia information | Andrés García-Silva Max Jakob Pablo N. Mendes Christian Bizer |
International Conference on Knowledge Capture | English | 2011 | 0 | 0 | ||
| My kind of people?: perceptions about Wikipedia contributors and their motivations | Judd Antin | Proceedings of the 2011 annual conference on Human factors in computing systems | English | 2011 | Perceptions of information products such as Wikipedia can depend on assumptions and stereotypes about the people who create them. As new Wikipedians consider contributing they are likely to apply such assumptions and ask themselves: "Are Wikipedia contributors my kind of people? Is this a group I'd like to belong to?" In this qualitative study I address the potential challenge of these questions by exploring readers and infrequent editors' perceptions of Wikipedia contributors and their motivations. Through analysis of twenty semi-structured interviews, I find evidence of strong negative perceptions as well as positive ones which nonetheless prevent users from identifying with active Wikipedia contributors. I argue that these perceptions present a barrier to the progression of participation over time. I conclude by discussing the practical challenges of my findings for Wikipedia and other online collaborative systems. | 14 | 0 | |
| New educational approach based on the use of wiki platforms in university environments | Proceedings of the 2011 7th International Conference on Next Generation Web Services Practices, NWeSP 2011 | English | 2011 | 0 | 0 | |||
| Overview of the INEX 2010 link the wiki track | Andrew Trotman David Alexander Shlomo Geva |
INEX | English | 2011 | 0 | 0 | ||
| Participation in Wikipedia's article deletion processes | R. Stuart Geiger Heather Ford |
Wikipedia Administration Bureaucracy Community Governance |
WikiSym | English | 2011 | 0 | 0 | |
| Peer assessment using wiki to enhance their mastery of the Chinese language | Communications in Computer and Information Science | English | 2011 | 0 | 0 | |||
| Places on the map and in the cloud: representations of locality and geography in Wikipedia | Randall M. Livingstone | Wikipedia Bias Collaboration Content Geography Users Wiki |
WikiSym | English | 2011 | 0 | 0 | |
| Planning for a successful corporate wiki | Communications in Computer and Information Science | English | 2011 | 0 | 0 | |||
| Privacy-respecting reputation for wiki users | IFIP Advances in Information and Communication Technology | English | 2011 | 0 | 0 | |||
| Protocol for Systematic Mapping of Wikipedia Studies | Mostafa Mesgari Arto Lanamäki Chitu Okoli Mohamad Mehdi |
Information Systems Research Seminar in Scandinavia | English | 2011 | 0 | 0 | ||
| Providing cross-lingual editing assistance to Wikipedia editors | Ching-Man Au Yeung Kevin Duh Masaaki Nagata |
CICLing | English | 2011 | 0 | 0 | ||
| PukiWiki-Java Connector, a simple API for saving data of Java programs on a wiki | Takashi Yamanoue Kentaro Oda Koichi Shimozono |
Java applets Collaboration Data store API Social coding |
WikiSym | English | 2011 | Experimental implementation of SDK for Java programs, PukiWiki-Java Connector, which makes an illusion that wiki pages as persistent data store, is shown. A Java program of them can be running on a wiki page and it can save its data on the page. The Java program consists of PukiWiki which is a popular wiki in Japan, the plug-in which starts up Java Applets. A Java Applet with default access privilege cannot store its data at the local host. We have constructed the API for the applets to ease data persistent at a remote host. We also combined the API and the wiki system by introducing a wiki plugin and tags for starting up Java Applets. Applet generated persistent data resides in wiki texts side by side. We have successfully ported useful programs such as a simple text editor, a simple music editor, a simple draw program and programming environments in a PukiWiki system using this connector. | 2 | 3 |
| Quality evaluation of wikipedia articles through edit history and editor groups | Se Wang Mizuho Iwaihara |
Edit network Quality evaluation Web mining Web trust Wikipedia |
APWeb | English | 2011 | 0 | 0 | |
| Redundancy and collaboration in Wikibooks | Ilaria Liccardi Olivier Chapuis Ching-Man Au Yeung Wendy Mackay |
Collaborative writing Coordination mechanisms Text redundancy |
INTERACT | English | 2011 | This paper investigates how Wikibooks authors collaborate to create high-quality books. We combined Information Retrieval and statistical techniques to examine the complete multi-year lifecycle of over 50 high-quality Wikibooks. We found that: 1. The presence of redundant material is negatively correlated with collaboration mechanisms; 2. For most books, over 50% of the content is written by a small core of authors; and 3. Use of collaborative tools (predicted pages and talk pages) is significantly correlated with patterns of redundancy. Non-redundant books are well-planned from the beginning and require fewer talk pages to reach high-quality status. Initially redundant books begin with high redundancy, which drops as soon as authors use coordination tools to restructure the content. Suddenly redundant books display sudden bursts of redundancy that must be resolved, requiring significantly more discussion to reach high-quality status. These findings suggest that providing core authors with effective tools for visualizing and removing redundant material may increase writing speed and improve the book's ultimate quality. | 12 | 0 |
| Reference Blindness: The Influence of References on Trust in Wikipedia | Teun Lucassen Matthijs L. Noordzij Jan Maarten Schraagen |
WebSci Conference | English | 2011 | In this study we show the influence of references on trust in information. We changed the contents of reference lists of Wikipedia articles in such a way that the new references were no longer in any sense related to the topic of the article. Furthermore, the length of the reference list was varied. College students were asked to evaluate the credibility of these articles. Only 6 out of 23 students noticed the manipulation of the references; 9 out of 23 students noticed the variations in length. These numbers are remarkably low, as 17 students indicated they considered references an important indicator of credibility. The findings suggest a highly heuristic manner of credibility evaluation. Systematic evaluation behavior was also observed in the experiment, but only of participants with low trust in Wikipedia in general. | 7 | 0 | |
| Repurposing a wiki for collaborative learning - Pedagogical and technical view | 15th International Conference on System Theory, Control and Computing, ICSTCC 2011 | English | 2011 | 0 | 0 | |||
| Research on design of a wiki & blog-based knowledge-sharing mechanism for virtual enterprise | Proceedings - 3rd International Conference on Measuring Technology and Mechatronics Automation, ICMTMA 2011 | English | 2011 | 0 | 0 | |||
| Searching the wikipedia with public online search engines | Miro Lehtonen | INEX | English | 2011 | 0 | 0 | ||
| Semantic relatedness for named entity disambiguation using a small wikipedia | Izaskun Fernandez Iñaki Alegria Nerea Ezeiza |
Named Entity Disambiguation Semantic relatedness Wikipedia |
TSD | English | 2011 | 0 | 0 | |
| Semantic retrieval of images by learning from wikipedia | Martin Klinkigt Koichi Kise Heiko Maus Andreas Dengel |
SIFT SVM Generic object recognition Image management Shape model Specific object recognition |
KES | English | 2011 | 0 | 0 | |
| Semantic wiki in environmental project management | IFIP Advances in Information and Communication Technology | English | 2011 | 0 | 0 | |||
| Simple English Wikipedia: a new text simplification task | William Coster David Kauchak |
HLT | English | 2011 | 0 | 0 | ||
| Social capital increases efficiency of collaboration among Wikipedia editors | Keiichi Nemoto Peter Gloor Robert Laubacher |
Wikipedia Collaboration Community governance Open source projects Social capital Social media Social network analysis Social network Time-to-market |
HT | English | 2011 | 0 | 0 | |
| Social mechanism of granting trust basing on polish wikipedia requests for adminship | Piotr Turek Justyna Spychala Adam Wierzbicki Piotr Gackowski |
Collaboration Trust Wikipedia |
SocInfo | English | 2011 | 0 | 0 | |
| Social networks of Wikipedia | Paolo Massa | Wikipedia Wiki Social network Empirical analysis Open source |
Hypertext | English | 2011 | Wikipedia, the free online encyclopedia anyone can edit, is a live social experiment: millions of individuals volunteer their knowledge and time to collectively create it. It is hence interesting trying to understand how they do it. While most of the attention concentrated on article pages, a less known share of activities happen on user talk pages, Wikipedia pages where a message can be left for the specific user. These public conversations can be studied from a Social Network Analysis perspective in order to highlight the structure of the “talk” network. In this paper we focus on this preliminary extraction step by proposing different algorithms. We then empirically validate the differences in the networks they generate on the Venetian Wikipedia with the real network of conversations extracted manually by coding every message left on all user talk pages. The comparisons show that both the algorithms and the manual process contain inaccuracies that are intrinsic in the freedom and unpredictability of Wikipedia growth. Nevertheless, a precise description of the involved issues allows to make informed decisions and to base empirical findings on reproducible evidence. Our goal is to lay the foundation for a solid computational sociology of wikis. For this reason we release the scripts encoding our algorithms as open source and also some datasets extracted out of Wikipedia conversations, in order to let other researchers replicate and improve our initial effort. | 14 | 2 |
| Students' confidence and perceived value for participating in cross-cultural wiki-based collaborations | Educational Technology Research and Development | English | 2011 | 0 | 0 | |||
| Students' perceptions of Wiki-based collaborative writing for learners of English as a foreign language | Interactive Learning Environments | English | 2011 | 0 | 0 | |||
| Supporting Multilingual Discussion for Wikipedia Translation | Noriyuki Ishida Toshiyuki Takasaki Masanobu Ishimatsu Toru Ishida |
Wikipedia Machine translation Talk page |
CULTURE-COMPUTING | English | 2011 | 0 | 0 | |
| Text clustering based on granular computing and wikipedia | Liping Jing Jian Yu |
Granular computing Text clustering Wikipedia |
RSKT | English | 2011 | 0 | 0 | |
| The PlanetMath Encyclopedia | Joseph Corneli | Online community Mathematics Collaboration Encyclopedia Commons-Based Peer Production PlanetMath |
MathWikis | English | 2011 | The history of PlanetMath.org is discussed, tracing its inception, stabilization, and some defining challenges. Research and outreach efforts that have been conducted in the course of work on the PlanetMath project are reviewed, and the scope and reach of the resource are discussed. Recent developments are indicated briefly. Some remarks evaluating PlanetMath’s trajectory and content conclude the paper. | 0 | 0 |
| The Web of Mathematical Models: A Schema-based, Wiki-like, Interactive Platform | Thomas Grundmann Jean-Marie Gaillourdet Karsten Schmidt Arnd Poetzsch-Heffter Stefan Deßloch Martin Memmel |
MathWikis | 2011 | In science and engineering mathematical models are increasingly important to describe natural phenomena and design artifacts. Our goal is to make the notion of “mathematical models” more explicit and precise as well as to build up knowledge repositories for searching, exploring, combining, and sharing models. With the Web of Mathematical Models, WoM, we provide a platform to host such models on the Web. Models follow an explicit, content-related schema. | 0 | 0 | ||
| The analysis of wiki application in virtual resources construction in network | 2011 International Conference on E-Business and E-Government, ICEE2011 - Proceedings | English | 2011 | 0 | 0 | |||
| The behavior of Wiki users | Social Behavior and Personality | English | 2011 | 0 | 0 | |||
| The integration of students' artifacts created with Web 2.0 tools into Moodle, blog, wiki, e-portfolio and Ning | MIPRO 2011 - 34th International Convention on Information and Communication Technology, Electronics and Microelectronics - Proceedings | English | 2011 | 0 | 0 | |||
| The people's web meets linguistic knowledge: automatic sense alignment of Wikipedia and Wordnet | Elisabeth Niemann Iryna Gurevych |
IWCS | English | 2011 | 0 | 0 | ||
| The success of corporate wiki systems: an end user perspective | Zeeshan A. Bhatti Serge Baile Hina M. Yasin |
WikiSym | English | 2011 | With the ever increasing use of Web 2.0 sites on the internet, the use of Web 2.0 based tools is now employed by organizations across the globe. One of the most widely used Web 2.0 tools in organizations is wiki technology, particularly in project management. It is important for organizations to measure the success of their wiki system implementation. With the advent of new technologies in the market and their deployment by the firms, it is necessary to investigate how they can help organizations execute processes in a better way. In this paper we present a theoretical model for the measurement of corporate wikis' success from the end-user's perspective based on the theoretical foundation of DeLone & McLean's IS success model [17]. We extend the model by incorporating contextual factors with respect to wiki technology in a project management task. This study intends to help firms to understand in a better way, how they can use wikis to achieve an efficient, effective and improved end-user performance. This would also be helpful for companies engaged in wiki development business to improve their products keeping in view the perceptions of wiki end-users. | 0 | 0 | |
| The victorian web and the victorian course wiki: comparing the educational effectiveness of identical assignments in web 1.0 and web 2.0 | George P. Landow | Assignments Connectivity Education Evaluation Expository writing Hypermedia Hypertext Student-centered discussion Student-led discussion The victorian web Web 2.0 Wiki |
HT | English | 2011 | 0 | 0 | |
| Topical and structural linkage in wikipedia | Kelly Y. Itakura Charles L. A. Clarke Shlomo Geva Andrew Trotman Wei Chi Huang |
ECIR | English | 2011 | 0 | 0 | ||
| Towards Tailored Semantic Annotation Systems from Wikipedia | Shahad Kudama Rafael Berlanga Llavori Lisette Garcia-Moya Victoria Nebot Maria Jose Aramburu Cabo |
Annotation External source Wikipedia Context Tailored Fragment |
DEXA | English | 2011 | 0 | 0 | |
| Towards identifying arguments in Wikipedia pages | Hoda Sepehri Rad Denilson Barbosa |
Wikipedia Argument Controversy Evaluation |
World Wide Web | English | 2011 | 0 | 0 | |
| Towards improving wikipedia as an image-rich encyclopaedia through analyzing appropriateness of images for an article | Xinpeng Zhang Yasuhito Asano Masatoshi Yoshikawa |
Multimedia information Relation Wikipedia |
APWeb | English | 2011 | 0 | 0 | |
| Understanding and improving Wikipedia article discussion spaces | Jodi Schneider Alexandre Passant John G. Breslin |
Wikipedia Collaboration Online discussions Semantic web Talk page |
SAC | English | 2011 | 0 | 0 | |
| Understanding collaboration in Wikipedia | Royce Kimmons | First Monday | English | 2011 | Wikipedia stands as an undeniable success in online participation and collaboration. However, previous attempts at studying collaboration within Wikipedia have focused on simple metrics like rigor (i.e., the number of revisions in an article’s revision history) and diversity (i.e., the number of authors that have contributed to a given article) or have made generalizations about collaboration within Wikipedia based upon the content validity of a few select articles. By looking more closely at metrics associated with each extant Wikipedia article (N=3,427,236) along with all revisions (N=225,226,370), this study attempts to understand what collaboration within Wikipedia actually looks like under the surface. Findings suggest that typical Wikipedia articles are not rigorous, in a collaborative sense, and do not reflect much diversity in the construction of content and macro–structural writing, leading to the conclusion that most articles in Wikipedia are not reflective of the collaborative efforts of the community but, rather, represent the work of relatively few contributors. | 8 | 1 | |
| Unsupervised language-independent name translation mining from Wikipedia infoboxes | Wen-Pin Lin Matthew Snover Heng Ji |
EMNLP | English | 2011 | 0 | 0 | ||
| Using Mahout for Clustering Wikipedia's Latest Articles: A Comparison between K-means and Fuzzy C-means in the Cloud | Mahout Document clustering K-means Fuzzy c-means |
CLOUDCOM | English | 2011 | 0 | 0 | ||
| Using Wikipedia to boost collaborative filtering techniques | Gilad Katz Nir Ofek Bracha Shapira Lior Rokach Guy Shani |
Wikipedia Cold start problem Collaborative filtering Recommender system |
RecSys | English | 2011 | 0 | 0 | |
| Using a Wikipedia-based semantic relatedness measure for document clustering | Majid Yazdani Andrei Popescu-Belis |
English | 2011 | 0 | 0 | |||
| Using a wiki to scaffold primary-school students' collaborative writing | Educational Technology and Society | English | 2011 | 0 | 0 | |||
