List of tools

From WikiPapers
Revision as of 15:44, December 22, 2014 by Nemo bis

See also: List of datasets.

This is a list of tools available in WikiPapers. Currently, there are 103 tools.

To create a new "tool" go to Form:Tool.


Tool Operating System(s) Language(s) Programming language(s) License Description Image
AVBOT Cross-platform English Python GPL AVBOT is an anti-vandalism bot for the Spanish Wikipedia. It uses regular expressions and scores to detect vandalism. Avbot logo.png
Alternative MediaWiki parsers Cross-platform English PHP Alternative parsers is a compilation of various alternative MediaWiki parsers that are able or intended to translate MediaWiki's text markup syntax into something else.
Article Feedback data dashboard
AssessMediaWiki Cross-platform Spanish PHP AssessMediaWiki is an open-source web application that, connected to a MediaWiki installation, supports hetero-, self-, and peer-assessment procedures while keeping track of the compiled assessment data. Supervisors can thus obtain reports to help them assess students.
Authorship Tracking Cross-platform None Python BSD License Authorship Tracking. This code implements the algorithms for tracking the authorship of text in revisioned content that were published at WWW 2013:

The idea is to attribute each portion of text to the earliest revision in which it appeared. For instance, if a revision contains the sentence "the cat ate the mouse", and the sentence is deleted and then reintroduced in a later revision (not necessarily as part of a revert), it is still attributed to its earliest author once reintroduced.

More precisely, the algorithm takes a parameter N. If a sequence of tokens of length equal to or greater than N has appeared before, it is attributed to its earliest occurrence. See the paper for details.

The code works by building a trie-based representation of the whole history of the revisions, in an object of the class AuthorshipAttribution. Each time a new revision is passed to the object, the object updates its internal state and computes the earliest attribution of the new revision, which can then be easily obtained. The object itself can be serialized (and de-serialized) using JSON-based methods.

To keep the representation of the whole past history from growing too large, we remove from the object the information about content that has been absent from revisions (a) for at least 90 days, and (b) for at least 100 revisions. These are configurable parameters. With these choices, for Wikipedia, the serialization of the object typically has a size between 10 and 20 times the size of a typical revision, even for pages with very long revision lists. See the paper for detailed experimental results.
Catdown GNU/Linux English PHP Catdown is a tool to download images in Wikimedia Commons categories.
ClueBot GNU/Linux C ClueBot is an anti-vandalism bot on the English Wikipedia.
Commons explorer Cross-platform English Python GPL Commons explorer is a map-based tool for exploring Wikimedia Commons multimedia files by location and year.
CryptoDerk's Vandal Fighter Cross-platform English Java Open source
DiffDB Java DiffDB consists of DiffIndexer and DiffSearcher.
Dump-downloader Cross-platform Perl Apache License 2.0 dump-downloader is a script to request and download the full history dump of all the pages in a MediaWiki wiki. It is meant to work for Wikia's wikis, but it could work with other wikis. Source code here:
HistoryFlow Windows English HistoryFlow is a tool for visualizing dynamic, evolving documents and the interactions of multiple collaborating authors. In its current implementation, history flow is being used to visualize the evolutionary history of wiki pages on Wikipedia. English Wikipedia Treaty of Trianon History Flow.png
Contropedia GPL v3 Analysis and visualization of controversies within Wikipedia articles.

More info, publications, and a demo are available at
Huggle Windows Visual Basic .NET GPL v3
Igloo Cross-platform JavaScript Open source
Ikiwiki Cross-platform English Ikiwiki supports storing a wiki as a Git repository.
Images for biographies Cross-platform English Python GPL Images for biographies is a tool that suggests images for biographies in several Wikipedias.
Infobox2rdf Cross-platform English Perl GPL v3 infobox2rdf generates huge RDF datasets from the infobox data in Wikipedia dump files.
JWordNet-Similarity Cross-platform English Java
Java Wikipedia Library Cross-platform English Java LGPL Java Wikipedia Library is an application programming interface that provides access to all information in Wikipedia.
Listen to Wikipedia Web English JavaScript BSD License Listen to Wikipedia is a visual and audio illustration of live editing activity on Wikipedia. Screen Shot Listen to Wikipedia.png
Lupin's Anti-Vandal Tool JavaScript
Manypedia English Python Affero GPL (code), Creative Commons (content) Manypedia is a web tool in which you can compare Linguistic Points Of View (LPOV) of different language Wikipedias. For example (but this is just one of the many possible comparisons), are you wondering whether the communities of editors in the English, Arabic and Hebrew Wikipedias are crystallizing different histories of the Gaza War? Manypedia palestine en ar.png
MediaWiki PHP MediaWiki is a famous wiki engine. It is used by Wikipedia. MediaWiki-smaller-logo.png
MediaWiki API MediaWiki API provides direct, high-level access to the data contained in MediaWiki databases.
MediaWiki Utilities Cross-platform English Python MIT license MediaWiki Utilities is a collection of utilities for working with XML data dumps generated for Wikimedia projects and other MediaWiki wikis.
MediaWiki extensions MediaWiki extensions are extended features for your MediaWiki wiki.
Natural Language Toolkit Cross-platform English Python
Perlwikipedia Cross-platform English Perl GPL v3 perlwikipedia is a high-level bot framework for interacting with MediaWiki wikis.
Python-wikitools Cross-platform English Python GPL v3
Pywikipediabot Cross-platform English Python MIT license pywikipediabot is a wiki robot framework. It includes a lot of functions to interact with a MediaWiki wiki. It uses MediaWiki API when available.
STiki Cross-platform English Java GPL STiki is an anti-vandalism tool that consists of server-side detection algorithms and a client-facing GUI. STiki logo.png
Salebot Salebot is an anti-vandalism bot in French Wikipedia.
Semantic MediaWiki
Semapedia Semapedia was a project to connect places with Wikipedia articles. The project is defunct.
Sioc MediaWiki Cross-platform English Sioc MediaWiki is an RDF exporter for MediaWiki wikis.
StatMediaWiki GNU/Linux English Python GPLv3 StatMediaWiki is a project that aims to create a tool to collect and aggregate information available in a MediaWiki installation. Results are static HTML pages including tables and graphics that can help to analyze the wiki status and development, or a CSV file for custom processing. General hour activity-wikihaskell.png
Toolserver Toolserver is a service by WM-DE and the Wikimedia Foundation.
Twinkle Cross-platform English JavaScript
Vandal Fighter Cross-platform English Java Vandal Fighter - Live RC.png
VandalProof Windows English Visual Basic
VandalSniper Cross-platform English Mono
Weka Cross-platform Java GPL
Wikeval PHP Wikeval is a MediaWiki extension to allow semantic tagging based on an ontology.
Wiki Category Matrix Visualization Cross-platform English Java Educational Community License Wiki Category Matrix Visualization is a tool that generates a visual representation of data sizes across topics of a multi-level category hierarchy in matrix form. It provides a "big picture" overview of topics in terms of categorization. Matrix-visualization-simplewiki.png
Wiki Edit History Analyzer Cross-platform English Java Wiki Edit History Analyzer processes the MediaWiki revision history and produces summaries of edit actions performed. Basic edit actions include insert, delete, replace, and move; high-level edit actions include spelling correction, wikify, etc.
Wiki Explorator Ruby Wiki Explorator is a Ruby library for scientific research on wikis (and other CMS; focus: MediaWiki) for interactive exploration, statistics and visualization of (network) data.
Wiki Loves Monuments map Cross-platform English Python GPLv3 Wiki Loves Monuments map is a map of geolocated monuments that require images. These maps were used in the Wiki Loves Monuments contest.
Wiki Scaffolding Language
Wiki Trip Python Wiki Trip lets you take a trip through the creation process of any Wikipedia page from any language edition of Wikipedia. WikiTrip is an interactive web tool that provides an insightful visualization of two kinds of information about the Wikipedians who edited the selected page: their location in the world and their gender. Ok wikitrip screen 001 english cumulative.png
Wiki-network Python Open source Wiki-network is a set of ad hoc Python scripts.
Wiki2XML parser Cross-platform English Python Wiki2XML parser parses Wikipedia dump files into well-structured XML.
WikiAudit Cross-platform English Java GPL WikiAudit is a tool that, given a MediaWiki wiki location and a set or range of IP addresses, produces a report of the edit history from those IPs. Cheap heuristics try to identify malicious behavior. It is useful for network administrators and for conducting security investigations.
WikiChanges WikiChanges is a web-based tool that exposes the revision history of Wikipedia articles using an interactive graphical timeline.
WikiChecker English WikiChecker analyses the status of English Wikipedia in the last minutes and hours.
Wikichron Cross-platform English Python Affero GPL (code) WikiChron is a web tool for the analysis and visualization of the evolution of wiki online communities. It uses processed data from the history dumps of MediaWiki wikis, computes different metrics on this data, and plots them in interactive graphs. It allows comparing different wikis in the same graphs.

This tool helps researchers inspect the behavior of collaborative online communities, in particular wikis, and generate research hypotheses for further and deeper studies. WikiChron was designed from the start to be very easy to use and highly interactive. It comes with a set of already downloaded and processed wikis from Wikia (but any MediaWiki wiki is supported), and with more than thirty metrics to visualize and compare between wikis.

It can also be useful to wiki administrators who want to see, analyze, and compare how the activity on their wikis is going.

WikiChron is available online here:
WikiCreole WikiCreole is a common wiki markup language to be used across different wikis.
WikiEvidens Cross-platform English Python GPLv3 WikiEvidens is a visualization and statistical tool for wikis. Wikievidens0.0.6.png
WikiNavMap WikiNavMap visualises the tickets, wiki pages and milestones in the Trac environment.
WikiNet TK WikiNet TK is a fast and easy-to-use toolkit for WikiNet.
WikiPop English WikiPop is a system designed to detect significant increase of popularity of topics related to users' interests.
WikiPrep Cross-platform English Perl GPL v2 WikiPrep is a Perl script for preprocessing Wikipedia XML dumps.
WikiPride WikiPride allows you to visualize the breakdown of a Wikipedia community by age of account and by the volume of contributed content. You need a Toolserver account to run this.
WikiScanner English
WikiSim Cross-platform English Java University of Edinburgh GNU license WikiSim is a knowledge collection and curation simulator.
WikiSlurp WikiSlurp queries Wikipedia to return, in HTML format, portions of articles about a given subject.
WikiTeam tools Cross-platform English Python WikiTeam tools is a set of tools focused on wiki preservation and backups.
WikiTracer English WikiTracer is a web service providing platform-independent analytics and comparative growth statistics for wikis.
WikiTrust English New BSD License WikiTrust is an open-source, on-line reputation system for Wikipedia authors and content.
WikiVis (FH-KL) Java GPL WikiVis (FH-KL) is a tool to analyze Wikipedia based on several aspects. The main objective is to visualize the conclusions of this examination, which focuses on the editing frequency and relevance of articles and categories as well as the activity of users. Wikivis 0.5.jpg
WikiVis (UM) Cross-platform English Java Educational Community License Version 2.0 WikiVis (UM) provides an interactive visualization of the Wikipedia information space, primarily as a means of navigating the category hierarchy as well as the article network. The project is implemented in Java, utilizing the Java 3D package. WikiVis-UM-Logo.jpg
WikiWarMonitor English WikiWarMonitor is a tool to measure edit warring.
WikiXMLDB WikiXMLDB provides a way of querying Wikipedia with XQuery.
WikiXRay Python WikiXRay is a robust and extensible software tool for an in-depth quantitative analysis of the whole Wikipedia project.
Wikia-census Cross-platform Python, Jupyter Notebooks wikia-census is a script to generate a census of all of Wikia's wikis.

Census collected and analysis here:

Source code here:
Wikidata Wikidata aims to create a free knowledge base about the world that can be read and edited by humans and machines alike. It will provide data in all the languages of the Wikimedia projects, and allow for the central access to data in a similar vein as Wikimedia Commons does for multimedia files. Wikidata is proposed as a new Wikimedia hosted and maintained project.
English PHP For two subjects (Wikidata items), compares the pageviews of their articles in every language version of Wikipedia in which the article exists. Upcoming development will allow comparison of more than two subjects.
Wikihadoop Wikihadoop makes it possible to run MapReduce jobs with Hadoop on the compressed XML dump files.
Wikimedia Labs
Wikimedia Statistics
Wikimedia counter Cross-platform Python GPL Wikimedia counter is a near real-time edit counter for all Wikimedia projects. Wikimedia projects edits counter 2010-04-16.png
Wikipedia Data Analysis Toolkit GPLv3
Wikipedia Extractor Cross-platform English Python GPL v3
Wikipedia Miner
Wikipedia Recent Changes Map Web English JavaScript Wikipedia Recent Changes Map is a web tool that displays a world map showing anonymous edits to Wikipedia, geolocated by IP.
Wikipedia-map-reduce Cross-platform English Java Apache License 2.0 Wikipedia-map-reduce is a Java software library that allows analysis of Wikipedia at the revision-text level.
WikipediaVision Web English WikipediaVision is a web-based tool that shows anonymous edits to Wikipedia (almost) in real-time.
Wikiq C++ wikiq is a simple and fast stream-based MediaWiki XML dump parser.
Wikirage English Wikirage tracks the pages in Wikipedia which are receiving the most edits over various periods of time. Popular people in the news, the latest fads, and the hottest video games, Internet memes, zeitgeist, and trends bubble to the surface.
Wikistream English Wikistream is a web application that shows the stream of changes in Wikimedia projects in real time.
Wikiswarm Cross-platform English Java Wikiswarm generates code_swarm event logs from the Wikipedia API.
Wikitweets Wikitweets is a visualization of how Wikipedia is cited on Twitter.
Wikiwho English Python MIT license Wikiwho Fast and accurate processing of revision differences for authorship detection. More information: Logo wikiwho transbg.png
Wikokit Cross-platform Java EPLv1.0, New BSD License wikokit (wiki tool kit) - several projects related to wikis.

wiwordik - machine-readable Wiktionary. A visual interface to the parsed English Wiktionary and Russian Wiktionary databases.
Java WebStart application + JavaFX, English interface.
742 languages extracted from the English Wiktionary.
423 languages extracted from the Russian Wiktionary.
Wiwordik-en.0.09.1094 scrollbox.jpg
Wmcharts wmcharts are a set of graphs about Wikimedia projects.
Zawilinski Cross-platform Java Zawilinski is a Java library that supports the extraction and analysis of grammatical data in Wiktionary.
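
The Authorship Tracking entry in the list above describes attributing each run of at least N tokens to the earliest revision in which it appeared. A minimal Python sketch of that idea follows. It is not the tool's own code: it swaps the trie described in the entry for a plain dictionary of N-grams, leaves out the pruning of old content and the JSON serialization, tokenizes on whitespace, and assumes revision ids increase over time. The names `NgramAttribution` and `add_revision` are hypothetical, chosen only for illustration.

```python
from typing import Dict, List, Tuple


class NgramAttribution:
    """Illustrative earliest-revision attribution using N-grams (not the tool's API)."""

    def __init__(self, n: int = 3) -> None:
        self.n = n
        # Maps each N-gram of tokens to the first revision id in which it was seen.
        self.first_seen: Dict[Tuple[str, ...], int] = {}

    def add_revision(self, rev_id: int, text: str) -> List[int]:
        """Record a revision and return, per token, the revision it is attributed to.

        Assumes rev_id is larger than every previously added revision id.
        """
        tokens = text.split()
        # By default every token is new, so it belongs to the current revision.
        origin = [rev_id] * len(tokens)
        for i in range(len(tokens) - self.n + 1):
            gram = tuple(tokens[i:i + self.n])
            # Keep the earliest revision for this N-gram, registering it if new.
            earliest = self.first_seen.setdefault(gram, rev_id)
            # Every token covered by a previously seen N-gram inherits
            # the earliest revision among the N-grams that cover it.
            for j in range(i, i + self.n):
                origin[j] = min(origin[j], earliest)
        return origin


if __name__ == "__main__":
    tracker = NgramAttribution(n=3)
    tracker.add_revision(1, "the cat ate the mouse")
    tracker.add_revision(2, "the dog barked")  # the sentence is deleted
    # The sentence comes back with one new word at the end.
    print(tracker.add_revision(3, "the cat ate the mouse today"))
    # prints [1, 1, 1, 1, 1, 3]
```

In this sketch the reintroduced sentence is credited to revision 1 and only the new word is credited to revision 3, which mirrors the "the cat ate the mouse" example in the entry. Because the dictionary is never pruned, it grows with the full history; the entry's 90-day and 100-revision limits address that growth in the real tool.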

See also

External links