DiffDB
From WikiPapers
DiffDB (Alternative names for this tool) | |
Keyword(s) | data processing |
Operating system(s) | Unknown [+] |
Language(s) | Unknown [+] |
Programming language(s) | Java |
Author(s) | Yusuke Matsubara |
License(s) | Unknown [+] |
Website | https://github.com/whym/diffindexer |
Related material | |
Related tool(s) | Unknown [+] |
Related dataset(s) | Unknown [+] |
Search | |
Google Scholar | |
Export and share | |
BibTeX, CSV, RDF, JSON | |
![]() ![]() ![]() ![]() ![]() ![]() ![]() | |
Browse properties · List of tools |
DiffDB are made of DiffIndexer and DiffSearcher.
The DiffIndexer takes as raw input the diffs generated by Wikihadoop and creates a Lucene-based index. The DiffSearcher allows you to query the index so you can answer questions such as:
- Who has added template X in the last month?
- Who added more than 2000 characters to user talk pages in 2008?
- Do It Yourself Analytics with Wikipedia (Archived at WebCitation)
Publications
There is no publication about this tool yet.