Schema evolution in Wikipedia - Toward a web information system benchmark
|Author(s)||Curino C.A., Moon H.J., Tanca L., Zaniolo C.|
|Published in||ICEIS 2008 - Proceedings of the 10th International Conference on Enterprise Information Systems|
|Keyword(s)||Benchmark, Case study, Schema evolution, Wikipedia (Extra: Benchmark case studies, Evolution history, In-depth analysis, Maintenance problem, MediaWiki, Open-source software, Schema changes, Software tool, Support systems, Web information systems, Computer software, Computer software maintenance, Database systems, Research, World Wide Web, Information systems)|
Schema evolution in Wikipedia - Toward a web information system benchmark is a 2008 conference paper written in English by Curino C.A., Moon H.J., Tanca L., and Zaniolo C., published in ICEIS 2008 - Proceedings of the 10th International Conference on Enterprise Information Systems.
Evolving the database that is at the core of an Information System represents a difficult maintenance problem that has only been studied in the framework of traditional information systems. However, the problem is likely to be even more severe in web information systems, where open-source software is often developed through the contributions and collaboration of many groups and individuals. Therefore, in this paper, we present an in-depth analysis of the evolution history of the Wikipedia database and its schema; Wikipedia is the best-known example of a large family of web information systems built using the open-source software MediaWiki. Our study is based on: (i) a set of Schema Modification Operators that provide a simple conceptual representation for complex schema changes, and (ii) simple software tools to automate the analysis. This framework allowed us to dissect and analyze the 4.5 years of Wikipedia history, which was short in time, but intense in terms of growth and evolution. Beyond confirming the initial hunch about the severity of the problem, our analysis suggests the need for developing better methods and tools to support graceful schema evolution. Therefore, we briefly discuss documentation and automation support systems for database evolution, and suggest that the Wikipedia case study can provide the kernel of a benchmark for testing and improving such systems.
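The abstract does not list the Schema Modification Operators or describe the analysis tools in detail. As a rough illustration of the general idea only, the Python sketch below models three hypothetical operators (AddColumn, RenameColumn, DropTable) over a toy schema, replays them to derive a later schema version, and counts how often each operator is used. The operator names, the schema representation, and the table and column names are all assumptions made for this example; they are not taken from the paper or from the MediaWiki schema.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List

# Toy schema representation (an assumption): table name -> ordered column names.
Schema = Dict[str, List[str]]


def _copy(schema: Schema) -> Schema:
    """Return a copy so that applying an operator never mutates its input."""
    return {table: list(cols) for table, cols in schema.items()}


@dataclass(frozen=True)
class AddColumn:  # hypothetical operator, for illustration only
    table: str
    column: str

    def apply(self, schema: Schema) -> Schema:
        new = _copy(schema)
        if self.column in new[self.table]:
            raise ValueError(f"{self.table}.{self.column} already exists")
        new[self.table].append(self.column)
        return new


@dataclass(frozen=True)
class RenameColumn:  # hypothetical operator, for illustration only
    table: str
    old: str
    new: str

    def apply(self, schema: Schema) -> Schema:
        result = _copy(schema)
        cols = result[self.table]
        cols[cols.index(self.old)] = self.new  # ValueError if `old` is absent
        return result


@dataclass(frozen=True)
class DropTable:  # hypothetical operator, for illustration only
    table: str

    def apply(self, schema: Schema) -> Schema:
        result = _copy(schema)
        del result[self.table]  # KeyError if the table is absent
        return result


def replay(schema: Schema, ops) -> Schema:
    """Apply a sequence of operators in order to obtain the next schema version."""
    for op in ops:
        schema = op.apply(schema)
    return schema


def count_by_operator(ops) -> Dict[str, int]:
    """Tally how many times each kind of operator occurs in a history."""
    return dict(Counter(type(op).__name__ for op in ops))


if __name__ == "__main__":
    v1 = {"article": ["id", "title"], "old_log": ["id"]}  # toy data
    history = [
        AddColumn("article", "touched"),
        RenameColumn("article", "title", "name"),
        DropTable("old_log"),
    ]
    print(replay(v1, history))
    print(count_by_operator(history))
```

Expressing each change as a small, explicit operator is what makes a schema history easy to replay and to summarize with simple scripts, which is the kind of automated analysis the abstract alludes to.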
Cited 8 times. WikiPapers has no articles available for the citing publications.