Browse wiki

Jump to: navigation, search
Efficient and scalable data evolution with column oriented databases
Abstract Database evolution is the process of updatDatabase evolution is the process of updating the schema of a database or data warehouse (schema evolution) and evolving the data to the updated schema (data evolution). It is often desired or necessitated when changes occur to the data or the query workload, the initial schema was not carefully designed, or more knowledge of the database is known and a better schema is concluded. The Wikipedia database, for example, has had more than 170 versions in the past 5 years [8]. Unfortunately, although much research has been done on the schema evolution part, data evolution has long been a prohibitively expensive process, which essentially evolves the data by executing SQL queries and re-constructing indexes. This prevents databases from being flexibly and frequently changed based on the need and forces schema designers, who cannot afford mistakes, to be highly cautious. Techniques that enable efficient data evolution will undoubtedly make life much easier. In this paper, we study the efficiency of data evolution, and discuss the techniques for data evolution on column oriented databases, which store each attribute, rather than each tuple, contiguously. We show that column oriented databases have a better potential than traditional row oriented databases for supporting data evolution, and propose a novel data-level data evolution framework on column oriented databases. Our approach, as suggested by experimental evaluations on real and synthetic data, is much more efficient than the query-level data evolution on both row and column oriented databases, which involves unnecessary access of irrelevant data, materializing intermediate results and re-constructing indexes.diate results and re-constructing indexes.
Abstractsub Database evolution is the process of updatDatabase evolution is the process of updating the schema of a database or data warehouse (schema evolution) and evolving the data to the updated schema (data evolution). It is often desired or necessitated when changes occur to the data or the query workload, the initial schema was not carefully designed, or more knowledge of the database is known and a better schema is concluded. The Wikipedia database, for example, has had more than 170 versions in the past 5 years [8]. Unfortunately, although much research has been done on the schema evolution part, data evolution has long been a prohibitively expensive process, which essentially evolves the data by executing SQL queries and re-constructing indexes. This prevents databases from being flexibly and frequently changed based on the need and forces schema designers, who cannot afford mistakes, to be highly cautious. Techniques that enable efficient data evolution will undoubtedly make life much easier. In this paper, we study the efficiency of data evolution, and discuss the techniques for data evolution on column oriented databases, which store each attribute, rather than each tuple, contiguously. We show that column oriented databases have a better potential than traditional row oriented databases for supporting data evolution, and propose a novel data-level data evolution framework on column oriented databases. Our approach, as suggested by experimental evaluations on real and synthetic data, is much more efficient than the query-level data evolution on both row and column oriented databases, which involves unnecessary access of irrelevant data, materializing intermediate results and re-constructing indexes.diate results and re-constructing indexes.
Bibtextype inproceedings  +
Doi 10.1145/1951365.1951380  +
Has author Liu Z. + , He B. + , Hsiao H.-I. + , Yirong Chen +
Has extra keyword Bitmap indexes + , Column oriented database + , Data evolution + , Experimental evaluation + , Intermediate results + , Schema + , Schema evolution + , SQL query + , Synthetic data + , Wikipedia + , Data warehouses + , Technology + , Query processing +
Has keyword Bitmap index + , Column oriented database + , Data evolution + , Schema +
Isbn 9781450305280  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 105–116  +
Published in ACM International Conference Proceeding Series +
Title Efficient and scalable data evolution with column oriented databases +
Type conference paper  +
Year 2011 +
Creation dateThis property is a special property in this wiki. 7 November 2014 13:02:16  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 13:02:16  +
DateThis property is a special property in this wiki. 2011  +
hide properties that link here 
Efficient and scalable data evolution with column oriented databases + Title
 

 

Enter the name of the page to start browsing from.