Abstract:A Data warehouse (DW) is characterized by a complex architecture, designed in order to integrate data derived from operational data sources (DS), hence providing advanced analytical tools of these data. The DW is highly dependent on its DS. Hence, evolutions of the DS schema need to be propagated to the DW schema and content. This paper presents a model-driven approach for the evolution of a multidimensional DW. It is based on two evolution models: a first evolution model for the DS and another for the DW. These two models concern the data structure aspects as well as the evolution operations. The transition between these two models is performed through specific transformation rules defined in QVT (Query\View\Transformation).
INTRODUCTIONIn the data warehousing (DW) field, whatever the enterprise's philosophy falls into i) Bill Inmon's camp where the DW is one part of the overall BI system, or into ii) Ralph Kimball's camp where the DW is the conglomerate of all data marts within the enterprise, data issued from the operational systems are extracted, transformed, cleansed and finally loaded into the fact and dimension tables of the famous star schema which represents the keystone modeling diagram that has twofold objectives: first, it highlights the subject of analyses (i.e., fact representing the activity to be evaluated) and, secondly, it shows up the axes (i.e., dimensions) according to which the fact's data could be analyzed (Inmon, 2002). This strong dependency between the DW and the data source (DS) leads to a new evolution problem that addresses the impact of the DS schema evolution on the DW. In fact, the dynamic evolution of business processes within the enterprise can lead to another evolution of the DS schema. The associated DW cannot escape from this evolution which can simultaneously affect its schema, stored data, and also the ETL process (Extract-TransformLoad) (Vassiliadis, 2009). This paper treats this evolution problematic, it is organized as follows. In section 2, we overview researches related to the DW evolution problem.Section 3 proposes a model-driven approach for the propagation of DS schema changes towards the DW. Section 4 defines the evolution models of both the DS and the DW. Section 5 presents an example of transformation rules formalized in QVT (Query \View \Transformation); it is for the automatic passage between these two models. Finally, section 6 concludes the paper and enumerates our future perspectives.
RELATED RESEARCHESThe DW evolution problem has been the subject of several research studies. It was treated from several points of views: Analytical need evolution, DS schema evolution, etc.Some researchers (Favre et al., 2007), (Benitez et al., 2004), (Blaschka et al., 1999 ) have limited their study to the DW evolution as a result of evolution of decision makers needs, without considering the case of the DS evolution. Other literature works (Bellahsene, 2002), (Wrembel and Bebel, 2007), (Solodovnikova, 2008) have examined the evolution of the source schema as well a...