Integrations of Data Warehousing, Data Mining and Database Technologies
DOI: 10.4018/978-1-60960-537-7.ch006
|View full text |Cite
|
Sign up to set email alerts
|

On Handling the Evolution of External Data Sources in a Data Warehouse Architecture

Abstract: A data warehouse architecture (DWA) has been developed for the purpose of integrating data from multiple heterogeneous, distributed, and autonomous external data sources (EDSs) as well as for providing means for advanced analysis of integrated data. The major components of this architecture include: an external data source (EDS) layer, and extraction-transformation-loading (ETL) layer, a data warehouse (DW) layer, and an on-line analytical processing (OLAP) layer. Methods of designing a DWA, research developme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 115 publications
0
4
0
Order By: Relevance
“…Besides,TDWhaveseveralissuesowingtoadditionoftemporalityintraditionalDW,timebased queryingofMDdatabase,impropernessinaggregationofdataandstructuralvariations,temporal viewrealizationfromnon-temporaldimensions,developmentofaMDstructure,implementation propertiesofatemporalschemaandsoon.GolfarelliandRizzi(2009)reviewedthestate-of-art temporaldatawarehousemodellingtechniquesandclassifiedtheissuesunderthreedistinctcategories like managing deviations in DW, managing data variations in data mart and managing schema alterationsindatamart.Theyidentifiedthefollowinglimitations:(a)time-dependentapproaches intendtodescribetheschemaandhandlingthevariationsofTDWbutdoesnotoffercomparative analysiswithexistingtechniques (Wrembel,2011;Manousisetal.,2015),(b)thetechniquesaretoo oldanddonotconsiderthelatesttrendsinthedomain,and(c)donotaffordcomparisonontemporal supportincurrenttechniquesbasedondifferenttimestamps. Wrembel(2009)alsotermedasubsetofmethodscalledmulti-versionedDWsthatsupportsa programminglanguageforquerying,sharingandindexingdatainDW.Inanotherwork, Wrembel (2011)discussedthechallengesthatarefacedinthedesign,modellingandorganizationofgrowing datasourcesaswellasdevelopmentofETLlayerunderUMLschema.Nevertheless,noneofthese techniquesprovideacomparisononTDWwithexistingapproaches. (Lapuraetal.,2018)exposeda financialsupportmultidimensionalDWmodelthatdividestime,financialunit,accounts,temporal dimensions that are regularly updated with transaction data obtained from financial database of auniversity.Forvisualdescription,adatavisualizationtoolisincorporatedinthewebportalof university,andallowedtousabilitytestingonlybytheterminalusers.Thistoolisreportedasauseful onebymostoftherespondents.Moreover,thetoolistestedwithqueryanalysiswherethequery responsetimeisgreatlydecreasedbyanaverageofabout50%.Thoughseveralresearchworkshave beenpresentedbasedonreal-timedataupdation (Jörg&Dessloch,2009,Vassiliadis&Simitsis, 2009,onlyalittleattentionhasbeenpaidtobringforthappropriatedimensionalmodelling.Latest datawarehousesystemsaredesignedtoofferabundantfacilitiestonumberofdatausers.Thatis,users canretrievereportsandsummariesappropriatetothem,orexaminedatawithspecificvisualization tools (Guldenetal.,2019).Moreover,usersneeddataaccessatdifferentintervalslikeinfrequently, frequently,atregulartimeperiods,rapidly,immediatelyandpredictably.Thus,thenecessityoftimedependentbusinessgoalsareincreasingday-by-daytoattaingrowthorienteddevelopmentinthe competitiveinfrastructure (Dinhetal.,2020).…”
Section: Related Workmentioning
confidence: 99%
“…Besides,TDWhaveseveralissuesowingtoadditionoftemporalityintraditionalDW,timebased queryingofMDdatabase,impropernessinaggregationofdataandstructuralvariations,temporal viewrealizationfromnon-temporaldimensions,developmentofaMDstructure,implementation propertiesofatemporalschemaandsoon.GolfarelliandRizzi(2009)reviewedthestate-of-art temporaldatawarehousemodellingtechniquesandclassifiedtheissuesunderthreedistinctcategories like managing deviations in DW, managing data variations in data mart and managing schema alterationsindatamart.Theyidentifiedthefollowinglimitations:(a)time-dependentapproaches intendtodescribetheschemaandhandlingthevariationsofTDWbutdoesnotoffercomparative analysiswithexistingtechniques (Wrembel,2011;Manousisetal.,2015),(b)thetechniquesaretoo oldanddonotconsiderthelatesttrendsinthedomain,and(c)donotaffordcomparisonontemporal supportincurrenttechniquesbasedondifferenttimestamps. Wrembel(2009)alsotermedasubsetofmethodscalledmulti-versionedDWsthatsupportsa programminglanguageforquerying,sharingandindexingdatainDW.Inanotherwork, Wrembel (2011)discussedthechallengesthatarefacedinthedesign,modellingandorganizationofgrowing datasourcesaswellasdevelopmentofETLlayerunderUMLschema.Nevertheless,noneofthese techniquesprovideacomparisononTDWwithexistingapproaches. (Lapuraetal.,2018)exposeda financialsupportmultidimensionalDWmodelthatdividestime,financialunit,accounts,temporal dimensions that are regularly updated with transaction data obtained from financial database of auniversity.Forvisualdescription,adatavisualizationtoolisincorporatedinthewebportalof university,andallowedtousabilitytestingonlybytheterminalusers.Thistoolisreportedasauseful onebymostoftherespondents.Moreover,thetoolistestedwithqueryanalysiswherethequery responsetimeisgreatlydecreasedbyanaverageofabout50%.Thoughseveralresearchworkshave beenpresentedbasedonreal-timedataupdation (Jörg&Dessloch,2009,Vassiliadis&Simitsis, 2009,onlyalittleattentionhasbeenpaidtobringforthappropriatedimensionalmodelling.Latest datawarehousesystemsaredesignedtoofferabundantfacilitiestonumberofdatausers.Thatis,users canretrievereportsandsummariesappropriatetothem,orexaminedatawithspecificvisualization tools (Guldenetal.,2019).Moreover,usersneeddataaccessatdifferentintervalslikeinfrequently, frequently,atregulartimeperiods,rapidly,immediatelyandpredictably.Thus,thenecessityoftimedependentbusinessgoalsareincreasingday-by-daytoattaingrowthorienteddevelopmentinthe competitiveinfrastructure (Dinhetal.,2020).…”
Section: Related Workmentioning
confidence: 99%
“…Since they are not directly related to the topic of this paper, they will not be described here. An overview of research problems and approaches can be found in [5,6].…”
Section: Related Workmentioning
confidence: 99%
“…For instance, Golfarelli and Rizzi [9] discuss the handling of schema and data level changes in data warehouses and data marts, design of TDWs, and querying temporal data. However, they do not include the recent efforts such as [23,24] as well as a comparative analysis of these approaches. Wrembel [22] also focuses on a subset of approaches, called Multiversioned Data Warehouses (MVDWs).…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, it includes a language for querying a MVDW, as well as the detailed structure for sharing data and indexing data in a MVDW. Wrembel [23] only discusses the challenges that arise in the design, construction, and management of evolving external data sources as well as evolution of an ETL layer. All these approaches do not provide a comparison of the temporal support provided by the existing schemes.…”
Section: Introductionmentioning
confidence: 99%