Copyright © 2018, 2014, 2011 Pearson Education, Inc. All Rights Reserved
What is a Data Warehouse?
• A physical repository where relational data are specially organized to provide enterprise-wide, cleansed data in a standardized format
• A relational database? (So what is the difference?)
• “The data warehouse is a collection of integrated, subject-oriented databases designed to support DSS functions, where each unit of data is non-volatile and relevant to some moment in time”
A Historical Perspective to Data Warehousing
• 1970s
  – Mainframe computers
  – Simple data entry
  – Routine reporting
  – Primitive database structures
  – Teradata incorporated
• 1980s
  – Mini/personal computers (PCs)
  – Business applications for PCs
  – Distributed DBMS
  – Relational DBMS
  – Teradata ships commercial DBs
  – "Business Data Warehouse" coined
• 1990s
  – Centralized data storage
  – Data warehousing was born
  – Inmon, Building the Data Warehouse
  – Kimball, The Data Warehouse Toolkit
  – EDW architecture design
• 2000s
  – Exponentially growing Web data
  – Consolidation of DW/BI industry
  – Data warehouse appliances emerged
  – Business intelligence popularized
  – Data mining and predictive modeling
  – Open source software
  – SaaS, PaaS, cloud computing
• 2010s
  – Big Data analytics
  – Social media analytics
  – Text and Web analytics
  – Hadoop, MapReduce, NoSQL
  – In-memory, in-database
Characteristics of DWs
• Subject oriented
• Integrated
• Time-variant (time series)
• Nonvolatile
• Summarized
• Not normalized
• Metadata
• Web based, relational/multi-dimensional
• Client/server, real-time/right-time/active
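Two of the characteristics above, time-variant and nonvolatile, can be illustrated with a small sketch. It uses Python's built-in sqlite3 module as a stand-in for a warehouse engine; the table `customer_snapshot`, its columns, and the helper `load_snapshot` are hypothetical names chosen for illustration and are not taken from the slides.

```python
import sqlite3

# In-memory SQLite database standing in for a warehouse (assumption for the sketch).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE customer_snapshot (
        customer_id   INTEGER NOT NULL,
        snapshot_date TEXT    NOT NULL,  -- time-variant: every row is dated
        city          TEXT    NOT NULL,
        total_orders  INTEGER NOT NULL,  -- summarized measure
        PRIMARY KEY (customer_id, snapshot_date)
    )
    """
)


def load_snapshot(rows):
    """Append rows only; loaded rows are never updated or deleted (nonvolatile)."""
    conn.executemany("INSERT INTO customer_snapshot VALUES (?, ?, ?, ?)", rows)
    conn.commit()


load_snapshot([(1, "2017-01-31", "Tulsa", 3)])
load_snapshot([(1, "2017-02-28", "Dallas", 5)])  # a change arrives as a new dated row

# The full history stays queryable because nothing was overwritten.
history = conn.execute(
    "SELECT snapshot_date, city, total_orders FROM customer_snapshot "
    "WHERE customer_id = 1 ORDER BY snapshot_date"
).fetchall()
print(history)  # [('2017-01-31', 'Tulsa', 3), ('2017-02-28', 'Dallas', 5)]
```

An operational database would typically update the customer's row in place; in this sketch the warehouse keeps both dated rows, which is what makes time-series analysis possible.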
Data Mart
A departmental, small-scale “DW” that stores only limited/relevant data
• Dependent data mart
  – A subset that is created directly from a data warehouse
• Independent data mart
  – A small data warehouse designed for a strategic business unit or a department
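As a rough illustration of a dependent data mart, the sketch below derives a departmental subset directly from a warehouse table. It again uses Python's sqlite3 module; the table names `dw_sales` and `mart_marketing_sales`, the columns, and the sample rows are all invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical enterprise-wide warehouse table covering several departments.
conn.execute(
    "CREATE TABLE dw_sales (sale_date TEXT, department TEXT, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO dw_sales VALUES (?, ?, ?, ?)",
    [
        ("2017-01-15", "Marketing", "East", 120.0),
        ("2017-01-15", "Finance", "East", 80.0),
        ("2017-02-10", "Marketing", "West", 200.0),
    ],
)

# Dependent data mart: built directly from the warehouse, keeping only
# the rows and columns relevant to one department.
conn.execute(
    """
    CREATE TABLE mart_marketing_sales AS
    SELECT sale_date, region, amount
    FROM dw_sales
    WHERE department = 'Marketing'
    """
)

mart_rows = conn.execute(
    "SELECT sale_date, region, amount FROM mart_marketing_sales ORDER BY sale_date"
).fetchall()
print(mart_rows)
```

An independent data mart would instead be loaded from the source systems without going through `dw_sales`, so its data would not necessarily agree with the warehouse's cleansed, integrated version.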
Other DW Components
• Operational data stores (ODS)
  – A type of database often used as an interim area for a data warehouse
• Oper marts
  – An operational data mart
• Enterprise data warehouse (EDW)
  – A data warehouse for the enterprise
• Metadata
  – “Data about data”
  – In a DW, metadata describe the contents of a data warehouse and its acquisition and use
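The metadata bullet can be made concrete with a minimal sketch, assuming a SQLite database as the stand-in warehouse. Content metadata (column names and types) is read from SQLite's own catalog through `PRAGMA table_info`; acquisition metadata is kept in a hypothetical `etl_load_log` table whose name, columns, and sample values are assumptions made for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dw_sales (sale_date TEXT, region TEXT, amount REAL)")

# Hypothetical acquisition metadata: where a table's data came from and when.
conn.execute(
    "CREATE TABLE etl_load_log ("
    "table_name TEXT, source_system TEXT, loaded_at TEXT, row_count INTEGER)"
)
conn.execute(
    "INSERT INTO etl_load_log VALUES ('dw_sales', 'orders_oltp', '2017-03-01T02:00:00', 0)"
)

# Content metadata: PRAGMA table_info returns one row per column as
# (cid, name, type, notnull, dflt_value, pk); keep the name and declared type.
columns = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(dw_sales)")]

acquisition = conn.execute(
    "SELECT source_system, loaded_at FROM etl_load_log WHERE table_name = 'dw_sales'"
).fetchall()

print(columns)
print(acquisition)
```

Production warehouses usually keep this information in a dedicated metadata repository or data catalog; the two queries here only show the kind of questions such a repository answers.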