Building and Maintaining a Data Warehouse 1st Edition by Fon Silvers – Ebook PDF Instant Download/Delivery: 0367387646, 9780367387648
Full download Building and Maintaining a Data Warehouse 1st Edition after payment
Product details:
ISBN 10: 0367387646
ISBN 13: 9780367387648
Author: Fon Silvers
As it is with building a house, most of the work necessary to build a data warehouse is neither visible nor obvious when looking at the completed product. While it may be easy to plan for a data warehouse that incorporates all the right concepts, taking the steps needed to create a warehouse that is as functional and user-friendly as it is theoretically sound, is not especially easy. That’s the challenge that Building and Maintaininga Data Warehouse answers. Based on a foundation of industry-accepted principles, this work provides an easy-to-follow approach that is cohesive and holistic. By offering the perspective of a successful data warehouse, as well as that of a failed one, this workdetails those factors that must be accomplished and those that are best avoided. Organized to logically progress from more general to specific information, this valuable guide: Presents areas of a data warehouse individually and in sequence, showing how each piece becomes a working part of the whole Examines the concepts and principles that are at the foundation of every successful data warehouse Explains how to recognize and attend to problematic gaps in an established data warehouse Provides the big picture perspective that planners and executives require Those considering the planning and creation of a data warehouse, as well as those who’ve already built one will profit greatly from the insights garnered by the author during his years of creating and gathering information on state-of-the-art data warehouses that are accessible, convenient, and reliable.
Table of contents:
1 The Big Picture: An Introduction to Data Warehousing
Introduction
Decision Support Systems
Dimensional and Third Normal Form Data Models
Storing the Data
Data Availability
Monitoring Data Quality
2 Data Warehouse Philosophy
Introduction
Enterprise Data
Subject Orientation
Data Integration
Form
Function
Grain
Nonvolatility
Time Variant
One Version of the Truth
Long-Term Investment
References
3 Source System Analysis
Introduction
Source System Analysis Principles
System of Record
Entity Data
Arithmetic Data
Absolute Arithmetic Data
Relative Arithmetic Data
Numeric Data That Isn’t Arithmetic
Alphanumeric Data
Granularity
Latency
Transaction Data
Snapshot Data
Source System Analysis Methods
Data Profile
Data Flow Diagram
Data State Diagram
System of Record
Business Rules
Closing Remarks
References
4 Relational Database Management System (RDBMS)
Introduction
Relational Set Theory
RDBMS Product Offerings
Residual Costs
Licensing
Support and Maintenance
Extensibility
Connective Capacity
Closing Remarks
References
5 Database Design
Introduction
Data Modeling Methodology
Conceptual Data Model
Logical Data Model
Logical (Primary) Key
Attribute
Primary Key/Foreign Key Relation
Cardinality
Super Types and Subtypes
Putting It All Together
Physical Data Model
Dimensional Data Model
Join Strategies
Conformed Dimensions
Junk Dimensions
Different Grains
Multiple Results
Factless Fact
Snowflake Schema
Dimensional Data Model Summary
Third Normal Form Data Model
Third Normal Form Fact Tables
Third Normal Form Dimension Tables
Third Normal Form Conformed Dimension Tables
Third Normal Form Joins Strategies
Source Native Key with Dates
Third Normal Form Data Model Summary
Recursive Data Model
Recursive Data Model Summary
Physical Data Model Summary
Data Architecture
Enterprise Data Warehouse
Data Mart
Operational Data Store
Subject Orientation
Data Integration
Sequence
System of Record
Volatile
Short History
Detailed Data
Cycles
Summaries and Aggregates
Closing Remarks
References
6 Data Acquisition and Integration
Introduction
Source System Analysis
Target System Analysis
Direct Requirements
Indirect Requirements
Direct and Indirect Requirements
Language
Data Profile
Data State
Data Mapping
Business Rules
Architecture
Extract, Transform, and Load (ETL)
Extract, Load, and Transform (ELT)
ETL Design Principles
ETL Process Principles
Principle 01: One Thing at a Time
Principle 02: Know When to Begin
Principle 03: Know When to End
Principle 04: Large to Medium to Small
Principle 05: Stage Data Integrity
Principle 06: Know What You Have
Process Principles Conclusion
ETL Staging Principles
Principle 07: Name the Data
Principle 08: Own the Data
Principle 09: Build the Data
Principle 10: Type the Data
Principle 11: Land the Data
Staging Principles Conclusion
ETL Functions
Extract Data from a Contiguous Dataset
Extract Data from a Data Flow
Row-Level Transformation
Dataset-Level Transformation
Surrogate Key Generation: Intradataset
Data Warehouse-Level Transformation
Surrogate Key Generation: Intra-Data Warehouse
Look-Up
Changed Data Capture
ETL Key
Universe to Universe and Candidate to Universe
Load Data from a Stable and Contiguous Dataset
Load Data from a Data Flow
Transaction Summary
Dimension Aggregation
Common Problems
Source Data Anomalies
Incomplete Source Data
Redundant Source Data
Misstated Source Data
Business Rule Changes
Obsolete Data
Redefined Data
Unrecorded Data
Closing Remarks
References
7 Business Intelligence Reporting
Introduction
BI Reporting Success Factors
Performance
User Interface
Presentation of the Data Architecture
Alignment with the Data Model
Ability to Answer Questions
Mobility
Flexibility
Availability
BI Customer Success Factors
Proactive Processes
Reactive Processes
Predefined Processes
Ad Hoc Processes
Data Needs
Information Needs
Analytic Needs
BI Reporting Application
Architecture
BI Reporting Methods
Predefined Reports
Interactive Reports
Online Analytical Process (OLAP) Reports
MOLAP
ROLAP
HOLAP
Drilling
Push versus Pull
Push
Pull
Printed on Paper
Report Archives
Web-Based BI Reporting
Operational BI Reporting: From an ODS
Operational BI Reporting: From an Operational System (Real-Time)
Operational BI Reporting: EDI, Partnerships, and Data Sharing
BI Reporting: Thus Far
Customer Relationship Management (CRM)
Business Metrics Measure the Enterprise
Decisions and Decision Making Closer to the Action
BI Reporting: Coming Soon
Reporting around the Event
BI Search
Sarbanes–Oxley and BI Reporting
Data Mining
Statistics Concepts
Random Error
Statistical Significance
Variables: Dependent and Independent
Hypothesis
Data Mining Tools
Data Mining Activities
Data Cleansing
Data Inspection
Compound Variables
Lag Variables
Numeric Variables
Categorical Variables
Hypothesis
Data Mining Algorithms
Neural Network
Decision Tree
CHAID
Nearest Neighbor
Rule Induction
Genetic Algorithm
Rule Validation and Testing
Overfitting
Closing Remarks
References
8 Data Quality
Introduction
Deming’s Definition of Quality
Data Quality Service Level Agreement (SLA)
Deming’s Statistical Process Control
Process Measurement
Methods and Strategies
Data Stewardship
Post-Load Audit and Report Errant Data
Plug in a Default Value and Report Errant Data
Reject a Record and Report the Errant Record
Reject a Dataset and Report the Errant Dataset
Recycle the Data: In Place and Report Errant Data
Recycle the Data: Recycle Wheel and Report Errant Data
Data Quality Repository
Data Quality Fact Table: Dimensional Data Model
Data Quality Fact Table: Third Normal Form Data Model
Data Quality Reporting
Follow Through
Closing Remarks
References
9 Metadata
Introduction
Types of Metadata
Static Metadata
Dynamic Metadata
Metadata Service Level Agreement (SLA)
Metadata Repository
Central Metadata Repository: Dimensional Data Model
Central Metadata Repository: Third Normal Form
Distributed Metadata Repository
Real-Time Metadata
Data Quality as Metadata
Make or Buy a Metadata Repository
Closing Remarks
References
10 Data Warehouse Customers
Introduction
Strategic Decision Makers
Tactical Decision Makers
Knowledge Workers
Operational Applications
External Partners
Electronic Data Interchange (EDI) Partners
Data Warehouse Plan
Strategic Decision Makers
Tactical Decision Makers
Knowledge Workers
Operational Applications
External Partners
Electronic Data Interchange (EDI) Partners
Closing Remarks
11 Future of Data Warehousing: An Epilogue
Introduction
Scalability and Performance
Real-Time Data Warehousing
Increased Corporate Presence
Back to the Basics
Data Quality
People also search:
building a data warehouse from scratch
building a data warehouse with examples in sql server pdf
building an enterprise data warehouse
building data marts
Tags: Fon Silvers, Building, Maintaining, Data Warehouse