The Data Warehouse ETL Toolkit
Authors: Ralph Kimball, Joe Caserta

Publisher: Wiley India Pvt Ltd
ISBN: 9788126505548
Pages: 510
The Data Warehouse ETL Toolkit is a guide for both novice and advanced data warehouse developers and managers. The book covers the foundation of a data warehousing system: the Extract, Transform, and Load (ETL) system. The ETL system is the backbone of any data warehouse and often consumes 70 percent of the resources required to implement and maintain a typical data warehouse.

The book is organized around the four steps of the ETL system: extracting data from the source systems, enforcing data quality and consistency standards, conforming data so that separate sources can be used together, and finally delivering data in a presentation-ready format that developers can use to build applications.
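
As a rough illustration of those four steps (not code from the book), the flow can be sketched as a small Python pipeline. All names here (`extract`, `clean`, `conform`, `deliver`, `COUNTRY_CODES`, the `Customer` record, and the sample source rows) are hypothetical and chosen only for this sketch; a real ETL system would read from actual source systems and load into warehouse tables.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Customer:
    """Presentation-ready record (hypothetical target structure)."""
    customer_id: str
    name: str
    country: str


# Assumed mapping from source-specific spellings to one shared code.
COUNTRY_CODES = {"usa": "US", "united states": "US", "us": "US",
                 "india": "IN", "in": "IN"}


def extract(sources):
    """Step 1: pull raw rows from every source, tagging each with its origin."""
    return [dict(row, source=name)
            for name, rows in sources.items()
            for row in rows]


def clean(rows):
    """Step 2: enforce quality rules: drop rows with no id, trim names."""
    return [{**row, "name": row.get("name", "").strip()}
            for row in rows
            if row.get("id") is not None]


def conform(rows):
    """Step 3: map country values to shared codes so sources can be combined."""
    return [{**row,
             "country": COUNTRY_CODES.get(
                 row.get("country", "").strip().lower(), "UNKNOWN")}
            for row in rows]


def deliver(rows):
    """Step 4: shape rows into presentation-ready records."""
    return [Customer(customer_id=f"{row['source']}-{row['id']}",
                     name=row["name"],
                     country=row["country"])
            for row in rows]


def run_pipeline(sources):
    """Chain the four steps in order."""
    return deliver(conform(clean(extract(sources))))


if __name__ == "__main__":
    sample_sources = {
        "crm": [{"id": 1, "name": " Ada ", "country": "USA"},
                {"id": None, "name": "orphan", "country": "US"}],
        "billing": [{"id": 7, "name": "Ravi", "country": "India"}],
    }
    for record in run_pipeline(sample_sources):
        print(record)
```

Keeping each step as a separate function mirrors the way the description separates the stages, so a quality rule or a conformance mapping can change without touching extraction or delivery.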

The Data Warehouse ETL Toolkit describes efficient practices for extracting data from sources scattered throughout the enterprise, removing redundancy and inaccuracy, transforming the resulting data into correctly formatted data structures, and finally loading the end product into the data warehouse. The book also includes time-saving ETL techniques, a comprehensive guide to building dimensional structures, and practical advice on maintaining data quality.