Data Warehouse Concepts, Design, and Data Integration
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Data Warehouse Concepts, Design, and Data Integration
This course is part of Data Warehousing for Business Intelligence Specialization
Instructor: Michael Mannino
72,489 already enrolled
Included with
Learn more
Ask Coursera
1,058 reviews
1,058 reviews
Skills you'll gain
Tools you'll learn
Details to know
11 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 5 modules in this course
This is the second course in the Data Warehousing for Business Intelligence specialization. Ideally, the courses should be taken in sequence.
In this course, you will learn exciting concepts and skills for designing data warehouses and creating data integration workflows. These are fundamental skills for data warehouse developers and administrators. You will have hands-on experience for data warehouse design and use open source products for manipulating pivot tables and creating data integration workflows. In the data integration assignment, you can use either Oracle, MySQL, or PostgreSQL databases. You will also gain conceptual background about maturity models, architectures, multidimensional models, and management practices, providing an organizational perspective about data warehouse development. If you are currently a business or information technology professional and want to become a data warehouse designer or administrator, this course will give you the knowledge and skills to do that. By the end of the course, you will have the design experience, software background, and organizational context that prepares you to succeed with data warehouse development projects. In this course, you will create data warehouse designs and data integration workflows that satisfy the business intelligence needs of organizations. When youβre done with this course, youβll be able to: * Evaluate an organization for data warehouse maturity and business architecture alignment; * Create a data warehouse design and reflect on alternative design methodologies and design goals; * Create data integration workflows using prominent open source software; * Reflect on the role of change data, refresh constraints, refresh frequency trade-offs, and data quality goals in data integration process design; and * Perform operations on pivot tables to satisfy typical business analysis requests using prominent open source software
Module 1 introduces the course and covers concepts that provide a context for the remainder of this course. In the first two lessons, youβll understand the objectives for the course and know what topics and assignments to expect. In the remaining lessons, you will learn about historical reasons for development of data warehouse technology, learning effects, business architectures, maturity models, project management issues, market trends, and employment opportunities. This informational module will ensure that you have the background for success in later modules that emphasize details and hands-on skills.You should also read about the software requirements in the lesson at the end of module 1. I recommend that you try to install the software this week before assignments begin in week 2.
What's included
8 videos13 readings1 assignment
8 videosβ’Total 54 minutes
- Course introduction video lectureβ’1 minute
- Course objectives video lectureβ’4 minutes
- Course topics and assignments video lectureβ’4 minutes
- Motivation and characteristics video lectureβ’8 minutes
- Learning effects for data warehouse development video lectureβ’9 minutes
- Data warehouse architectures and maturity video lectureβ’11 minutes
- Data Warehouse Examples video lectureβ’9 minutes
- Employment opportunities video lectureβ’6 minutes
13 readingsβ’Total 440 minutes
- Get help and meet other learners in this course. Join your discussion forums!β’5 minutes
- Powerpoint lecture notes for lesson 1β’15 minutes
- Optional textbookβ’10 minutes
- Powerpoint lecture notes for lesson 2β’15 minutes
- Powerpoint lecture notes for lesson 3β’25 minutes
- Powerpoint lecture notes for lesson 4β’30 minutes
- Powerpoint lecture notes for lesson 5β’30 minutes
- Powerpoint lecture notes for lesson 6β’20 minutes
- Powerpoint lecture notes for lesson 7β’20 minutes
- Overview of database software requirements and installationβ’60 minutes
- Pentaho Data Integration installation (Windows + Mac)β’60 minutes
- PDI Installation Problems (Mac + Windows)β’60 minutes
- Optional textbook reading materialβ’90 minutes
1 assignmentβ’Total 60 minutes
- Concept quiz for Module 1 β’60 minutes
Now that you have conceptual background for data warehouse development, youβll start using data warehouse tools. In module 2, you will learn about the multidimensional representation of a data warehouse used by business analysts. Youβll apply what youβve learned in practice and graded problems using WebPivotTable, a web-based tool for manipulating pivot tables. At the end of this module, you will have solid background to communicate and assist business analysts who use a multidimensional representation of a data warehouse. To complete this module, you should proceed to the assignment and quiz involving WebPivotTable.
What's included
6 videos9 readings2 assignments
6 videosβ’Total 38 minutes
- Data cube representation video lectureβ’9 minutes
- Data cube operators video lectureβ’7 minutes
- Overview of Microsoft MDX video lectureβ’7 minutes
- Microsoft MDX statements video lectureβ’6 minutes
- Overview of WebPivotTable video lectureβ’5 minutes
- WebPivotTable software demonstrationβ’5 minutes
9 readingsβ’Total 430 minutes
- Powerpoint lecture notes for lesson 1β’20 minutes
- Powerpoint lecture notes for lesson 2β’20 minutes
- Powerpoint lecture notes for lesson 3β’20 minutes
- Powerpoint lecture notes for lesson 4β’20 minutes
- Powerpoint lecture notes for lesson 5β’20 minutes
- WebPivotTable Tutorialβ’60 minutes
- Optional textbook reading materialβ’120 minutes
- Module 2 Assignmentβ’120 minutes
- Solution for the WebPivotTable assignment in Module 2β’30 minutes
2 assignmentsβ’Total 180 minutes
- Concept quiz for Module 2β’60 minutes
- Quiz for the module 2 assignment - WebPivotTableβ’120 minutes
This module emphasizes data warehouse design skills. Now that you understand the multidimensional representation used by business analysts, you are ready to learn about data warehouse design using a relational database. In practice, the multidimensional representation used by business analysts must be derived from a data warehouse design using a relational DBMS. You will learn about design patterns, summarizability problems, transformations for schema integration, and design methodologies. You will apply these concepts to mini case studies about data warehouse design. At the end of the module, you will have created data warehouse designs based on data sources and business needs of hypothetical organizations.
What's included
7 videos11 readings2 assignments
7 videosβ’Total 58 minutes
- Relational database concepts for multidimensional data video lectureβ’8 minutes
- Table design patterns video lectureβ’8 minutes
- Summarizability patterns for dimension tables video lectureβ’7 minutes
- Summarizability patterns for dimension-fact relationships video lectureβ’7 minutes
- Schema integration video lessonβ’9 minutes
- Data warehouse design methodologies video lectureβ’8 minutes
- Mini case for data warehouse design video lectureβ’11 minutes
11 readingsβ’Total 700 minutes
- Powerpoint lecture notes for lesson 1β’20 minutes
- Powerpoint lecture notes for lesson 2β’20 minutes
- Powerpoint lecture notes for lesson 3β’20 minutes
- Powerpoint lecture notes for lesson 4β’20 minutes
- Powerpoint lecture notes for lesson 5β’30 minutes
- Powerpoint lecture notes for lesson 6β’20 minutes
- Powerpoint lecture notes for lesson 7β’30 minutes
- Mini Case Study and Solutionβ’180 minutes
- Optional textbook reading materialβ’90 minutes
- Module 3 Assignmentβ’180 minutes
- Self-Evaluation for the Module 3 Assignmentβ’90 minutes
2 assignmentsβ’Total 120 minutes
- Concept quiz for Module 3 β’60 minutes
- Reflective Quiz for the Module 3 Assignment β’60 minutes
Module 4 extends your background about data warehouse development. After learning about schema design concepts and practices, you are ready to learn about data integration processing to populate and refresh a data warehouse. The informational background in module 4 covers concepts about data sources, data integration processes, and techniques for pattern matching and inexact matching of text. Module 4 provides detailed material about SQL statements for data integration with examples and an assignment for both Oracle Cloud and PostgreSQL. Module 4 provides a context for the software skills that you will learn in module 5.
What's included
7 videos10 readings3 assignments
7 videosβ’Total 59 minutes
- Concepts of data integration processes video lectureβ’9 minutes
- Change data concepts video lectureβ’8 minutes
- Data cleaning tasks video lectureβ’6 minutes
- Pattern matching with regular expressions video lectureβ’9 minutes
- Matching and consolidation video lectureβ’8 minutes
- Quasi identifiers and distance functions for entity matching video lectureβ’7 minutes
- SQL for data integration video lessonβ’11 minutes
10 readingsβ’Total 510 minutes
- Powerpoint lecture notes for lesson 1β’20 minutes
- Powerpoint lecture notes for lesson 2β’20 minutes
- Powerpoint lecture notes for lesson 3β’20 minutes
- Powerpoint lecture notes for lesson 4β’20 minutes
- Powerpoint lecture notes for lesson 5β’20 minutes
- Powerpoint lecture notes for lesson 6β’20 minutes
- Powerpoint lecture notes for lesson 7β’60 minutes
- Optional reading materialβ’90 minutes
- Module 4 assignmentβ’180 minutes
- Self evaluation for the Module 4 assignmentβ’60 minutes
3 assignmentsβ’Total 180 minutes
- Concept quiz for Module 4β’60 minutes
- Reflective Quiz for the Module 4 assignmentβ’60 minutes
- Quiz for the Module 4 assignmentβ’60 minutes
Module 5 extends your background about data integration from module 4. Module 5 covers architectures, features, and details about data integration tools to complement the conceptual background in module 4. You will learn about the features of two open source data integration tools, Talend Open Studio and Pentaho Data Integration. You will use Pentaho Data Integration in a guided tutorial in preparation for a graded assignment involving Pentaho Data Integration. For the tutorial and assignment, you need to connect to a database server, Oracle Cloud or PostgreSQL. If you have time, I recommend completing the data integration assignment using both Oracle Cloud and PostgreSQL.
What's included
6 videos9 readings3 assignments
6 videosβ’Total 41 minutes
- Architectures and marketplace video lectureβ’7 minutes
- Common features of data Integration tools video lectureβ’7 minutes
- Talend Open Studio video lectureβ’8 minutes
- Pentaho Data Integration video lectureβ’8 minutes
- Software video demonstration for Pentaho Data Integrationβ’10 minutes
- Course conclusion video lectureβ’2 minutes
9 readingsβ’Total 670 minutes
- Powerpoint lecture notes for lesson 1β’20 minutes
- Powerpoint lecture notes for lesson 2β’20 minutes
- Powerpoint lecture notes for lesson 3β’10 minutes
- Powerpoint lecture notes for lesson 4β’20 minutes
- Optional reading materialβ’90 minutes
- Guided tutorial for Pentaho Data Integrationβ’180 minutes
- Module 5 assignmentβ’180 minutes
- Documents for the module 5 assignmentβ’60 minutes
- Self-Evaluation for the Module 5 Assignmentβ’90 minutes
3 assignmentsβ’Total 180 minutes
- Concept quiz for Module 5β’60 minutes
- Reflective quiz for the Module 5 assignmentβ’60 minutes
- Quiz for the Module 5 assignmentβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Design and Product
- U
University of Colorado System
Course
- U
University of Colorado System
Course
- U
University of Colorado System
Course
- U
University of Colorado System
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
61.81%
- 4 stars
27.59%
- 3 stars
5.57%
- 2 stars
2.07%
- 1 star
2.93%
Showing 3 of 1058
Reviewed on Mar 4, 2019
The course could be less-detailed. Besides open source ETL tools, other big players (e.g. Informatica, SAP DI, etc.) should be mentioned as well.
Reviewed on Jun 11, 2017
A great hands-on introduction to Pentaho. Gave me a very good insight into the capabilities of the product. I will start using the tool at work.
Reviewed on May 29, 2016
For me this course is one of the most important in BI. We had good teachers that helped us to understand the important issues.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
