VOOZH about

URL: https://www.coursera.org/learn/dwdesign

⇱ Data Warehouse Concepts, Design, and Data Integration | Coursera


Data Warehouse Concepts, Design, and Data Integration

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Data Warehouse Concepts, Design, and Data Integration

72,489 already enrolled

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
4.4

1,058 reviews

6 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
93%
Most learners liked this course

Gain insight into a topic and learn the fundamentals.
4.4

1,058 reviews

6 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
93%
Most learners liked this course

Build your subject-matter expertise

This course is part of the Data Warehousing for Business Intelligence Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

This is the second course in the Data Warehousing for Business Intelligence specialization. Ideally, the courses should be taken in sequence.

In this course, you will learn exciting concepts and skills for designing data warehouses and creating data integration workflows. These are fundamental skills for data warehouse developers and administrators. You will have hands-on experience for data warehouse design and use open source products for manipulating pivot tables and creating data integration workflows. In the data integration assignment, you can use either Oracle, MySQL, or PostgreSQL databases. You will also gain conceptual background about maturity models, architectures, multidimensional models, and management practices, providing an organizational perspective about data warehouse development. If you are currently a business or information technology professional and want to become a data warehouse designer or administrator, this course will give you the knowledge and skills to do that. By the end of the course, you will have the design experience, software background, and organizational context that prepares you to succeed with data warehouse development projects. In this course, you will create data warehouse designs and data integration workflows that satisfy the business intelligence needs of organizations. When you’re done with this course, you’ll be able to: * Evaluate an organization for data warehouse maturity and business architecture alignment; * Create a data warehouse design and reflect on alternative design methodologies and design goals; * Create data integration workflows using prominent open source software; * Reflect on the role of change data, refresh constraints, refresh frequency trade-offs, and data quality goals in data integration process design; and * Perform operations on pivot tables to satisfy typical business analysis requests using prominent open source software

Module 1 introduces the course and covers concepts that provide a context for the remainder of this course. In the first two lessons, you’ll understand the objectives for the course and know what topics and assignments to expect. In the remaining lessons, you will learn about historical reasons for development of data warehouse technology, learning effects, business architectures, maturity models, project management issues, market trends, and employment opportunities. This informational module will ensure that you have the background for success in later modules that emphasize details and hands-on skills.You should also read about the software requirements in the lesson at the end of module 1. I recommend that you try to install the software this week before assignments begin in week 2.

What's included

8 videos13 readings1 assignment

8 videosβ€’Total 54 minutes
  • Course introduction video lectureβ€’1 minute
  • Course objectives video lectureβ€’4 minutes
  • Course topics and assignments video lectureβ€’4 minutes
  • Motivation and characteristics video lectureβ€’8 minutes
  • Learning effects for data warehouse development video lectureβ€’9 minutes
  • Data warehouse architectures and maturity video lectureβ€’11 minutes
  • Data Warehouse Examples video lectureβ€’9 minutes
  • Employment opportunities video lectureβ€’6 minutes
13 readingsβ€’Total 440 minutes
  • Get help and meet other learners in this course. Join your discussion forums!β€’5 minutes
  • Powerpoint lecture notes for lesson 1β€’15 minutes
  • Optional textbookβ€’10 minutes
  • Powerpoint lecture notes for lesson 2β€’15 minutes
  • Powerpoint lecture notes for lesson 3β€’25 minutes
  • Powerpoint lecture notes for lesson 4β€’30 minutes
  • Powerpoint lecture notes for lesson 5β€’30 minutes
  • Powerpoint lecture notes for lesson 6β€’20 minutes
  • Powerpoint lecture notes for lesson 7β€’20 minutes
  • Overview of database software requirements and installationβ€’60 minutes
  • Pentaho Data Integration installation (Windows + Mac)β€’60 minutes
  • PDI Installation Problems (Mac + Windows)β€’60 minutes
  • Optional textbook reading materialβ€’90 minutes
1 assignmentβ€’Total 60 minutes
  • Concept quiz for Module 1 β€’60 minutes

Now that you have conceptual background for data warehouse development, you’ll start using data warehouse tools. In module 2, you will learn about the multidimensional representation of a data warehouse used by business analysts. You’ll apply what you’ve learned in practice and graded problems using WebPivotTable, a web-based tool for manipulating pivot tables. At the end of this module, you will have solid background to communicate and assist business analysts who use a multidimensional representation of a data warehouse. To complete this module, you should proceed to the assignment and quiz involving WebPivotTable.

What's included

6 videos9 readings2 assignments

6 videosβ€’Total 38 minutes
  • Data cube representation video lectureβ€’9 minutes
  • Data cube operators video lectureβ€’7 minutes
  • Overview of Microsoft MDX video lectureβ€’7 minutes
  • Microsoft MDX statements video lectureβ€’6 minutes
  • Overview of WebPivotTable video lectureβ€’5 minutes
  • WebPivotTable software demonstrationβ€’5 minutes
9 readingsβ€’Total 430 minutes
  • Powerpoint lecture notes for lesson 1β€’20 minutes
  • Powerpoint lecture notes for lesson 2β€’20 minutes
  • Powerpoint lecture notes for lesson 3β€’20 minutes
  • Powerpoint lecture notes for lesson 4β€’20 minutes
  • Powerpoint lecture notes for lesson 5β€’20 minutes
  • WebPivotTable Tutorialβ€’60 minutes
  • Optional textbook reading materialβ€’120 minutes
  • Module 2 Assignmentβ€’120 minutes
  • Solution for the WebPivotTable assignment in Module 2β€’30 minutes
2 assignmentsβ€’Total 180 minutes
  • Concept quiz for Module 2β€’60 minutes
  • Quiz for the module 2 assignment - WebPivotTableβ€’120 minutes

This module emphasizes data warehouse design skills. Now that you understand the multidimensional representation used by business analysts, you are ready to learn about data warehouse design using a relational database. In practice, the multidimensional representation used by business analysts must be derived from a data warehouse design using a relational DBMS. You will learn about design patterns, summarizability problems, transformations for schema integration, and design methodologies. You will apply these concepts to mini case studies about data warehouse design. At the end of the module, you will have created data warehouse designs based on data sources and business needs of hypothetical organizations.

What's included

7 videos11 readings2 assignments

7 videosβ€’Total 58 minutes
  • Relational database concepts for multidimensional data video lectureβ€’8 minutes
  • Table design patterns video lectureβ€’8 minutes
  • Summarizability patterns for dimension tables video lectureβ€’7 minutes
  • Summarizability patterns for dimension-fact relationships video lectureβ€’7 minutes
  • Schema integration video lessonβ€’9 minutes
  • Data warehouse design methodologies video lectureβ€’8 minutes
  • Mini case for data warehouse design video lectureβ€’11 minutes
11 readingsβ€’Total 700 minutes
  • Powerpoint lecture notes for lesson 1β€’20 minutes
  • Powerpoint lecture notes for lesson 2β€’20 minutes
  • Powerpoint lecture notes for lesson 3β€’20 minutes
  • Powerpoint lecture notes for lesson 4β€’20 minutes
  • Powerpoint lecture notes for lesson 5β€’30 minutes
  • Powerpoint lecture notes for lesson 6β€’20 minutes
  • Powerpoint lecture notes for lesson 7β€’30 minutes
  • Mini Case Study and Solutionβ€’180 minutes
  • Optional textbook reading materialβ€’90 minutes
  • Module 3 Assignmentβ€’180 minutes
  • Self-Evaluation for the Module 3 Assignmentβ€’90 minutes
2 assignmentsβ€’Total 120 minutes
  • Concept quiz for Module 3 β€’60 minutes
  • Reflective Quiz for the Module 3 Assignment β€’60 minutes

Module 4 extends your background about data warehouse development. After learning about schema design concepts and practices, you are ready to learn about data integration processing to populate and refresh a data warehouse. The informational background in module 4 covers concepts about data sources, data integration processes, and techniques for pattern matching and inexact matching of text. Module 4 provides detailed material about SQL statements for data integration with examples and an assignment for both Oracle Cloud and PostgreSQL. Module 4 provides a context for the software skills that you will learn in module 5.

What's included

7 videos10 readings3 assignments

7 videosβ€’Total 59 minutes
  • Concepts of data integration processes video lectureβ€’9 minutes
  • Change data concepts video lectureβ€’8 minutes
  • Data cleaning tasks video lectureβ€’6 minutes
  • Pattern matching with regular expressions video lectureβ€’9 minutes
  • Matching and consolidation video lectureβ€’8 minutes
  • Quasi identifiers and distance functions for entity matching video lectureβ€’7 minutes
  • SQL for data integration video lessonβ€’11 minutes
10 readingsβ€’Total 510 minutes
  • Powerpoint lecture notes for lesson 1β€’20 minutes
  • Powerpoint lecture notes for lesson 2β€’20 minutes
  • Powerpoint lecture notes for lesson 3β€’20 minutes
  • Powerpoint lecture notes for lesson 4β€’20 minutes
  • Powerpoint lecture notes for lesson 5β€’20 minutes
  • Powerpoint lecture notes for lesson 6β€’20 minutes
  • Powerpoint lecture notes for lesson 7β€’60 minutes
  • Optional reading materialβ€’90 minutes
  • Module 4 assignmentβ€’180 minutes
  • Self evaluation for the Module 4 assignmentβ€’60 minutes
3 assignmentsβ€’Total 180 minutes
  • Concept quiz for Module 4β€’60 minutes
  • Reflective Quiz for the Module 4 assignmentβ€’60 minutes
  • Quiz for the Module 4 assignmentβ€’60 minutes

Module 5 extends your background about data integration from module 4. Module 5 covers architectures, features, and details about data integration tools to complement the conceptual background in module 4. You will learn about the features of two open source data integration tools, Talend Open Studio and Pentaho Data Integration. You will use Pentaho Data Integration in a guided tutorial in preparation for a graded assignment involving Pentaho Data Integration. For the tutorial and assignment, you need to connect to a database server, Oracle Cloud or PostgreSQL. If you have time, I recommend completing the data integration assignment using both Oracle Cloud and PostgreSQL.

What's included

6 videos9 readings3 assignments

6 videosβ€’Total 41 minutes
  • Architectures and marketplace video lectureβ€’7 minutes
  • Common features of data Integration tools video lectureβ€’7 minutes
  • Talend Open Studio video lectureβ€’8 minutes
  • Pentaho Data Integration video lectureβ€’8 minutes
  • Software video demonstration for Pentaho Data Integrationβ€’10 minutes
  • Course conclusion video lectureβ€’2 minutes
9 readingsβ€’Total 670 minutes
  • Powerpoint lecture notes for lesson 1β€’20 minutes
  • Powerpoint lecture notes for lesson 2β€’20 minutes
  • Powerpoint lecture notes for lesson 3β€’10 minutes
  • Powerpoint lecture notes for lesson 4β€’20 minutes
  • Optional reading materialβ€’90 minutes
  • Guided tutorial for Pentaho Data Integrationβ€’180 minutes
  • Module 5 assignmentβ€’180 minutes
  • Documents for the module 5 assignmentβ€’60 minutes
  • Self-Evaluation for the Module 5 Assignmentβ€’90 minutes
3 assignmentsβ€’Total 180 minutes
  • Concept quiz for Module 5β€’60 minutes
  • Reflective quiz for the Module 5 assignmentβ€’60 minutes
  • Quiz for the Module 5 assignmentβ€’60 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings
4.3 (68 ratings)
University of Colorado System
4 Coursesβ€’261,740 learners

Explore more from Design and Product

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    61.81%

  • 4 stars

    27.59%

  • 3 stars

    5.57%

  • 2 stars

    2.07%

  • 1 star

    2.93%

Showing 3 of 1058

TT
Β·

Reviewed on Mar 4, 2019

The course could be less-detailed. Besides open source ETL tools, other big players (e.g. Informatica, SAP DI, etc.) should be mentioned as well.

AM
Β·

Reviewed on Jun 11, 2017

A great hands-on introduction to Pentaho. Gave me a very good insight into the capabilities of the product. I will start using the tool at work.

AN
Β·

Reviewed on May 29, 2016

For me this course is one of the most important in BI. We had good teachers that helped us to understand the important issues.

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,