Data Warehousing Capstone Project
Ends soon! Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Data Warehousing Capstone Project
This course is part of IBM Data Warehouse Engineer Professional Certificate
Instructor: Rav Ahuja
3,343 already enrolled
Included with
Learn more
29 reviews
Recommended experience
29 reviews
Recommended experience
What you'll learn
Demonstrate proficiency in skills required for an entry-level data warehouse engineering role.
Design and implement various concepts and components in the data warehouse engineering lifecycle such as data repositories.
Showcase working knowledge with relational databases, data warehouses, and data pipelines.
You can use your expertise in Linux shell scripting, SQL, and business intelligence Cognos dashboard design, to tackle data warehousing problems.
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your Data Management expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM
There are 5 modules in this course
In this course you will apply a variety of data warehouse engineering skills and techniques you have learned as part of the previous courses in the IBM Data Warehouse Engineer Professional Certificate. You will assume the role of a Junior Data Engineer who has recently joined the organization and be presented with a real-world use case that requires a data warehouse engineering solution.
In this module, you will design a data platform that uses MySQL as an OLTP database. You will be using MySQL to store the OLTP data.
What's included
1 video2 assignments1 app item4 plugins
1 videoβ’Total 4 minutes
- Introduction to Capstone Projectβ’4 minutes
2 assignmentsβ’Total 36 minutes
- Checklist: OLTP Databaseβ’24 minutes
- Graded Quiz: OLTP Databaseβ’12 minutes
1 app itemβ’Total 30 minutes
- Lab: OLTP Databaseβ’30 minutes
4 pluginsβ’Total 40 minutes
- Reading: Final Project Submission Guidelines and Deliverablesβ’15 minutes
- Data Platform Architectureβ’5 minutes
- Assignment Overview: OLTP Databaseβ’15 minutes
- OLTP Database Requirements and Designβ’5 minutes
In this module you will design and implement a data warehouse and you will then generate reports from the data in the data warehouse.
What's included
1 reading3 assignments3 app items2 plugins
1 readingβ’Total 1 minute
- Optional Lab Informationβ’1 minute
3 assignmentsβ’Total 69 minutes
- Checklist: Data Warehouse Design & Setupβ’15 minutes
- Checklist: Data Warehouse Reportingβ’24 minutes
- Graded Quiz: Build a Data Warehouseβ’30 minutes
3 app itemsβ’Total 180 minutes
- Hands-on Lab: Data Warehousingβ’60 minutes
- Lab: Data Warehouse Reporting using PostgreSQLβ’60 minutes
- (Optional) Obtain IBM Cloud Feature Code and Activate Trial Accountβ’60 minutes
2 pluginsβ’Total 45 minutes
- Assignment Overview:Data Warehouse Design and Reportingβ’15 minutes
- (Optional) Hands-on Lab: Data Warehouse Reporting using DB2β’30 minutes
In this module, you will assume the role of a data engineer at an e-commerce company. Your company has finished setting up a data warehouse. Now you are assigned the responsibility to design a reporting dashboard that reflects the key metrics of the business.
What's included
5 readings2 assignments6 plugins
5 readingsβ’Total 42 minutes
- (Optional): About this optional lesson with Looker Studioβ’2 minutes
- (Optional) : Getting Started with Google Looker Studioβ’10 minutes
- (Optional): Creating Visualizations in Reports using Looker Studioβ’10 minutes
- (Optional) : Summary and Highlightsβ’10 minutes
- Final Assignment Overviewβ’10 minutes
2 assignmentsβ’Total 27 minutes
- Checklist: Dashboard Creation β’12 minutes
- Graded Quiz: Dashboard Creation β’15 minutes
6 pluginsβ’Total 210 minutes
- Assignment Overview: Data Analyticsβ’15 minutes
- (Optional):Hands-on Lab: Getting Started with Google Looker Studioβ’60 minutes
- (Optional): Hands-on Lab: Creating and Configuring Visualizations in Reports with Google Looker Studioβ’60 minutes
- (Optional) Hands-on Lab: Advanced charts in Looker Studioβ’15 minutes
- (Optional): Final Assignment : Dashboard Creation using IBM Cognos Analyticsβ’30 minutes
- (Optional): Final Assignment : Dashboard Creation using Google Looker Studio β’30 minutes
In this module, you will set up an ETL process using a Shell script to extract new transactional data for each day from the MySQL database and load it into the staging data warehouse in PostgreSQL. Later you will perform the transformation on the table in the staging warehouse to load the data in a dimension table and a fact table. You will then export these tables as CSV files to the production warehouse. You will set up a Cron Job to schedule these tasks.
What's included
1 video1 reading2 assignments1 app item1 plugin
1 videoβ’Total 2 minutes
- Assignment Overview: ETL & Data Pipelinesβ’2 minutes
1 readingβ’Total 10 minutes
- Formatting Functions in PostgreSQLβ’10 minutes
2 assignmentsβ’Total 36 minutes
- Checklist: ETL with MySQL, PostgreSQL, and Bashβ’6 minutes
- Graded Quiz: ETL with MySQL, PostgreSQL, and Bashβ’30 minutes
1 app itemβ’Total 30 minutes
- Hands-on Lab: ETL with MySQL, PostgreSQL, and Bashβ’30 minutes
1 pluginβ’Total 15 minutes
- Assignment Overview: ETL with MySQL, PostgreSQL, and Bashβ’15 minutes
In this final module, you will submit the outcomes from your hands-on labs for evaluation. Your submission will be assessed either by an AI-based evaluation tool or through a peer-graded assessment,
What's included
2 readings1 peer review1 app item
2 readingsβ’Total 3 minutes
- Congrats & Next Stepsβ’2 minutes
- Thanks from the Course Teamβ’1 minute
1 peer reviewβ’Total 20 minutes
- Option 2 - Peer Graded: Final Project - Submission and Evaluationβ’20 minutes
1 app itemβ’Total 60 minutes
- Option 1 - AI Graded: Final Project - Submission and Evaluationβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Data Management
- Status: Free Trial
Course
- Status: Free TrialU
University of Colorado System
Course
- Status: Free Trial
Professional Certificate
- Status: PreviewN
Northeastern University
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
93.10%
- 4 stars
3.44%
- 3 stars
0%
- 2 stars
0%
- 1 star
3.44%
Showing 3 of 29
Reviewed on Nov 1, 2025
Thank you for this valuable course I really found it very useful
Frequently asked questions
This project requires you to engineer a production-ready data storage architecture by managing data across multiple specialized layers. You will start by working with a MySQL OLTP (Online Transaction Processing)database to handle live, transactional e-commerce data. From there, you will design and implement a structured data warehouse solution, migrating the raw transactional records into a dedicated PostgreSQL staging warehouse before transforming them into production-ready schemas.
You will gain hands-on experience building lightweight, reliable ETL pipelines using core systems tools rather than heavy enterprise software. You will write functional Linux shell scripts to extract daily incremental transaction records from MySQL and load them into your staging environment. To ensure the pipeline operates without manual intervention, you will learn how to configure and schedule these automated scripts using Cron Jobs, a vital skill for managing routine data engineering workflows.
Data warehousing relies on optimized analytical structures. In this capstone, you will perform data transformations on staging tables to populate standard dimension tables and fact tables(forming a star schema model)optimized for rapid querying. Once your data is structured, you will export the tables into the production warehouse and connect them to IBM Cognos Analytics to design an executive reporting dashboard that reflects critical e-commerce business metrics.
More questions
Financial aid available,
ΒΉ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
