VOOZH about

URL: https://www.coursera.org/learn/data-storage-and-queries

⇱ Data Storage and Queries | Coursera


Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Data Storage and Queries

Instructors: Joe Reis

Top Instructor

8,320 already enrolled

Ask Coursera

Gain insight into a topic and learn the fundamentals.
4.6

84 reviews

Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
4.6

84 reviews

Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Design storage architectures for various use cases, and select appropriate technologies to implement these architectures

  • Practice common query patters and identify ways to improve query performance and enhance the value of your data systems

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

3 assignments

Taught in English

Build your Cloud Computing expertise

This course is part of the DeepLearning.AI Data Engineering Professional Certificate
When you enroll in this course, you'll also be enrolled in this Professional Certificate.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate from DeepLearning.AI

There are 3 modules in this course

In this course, you will learn about the raw ingredients and processes that are used to physically store data on disk and in memory. You’ll explore different storage systems, including object, block, and file storage, as well as databases, that are built on top of these raw ingredients. You’ll also get a chance to use the Cypher language to query a Neo4j graph database, and perform vector similarity search, a key feature behind generative AI and large language models. You will explore the evolution of data storage abstractions, from data warehouses, to data lakes, and data lakehouses, while comparing the advantages and drawbacks of each architectural paradigm. With hands-on practice, you will design a simple data lake using Amazon Glue, and build a data lakehouse using AWS LakeFormation and Apache Iceberg. In the last week of this course, you’ll see how queries work behind the scenes, practice writing more advanced SQL queries, compare the query performance in row vs column-oriented storage, and perform streaming queries using Apache Flink.

What's included

16 videos12 readings1 assignment1 programming assignment1 ungraded lab

16 videosTotal 103 minutes
  • Welcome to Course 34 minutes
  • Course 3 Overview4 minutes
  • Storage Raw Ingredients- Physical Components of Data Storage8 minutes
  • Storage Raw Ingredients - Processes Required for Data Storage6 minutes
  • Cloud Storage Options: Block, Object and File storage8 minutes
  • Storage Tiers - Hot, Warm, & Cold Data3 minutes
  • Distributed Storage Systems7 minutes
  • Lab Walkthrough - Comparing Cloud Storage Options4 minutes
  • How Databases Store Data5 minutes
  • Row vs Column Storage6 minutes
  • Graph Databases5 minutes
  • Vector Databases5 minutes
  • Neo4j and Cypher Query Language (Part 1)4 minutes
  • Neo4j and Cypher Query Language (Part 2)9 minutes
  • [Optional] - Conversation with Juan Sequeda24 minutes
  • Week 1 Summary2 minutes
12 readingsTotal 60 minutes
  • Program Syllabus5 minutes
  • [Optional] Compression Algorithms5 minutes
  • [Optional] Database Partitioning/Sharding Methods5 minutes
  • [IMPORTANT] Guidelines before you start the labs in this course10 minutes
  • [Optional] FAQ VS Code Lab Environment5 minutes
  • Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!2 minutes
  • [Optional] The Parquet Format5 minutes
  • [Optional] Wide-Column Databases5 minutes
  • [Optional] ANN Algorithm: Hierarchical Navigable Small World (HNSW)5 minutes
  • [Optional] - Links to Data and Cypher Instructions2 minutes
  • Lecture Notes W11 minute
  • Week 1 Resources10 minutes
1 assignmentTotal 30 minutes
  • Week 1 Quiz30 minutes
1 programming assignmentTotal 120 minutes
  • Assignment 1: Graph Databases and Vector Search with Neo4j120 minutes
1 ungraded labTotal 120 minutes
  • Practice Lab: Comparing Cloud Data Storage Options120 minutes

What's included

16 videos2 readings1 assignment1 programming assignment1 ungraded lab

16 videosTotal 84 minutes
  • Week 2 Overview2 minutes
  • [Optional] Conversation with Bill Inmon12 minutes
  • Data Warehouse - Key Architectural Ideas6 minutes
  • Modern Cloud Data Warehouses4 minutes
  • Data Lakes - Key Architectural Ideas4 minutes
  • Next-Generation Data Lakes 5 minutes
  • Lab Walkthrough - Simple Data Lake with AWS Glue (Part 1)5 minutes
  • Lab Walkthrough - Simple Data Lake with AWS Glue (Part 2)6 minutes
  • Lab Walkthrough - Simple Data Lake with AWS Glue (Part 3 - Optional)3 minutes
  • The Data Lakehouse Architecture3 minutes
  • Date Lakehouse Implementation5 minutes
  • Lakehouse Architecture on AWS5 minutes
  • Implementing a Lakehouse on AWS8 minutes
  • Lab Walkthrough - Building a Data Lakehouse with AWS Lake Formation and Apache Iceberg (Part 1)10 minutes
  • Lab Walkthrough - Building a Data Lakehouse with AWS Lake Formation and Apache Iceberg (Part 2)5 minutes
  • Week 2 Summary2 minutes
2 readingsTotal 11 minutes
  • Lecture Notes W21 minute
  • Week 2 Resources10 minutes
1 assignmentTotal 30 minutes
  • Week 2 Quiz30 minutes
1 programming assignmentTotal 120 minutes
  • Assignment 2: Building a Data Lakehouse with AWS Lake Formation and Apache Iceberg120 minutes
1 ungraded labTotal 120 minutes
  • Practice Lab: Simple Data Lake with AWS Glue120 minutes

What's included

15 videos4 readings1 assignment1 programming assignment2 ungraded labs

15 videosTotal 77 minutes
  • Week 3 Overview3 minutes
  • The Life of a Query5 minutes
  • Advanced SQL Queries (Part 1)6 minutes
  • Advanced SQL Queries (Part 2)7 minutes
  • Index Deep Dive 7 minutes
  • Retrieving Only the Data You Need3 minutes
  • The Join Statement8 minutes
  • Aggregate Queries 3 minutes
  • Amazon Redshift Cloud Data Warehouse9 minutes
  • Lab Walkthrough - Comparing the Query Performance Between Row and Columnar Storage3 minutes
  • Additional Query Strategies5 minutes
  • Queries on Streaming data 5 minutes
  • Deploying an Application with Amazon Managed Service for Apache Flink6 minutes
  • Deploying a Studio Notebook with Amazon Managed Service for Apache Flink6 minutes
  • Course 3 Summary1 minute
4 readingsTotal 19 minutes
  • [Optional] - Additional Index Examples3 minutes
  • Lecture Notes W31 minute
  • Week 3 Resources10 minutes
  • Acknowledgments5 minutes
1 assignmentTotal 30 minutes
  • Week 3 Quiz30 minutes
1 programming assignmentTotal 180 minutes
  • Assignment 3: Advanced SQL Queries180 minutes
2 ungraded labsTotal 240 minutes
  • Practice Lab 1: Comparing the Query Performance Between Row-Oriented and Column-Oriented Databases120 minutes
  • Practice Lab 2: Streaming Queries with Apache Flink120 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.9 (15 ratings)

Top Instructor

DeepLearning.AI
4 Courses47,554 learners

Explore more from Cloud Computing

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    82.14%

  • 4 stars

    9.52%

  • 3 stars

    2.38%

  • 2 stars

    2.38%

  • 1 star

    3.57%

Showing 3 of 84

RR
·

Reviewed on Apr 24, 2025

This is a really excellent course covering a number of topics that anyone going into data engineering should be familiar with.

JT
·

Reviewed on Oct 6, 2025

Just excellent all around (from a current practitioner)

PS
·

Reviewed on Sep 15, 2025

Solid! bit of all but too in depth nor too practice oriented

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Financial aid available,