VOOZH about

URL: https://www.coursera.org/learn/packt-introduction-to-data-engineering-on-aws-xlk27

⇱ Introduction to Data Engineering on AWS | Coursera


Introduction to Data Engineering on AWS

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Introduction to Data Engineering on AWS

Included with

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

8 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

8 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Master how to use AWS Glue for ETL development and data transformation.

  • Learn to efficiently catalog, process, and manage large-scale datasets with AWS Glue.

  • Explore Amazon Redshift's architecture and optimize query performance for fast data analysis.

  • Gain hands-on experience with Redshift Spectrum and Serverless to enhance data scalability and flexibility.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

6 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Data Engineering on AWS - The Complete Training Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

This course features Coursera Coach!

A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. In this course, you'll gain a comprehensive understanding of data engineering using AWS Glue and Redshift, two critical tools for modern data workflows. You will be equipped with the skills to manage and transform data at scale, from cataloging and processing with AWS Glue to leveraging Redshift for powerful data warehousing and analytics. By diving into hands-on tutorials, you'll learn the core concepts and practical applications necessary to streamline data pipelines and optimize query performance. As you progress through the course, you will explore a variety of AWS Glue features such as Data Catalogs, ETL development, job bookmarking, and data quality evaluation, empowering you to automate data workflows and manage large datasets effectively. With Amazon Redshift, you will learn how to configure clusters, optimize queries, and even work with Redshift Spectrum and Serverless, improving the scalability and efficiency of your data operations. This course is ideal for data professionals looking to enhance their cloud-based data engineering skills, especially those who want to integrate AWS Glue and Redshift into their existing systems. It is suitable for learners with a basic understanding of data analytics, but prior knowledge of AWS or data engineering concepts would be beneficial. The course is designed for both beginners and intermediate learners, offering a solid foundation and practical skills that can be applied in real-world data engineering roles. By the end of the course, you will be able to build and optimize ETL pipelines using AWS Glue, manage data workflows, configure Redshift clusters, optimize query performance, and deploy serverless Redshift for scalable data warehousing solutions.

In this module, we will introduce the concept of data as the new oil and explore its growing importance in the modern digital world. You'll gain a high-level overview of the course and understand the pivotal role data plays in driving innovation and business success.

What's included

1 video2 readings

1 videoTotal 8 minutes
  • Introduction to Specialization8 minutes
2 readingsTotal 20 minutes
  • Introduction to the Course 'Introduction to Data Engineering on AWS'10 minutes
  • Full Specialization Resource10 minutes

In this module, we will introduce your trainer and provide insights into their professional background. You’ll learn what to expect from this course and how their expertise will guide you throughout your learning journey.

What's included

1 video1 assignment

1 videoTotal 3 minutes
  • Know Your Trainer3 minutes
1 assignmentTotal 15 minutes
  • Know Your Trainer - Assessment15 minutes

In this module, we will explore the foundational concepts of data engineering, focusing on how AWS services facilitate modern data analytics. You'll also be introduced to essential terminologies to build a strong understanding of data engineering workflows.

What's included

2 videos1 assignment

2 videosTotal 24 minutes
  • Data Engineering on AWS13 minutes
  • Basic Terminologies11 minutes
1 assignmentTotal 15 minutes
  • Getting Started with Data Analytics - Assessment15 minutes

In this module, we will dive deep into AWS Glue, exploring its features for data cataloging, ETL processes, and data quality management. You’ll gain hands-on experience in setting up and orchestrating workflows that automate data transformation and processing tasks.

What's included

11 videos1 assignment

11 videosTotal 152 minutes
  • Glue Data Catalog32 minutes
  • Glue ETL: Part 14 minutes
  • Glue ETL: Part 210 minutes
  • Glue ETL: Part 322 minutes
  • Workflows12 minutes
  • Job Bookmark5 minutes
  • Execution Type10 minutes
  • Data Quality: Part 15 minutes
  • Data Quality: Part 222 minutes
  • Glue DataBrew21 minutes
  • Additional Features9 minutes
1 assignmentTotal 15 minutes
  • AWS Glue: Catalog and Process Your Data - Assessment15 minutes

In this module, we will explore Amazon Redshift, focusing on its architecture, cluster management, and querying capabilities. You'll also learn about advanced features like Redshift Spectrum, Serverless Redshift, and materialized views to optimize your data warehousing experience.

What's included

14 videos1 reading3 assignments

14 videosTotal 154 minutes
  • Amazon Redshift9 minutes
  • Architecture11 minutes
  • Creating a Cluster17 minutes
  • Query Editor v29 minutes
  • Distribution Styles16 minutes
  • Cluster Operations5 minutes
  • Data API7 minutes
  • Redshift Spectrum21 minutes
  • Redshift Serverless: Part 113 minutes
  • Redshift Serverless: Part 28 minutes
  • Materialized Views10 minutes
  • WLM and Concurrency10 minutes
  • DataShare8 minutes
  • Additional Information11 minutes
1 readingTotal 10 minutes
  • Conclusion to the Course 'Introduction to Data Engineering on AWS'10 minutes
3 assignmentsTotal 90 minutes
  • Amazon Redshift: A Data Warehouse in AWS - Assessment15 minutes
  • Full Course Assessment60 minutes
  • Full Course Practice Assessment15 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Offered by

Explore more from Data Management

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

Data Engineering is the practice of designing and building systems to collect, store, and analyze data. It is a crucial aspect of modern data analytics because businesses and organizations rely on accurate, clean, and easily accessible data to make informed decisions. As data becomes more valuable than ever, the need for professionals who can manage and optimize data workflows is growing, making this field essential in today’s data-driven world.

This course provides an in-depth exploration of data engineering on AWS, focusing on the use of AWS Glue for ETL (Extract, Transform, Load) processes and Amazon Redshift for data warehousing. You’ll learn how to manage data using AWS Glue's features, create ETL pipelines, and leverage Redshift for high-performance data analytics and warehousing. The course covers core concepts like data quality, serverless architecture, and performance optimization in both tools.

Upon completing this course, you’ll be proficient in building and managing ETL pipelines using AWS Glue and deploying and optimizing data warehouses with Amazon Redshift. You’ll be equipped with the skills to work with complex data workflows, ensure data quality, and utilize serverless features for scalability. Additionally, you’ll understand how to create and manage Redshift clusters and enhance query performance.

This course assumes a basic understanding of data analytics concepts and some familiarity with cloud computing. It would be helpful if you have some experience with SQL or data management, but it is not strictly necessary. The course starts with foundational concepts and progresses to more advanced topics, so beginners can also benefit from the content.

This course is designed for anyone interested in data engineering, cloud computing, or data analytics, particularly those looking to specialize in AWS tools like Glue and Redshift. It’s ideal for professionals in data-related fields such as analysts, engineers, and developers who want to deepen their knowledge of cloud-based data management and processing tools.

The course consists of approximately 8 hours of video content, but the actual time to complete it may vary depending on your pace and engagement with the material. It can be completed at your own convenience, and the videos allow you to pause, revisit, and practice as needed.

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

This course is currently available only to learners who have paid or received financial aid, when available.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,