VOOZH about

URL: https://www.coursera.org/learn/packt-aws-data-processing-and-analysis-fezz2

⇱ AWS Data Processing and Analysis | Coursera


AWS Data Processing and Analysis

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

9 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

9 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Integrate and scale data pipelines using AWS Lambda and Glue for efficient data processing

  • Analyze real-time data streams with Kinesis Analytics and OpenSearch to gain actionable insights

  • Implement security measures and manage data workflows for high-performance analysis

  • Analyze real-time data streams with Kinesis Analytics and OpenSearch to gain actionable insights

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

2 assignments

Taught in English

Build your subject-matter expertise

This course is part of the AWS Certified Data Analytics Specialty (2023) Hands-on Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 2 modules in this course

Updated in May 2025.

This course now features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. This course takes you through the complete process of data handling, starting with AWS data processing services. You’ll begin with AWS Lambda, learning how to integrate serverless functions and manage scalable data pipelines. With practical exercises, you’ll explore how AWS Glue helps automate data preparation and manage complex ETL jobs, making data lake partitioning and modification of Glue Data Catalog easy to understand. Hands-on experience with Glue Studio and DataBrew will further enhance your knowledge in preparing data for analysis. The course also delves into processing large datasets using Amazon EMR, where you’ll work with Apache Spark, Hive, and other tools in the Hadoop ecosystem. You’ll learn to optimize data processing with EMR, partition and store data efficiently, and integrate it with AWS services like Kinesis and Redshift. Exercises in Apache Spark will show you how to analyze data streams and deliver actionable insights in real time. Lastly, you'll focus on the analysis aspect using services like Kinesis Analytics, OpenSearch, and Athena. The course will guide you through setting up advanced analytics using Kinesis, creating real-time monitoring applications, and visualizing data using OpenSearch and QuickSight. By the end of this course, you’ll be well-equipped to build, process, and analyze data pipelines at scale using AWS’s powerful tools. This course is ideal for data engineers, IT professionals, and data analysts aiming to leverage AWS for data processing and analysis. Some familiarity with AWS services is recommended.

In this module, we will delve into AWS processing services, beginning with an introduction to AWS Lambda and Glue. You’ll learn how to integrate these tools for serverless and ETL workflows. We will also explore advanced topics such as Glue ETL job execution, Lambda's cost optimization strategies, and EMR’s integration with other AWS services like Apache Spark, Hive, and Hadoop. Hands-on exercises will cover using Spark with Kinesis and Redshift, and how to process data lakes with EMR.

What's included

35 videos2 readings

35 videosTotal 214 minutes
  • Section Introduction: Processing1 minute
  • What Is AWS Lambda?5 minutes
  • Lambda Integration - Part 15 minutes
  • Lambda Integration - Part 27 minutes
  • Lambda Costs, Promises, and Anti-Patterns5 minutes
  • (Exercise) AWS Lambda9 minutes
  • What Is Glue? + Partitioning Your Data Lake6 minutes
  • Glue, Hive, and ETL14 minutes
  • Modifying the Glue Data Catalog from ETL Scripts2 minutes
  • Glue ETL: Developer Endpoints, Running ETL Jobs with Bookmarks4 minutes
  • Glue Costs and Anti-Patterns3 minutes
  • AWS Glue Studio5 minutes
  • AWS Glue Data Quality3 minutes
  • AWS Glue DataBrew9 minutes
  • AWS Lake Formation9 minutes
  • AWS Lake Security4 minutes
  • Elastic MapReduce (EMR) Architecture and Usage9 minutes
  • EMR, AWS integration, and Storage8 minutes
  • EMR Promises; Introduction to Hadoop8 minutes
  • EMR Serverless, EMR, and EKS12 minutes
  • Introduction to Apache Spark9 minutes
  • Spark Integration with Kinesis and Redshift4 minutes
  • Spark integration with Athena3 minutes
  • Hive on EMR8 minutes
  • Pig on EMR2 minutes
  • HBase on EMR4 minutes
  • Presto on EMR3 minutes
  • Zeppelin and EMR Notebooks5 minutes
  • Hue, Splunk, and Flume4 minutes
  • S3DistCP and Other Services5 minutes
  • EMR Security and Instance Types6 minutes
  • (Exercise) Elastic MapReduce, Part 117 minutes
  • (Exercise) Elastic MapReduce, Part 210 minutes
  • AWS Data Pipeline5 minutes
  • AWS Step Functions4 minutes
2 readingsTotal 20 minutes
  • Introduction to the Course 'AWS Data Processing and Analysis'10 minutes
  • Full Specialization Resources10 minutes

In this module, we will focus on analyzing and querying data using AWS’s powerful analytics services. We begin with an introduction to Kinesis Analytics, OpenSearch, and Athena, followed by performance tuning and security best practices. Through hands-on exercises, you’ll build real-world applications to monitor data streams, optimize queries using Glue and Athena, and perform data warehousing with Redshift. Additionally, we’ll explore Redshift's durability, distribution styles, and newer features like AQUA and serverless options to improve large-scale data analytics.

What's included

32 videos1 reading2 assignments

32 videosTotal 220 minutes
  • Section Introduction: Analysis1 minute
  • Introduction to Kinesis Analytics8 minutes
  • Kinesis Analytics Costs; RANDOM_CUT_FOREST2 minutes
  • (Exercise) Kinesis Analytics, Part 17 minutes
  • (Exercise) Kinesis Analytics, Part 210 minutes
  • (Exercise) Kinesis Analytics, Part 317 minutes
  • (Exercise) Kinesis Analytics, Part 45 minutes
  • Introduction to OpenSearch (formerly Elasticsearch)11 minutes
  • Amazon OpenSearch Service7 minutes
  • OpenSearch Index Management and Designing for Stability11 minutes
  • Amazon OpenSearch Service Performance2 minutes
  • Amazon OpenSearch Serverless2 minutes
  • (Exercise) Amazon OpenSearch Service26 minutes
  • Introduction to Athena4 minutes
  • Athena and Glue, Costs, and Security8 minutes
  • Athena Performance2 minutes
  • Athena ACID Transactions3 minutes
  • (Exercise) AWS Glue and Athena13 minutes
  • Redshift Introduction and Architecture9 minutes
  • Redshift Spectrum and Performance Tuning5 minutes
  • Redshift Durability and Scaling4 minutes
  • Redshift Distribution Styles3 minutes
  • Redshift Sort Keys3 minutes
  • Redshift Data Flows and the COPY command8 minutes
  • Redshift Integration / WLM / Vacuum / Anti-Patterns11 minutes
  • Redshift Resizing (Elastic Versus Classic) and New Redshift Features in 20204 minutes
  • Newer Redshift Features, AQUA6 minutes
  • Redshift Security Concerns2 minutes
  • Redshift Serverless7 minutes
  • (Exercise) Redshift Spectrum, Part 18 minutes
  • (Exercise) Redshift Spectrum, Part 26 minutes
  • Amazon Relational Database Service (RDS) and Aurora4 minutes
1 readingTotal 10 minutes
  • Conclusion to the Course 'AWS Data Processing and Analysis'10 minutes
2 assignmentsTotal 75 minutes
  • Full Course Assessment60 minutes
  • Full Course Practice Assessment15 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Offered by

Explore more from Cloud Computing

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

This course is currently available only to learners who have paid or received financial aid, when available.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,