Data Analysis with Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Data Analysis with Python
This course is part of multiple programs.
Instructor: Joseph Santarcangelo
648,808 already enrolled
Included with
Learn more
Ask Coursera
19,718 reviews
Recommended experience
19,718 reviews
Recommended experience
What you'll learn
Construct Python programs to clean and prepare data for analysis by addressing missing values, formatting inconsistencies, normalization, and binning
Analyze real-world datasets through exploratory data analysis (EDA) using libraries such as Pandas, NumPy, and SciPy to uncover patterns and insights
Apply data operation techniques using dataframes to organize, summarize, and interpret data distributions, correlation analysis, and data pipelines
Develop and evaluate regression models using Scikit-learn, and use these models to generate predictions and support data-driven decision-making
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 6 modules in this course
Analyzing data with Python is a key skill for aspiring Data Scientists and Analysts!
This course takes you from the basics of importing and cleaning data to building and evaluating predictive models. Youβll learn how to collect data from various sources, wrangle and format it, perform exploratory data analysis (EDA), and create effective visualizations. As you progress, youβll build linear, multiple, and polynomial regression models, construct data pipelines, and refine your models for better accuracy. Through hands-on labs and projects, youβll gain practical experience using popular Python libraries such as Pandas, NumPy, Matplotlib, Seaborn, SciPy, and Scikit-learn. These tools will help you manipulate data, create insights, and make predictions. By completing this course, youβll not only develop strong data analysis skills but also earn a Coursera certificate and an IBM digital badge to showcase your achievement.
This module introduces the foundational skills required to begin data analysis using Python. You will learn how to understand dataset structures, identify key variables, and import data from different sources using Python libraries such as Pandas and NumPy. The module also explores how to retrieve data from databases using SQLite and perform basic dataset exploration. Through hands-on labs, you will practice importing and examining real-world datasets such as laptop pricing and used car pricing.
What's included
6 videos3 readings2 assignments2 app items
6 videosβ’Total 20 minutes
- Course Introductionβ’2 minutes
- Understanding the Dataβ’3 minutes
- Python Packages for Data Scienceβ’3 minutes
- Importing and Exporting Data in Pythonβ’4 minutes
- Getting Started Analyzing Data in Pythonβ’4 minutes
- Accessing Databases with Pythonβ’4 minutes
3 readingsβ’Total 18 minutes
- Lesson Summaryβ’3 minutes
- Overview: Laptop Pricing Data Setβ’5 minutes
- Module 1 Cheat Sheet: Importing Data Setsβ’10 minutes
2 assignmentsβ’Total 42 minutes
- Graded Quiz: Importing Data Setsβ’30 minutes
- Practice Quiz: Importing Data Setsβ’12 minutes
2 app itemsβ’Total 35 minutes
- Lab: Importing Data Sets - Used Cars Pricingβ’15 minutes
- Lab: Importing Datasets - Laptop Pricingβ’20 minutes
This module focuses on preparing data for analysis through essential data wrangling techniques. You will learn how to clean, transform, and format datasets by handling missing values, converting data types, normalizing numerical values, and creating bins for analysis. The module also introduces methods for transforming categorical variables into numerical representations suitable for modeling. Through hands-on exercises, you will apply these techniques to real-world datasets.
What's included
6 videos2 readings2 assignments2 app items
6 videosβ’Total 20 minutes
- Pre-processing Data in Pythonβ’2 minutes
- Dealing with Missing Values in Pythonβ’6 minutes
- Data Formatting in Pythonβ’4 minutes
- Data Normalization in Pythonβ’4 minutes
- Binning in Pythonβ’2 minutes
- Turning Categorical Variables into Quantitative Variables in Pythonβ’2 minutes
2 readingsβ’Total 18 minutes
- Lesson Summaryβ’3 minutes
- Module 2 Cheat Sheet: Data Wranglingβ’15 minutes
2 assignmentsβ’Total 36 minutes
- Graded Quiz: Data Wranglingβ’24 minutes
- Practice Quiz: Data Wranglingβ’12 minutes
2 app itemsβ’Total 45 minutes
- Lab: Data Wrangling - Used Cars Pricingβ’30 minutes
- Lab: Data Wrangling - Laptop Pricingβ’15 minutes
This module develops your ability to analyze and understand datasets through exploratory data analysis techniques. You will learn how to calculate descriptive statistics, perform correlation analysis, and apply grouping techniques to uncover relationships between variables. The module also introduces data visualization methods and statistical tests such as the chi-square test for categorical variables. Through practical labs, you will analyze datasets to identify trends, patterns, and potential insights.
What's included
5 videos3 readings2 assignments2 app items1 plugin
5 videosβ’Total 15 minutes
- Exploratory Data Analysisβ’1 minute
- Descriptive Statisticsβ’5 minutes
- GroupBy in Pythonβ’3 minutes
- Correlationβ’3 minutes
- Correlation - Statisticsβ’3 minutes
3 readingsβ’Total 38 minutes
- Creating Different Types of Plots in Pythonβ’20 minutes
- Lesson Summaryβ’3 minutes
- Module 3 Cheat Sheet: Exploratory Data Analysisβ’15 minutes
2 assignmentsβ’Total 27 minutes
- Graded Quiz: Exploratory Data Analysisβ’15 minutes
- Practice Quiz: Exploratory Data Analysisβ’12 minutes
2 app itemsβ’Total 60 minutes
- Lab: Exploratory Data Analysis - Used Car Pricingβ’30 minutes
- Lab: Exploratory Data Analysis - Laptop Pricingβ’30 minutes
1 pluginβ’Total 15 minutes
- Chi-Square Test for Categorical Variablesβ’15 minutes
This module introduces the fundamentals of building predictive models using regression techniques. You will learn how to construct simple linear, multiple linear, and polynomial regression models to analyze relationships between variables. The module also covers methods for evaluating model performance using metrics such as R-squared and Mean Squared Error. Visualization techniques such as residual plots and KDE plots are used to assess how well models fit the data.
What's included
6 videos3 readings2 assignments2 app items
6 videosβ’Total 27 minutes
- Model Developmentβ’2 minutes
- Linear Regression and Multiple Linear Regressionβ’7 minutes
- Model Evaluation using Visualizationβ’5 minutes
- Polynomial Regression and Pipelinesβ’5 minutes
- Measures for In-Sample Evaluationβ’4 minutes
- Prediction and Decision Makingβ’5 minutes
3 readingsβ’Total 33 minutes
- Kernel Density Estimation (KDE) Plots for Model Evaluationβ’15 minutes
- Lesson Summaryβ’3 minutes
- Module 4 Cheat Sheet: Model Developmentβ’15 minutes
2 assignmentsβ’Total 27 minutes
- Graded Quiz: Model Developmentβ’15 minutes
- Practice Quiz: Model Developmentβ’12 minutes
2 app itemsβ’Total 75 minutes
- Lab: Model Development - Used Car Pricingβ’30 minutes
- Lab: Model Development - Laptop Pricingβ’45 minutes
This module focuses on improving model performance through evaluation and optimization techniques. You will learn how to detect overfitting and underfitting and apply strategies to improve model generalization. The module introduces ridge regression and hyperparameter tuning using grid search to refine predictive models. Through hands-on labs, you will evaluate and improve regression models using real-world datasets.
What's included
4 videos3 readings2 assignments2 app items
4 videosβ’Total 21 minutes
- Model Evaluation and Refinementβ’8 minutes
- Overfitting, Underfitting and Model Selectionβ’4 minutes
- Ridge Regressionβ’5 minutes
- Grid Searchβ’5 minutes
3 readingsβ’Total 22 minutes
- Introduction to Ridge Regressionβ’5 minutes
- Lesson Summaryβ’2 minutes
- Module 5 Cheat Sheet: Model Evaluation and Refinementβ’15 minutes
2 assignmentsβ’Total 30 minutes
- Graded Quiz: Model Evaluation and Refinementβ’15 minutes
- Practice Quiz: Model Evaluation and Refinement β’15 minutes
2 app itemsβ’Total 75 minutes
- Lab: Model Evaluation and Refinement - Used Cars Pricingβ’30 minutes
- Lab: Model Evaluation and Refinement - Laptop Pricingβ’45 minutes
In this module, you will apply the full data analysis workflow learned throughout the course. You will import, clean, analyze, and model real-world datasets to generate insights and predictions. The module includes a practice project and a final project that simulate real data analysis scenarios. You will also complete a final exam to demonstrate your understanding of key concepts in Python-based data analysis.
What's included
7 readings2 assignments3 app items
7 readingsβ’Total 35 minutes
- Practice Project Overviewβ’5 minutes
- Final Project Overviewβ’5 minutes
- Final Project Submission Guidelines and Deliverablesβ’5 minutes
- Cheat Sheet: Data Analysis for Pythonβ’10 minutes
- IBM Digital Badgeβ’2 minutes
- Congratulations and Next Stepsβ’5 minutes
- Thanks from the Course Teamβ’3 minutes
2 assignmentsβ’Total 90 minutes
- Option 2: AI Graded - Final Project Submission and Evaluationβ’30 minutes
- Final Examβ’60 minutes
3 app itemsβ’Total 195 minutes
- Option 1: AI Graded - Final Project Submission and Evaluationβ’60 minutes
- Practice Project - Data Analytics for Insurance Cost Data Setβ’75 minutes
- Final Project - Data Analytics for House Pricing Data Setβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Data Analysis
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
76.40%
- 4 stars
18.17%
- 3 stars
3.64%
- 2 stars
0.92%
- 1 star
0.85%
Showing 3 of 19718
Reviewed on Mar 9, 2020
Very good course that goes straight to the main topics needed to work on data analysis using Python. This will kick start my learning process which will be followed with a lot of coding practices.
Reviewed on Jul 16, 2020
Although good to learn the know-how of basic data analysis techniques, the quizzes are predictable and you don't end up coding as much as you should. A good starter course to wet your feet in DA!
Reviewed on Dec 8, 2021
Most of what you'll learn in this package are fundamentals to other knowledge areas. So, practice both in and out of the course. Iβ appreciate the coordinators in making it possible. Thank you.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
ΒΉ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
