Python for Data Analytics
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Python for Data Analytics
This course is part of DeepLearning.AI Data Analytics Professional Certificate
Instructor: Sean Barnes
Top Instructor
10,499 already enrolled
Ask Coursera
25 reviews
Recommended experience
25 reviews
Recommended experience
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your Data Analysis expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from DeepLearning.AI
There are 5 modules in this course
This comprehensive course guides students through the complete data analytics workflow using Python, combining programming fundamentals with advanced statistical analysis. The curriculum is structured across five interconnected modules that build upon each other, using real-world datasets to provide practical, hands-on experience.
Starting with programming fundamentals, you'll learn essential Python concepts while working with real datasets like public library revenue and restaurant safety inspections. The course introduces the Jupyter Notebook environment and transitions students from spreadsheet-based analysis to powerful programmatic approaches. Students master core programming concepts including variables, functions, and control flow structures. This course helps you bridge the gap between theoretical knowledge and practical application, enabling you to become proficient in using Python for comprehensive data analysis, from basic data manipulation to advanced statistical modeling and forecasting.
This module is an introduction to Python programming, designed for beginners with no prior coding experience. You will explore the fundamental concepts and practices that underpin programming languages, with a specific focus on their application in data manipulation and analysis.
What's included
24 videos10 readings4 assignments1 programming assignment3 ungraded labs
24 videos•Total 97 minutes
- Welcome to Course 3•6 minutes
- Generative AI in this course•2 minutes
- Module 1 introduction•2 minutes
- Computer programming•4 minutes
- Navigating the Jupyter notebook environment•4 minutes
- Input, processing, output•3 minutes
- Python or a spreadsheet?•4 minutes
- Types and expressions•3 minutes
- Printing and comments•4 minutes
- Storing information: variables•4 minutes
- Debugging with variables•5 minutes
- Creating lists•4 minutes
- List operations•5 minutes
- Taking action: calling functions•5 minutes
- State•3 minutes
- Control flow•2 minutes
- Comparison•5 minutes
- Branching code: if & else•4 minutes
- Repeating actions: for loops•4 minutes
- Indentation•5 minutes
- Branching code: elif•5 minutes
- Repeating actions: range•4 minutes
- Execution order•3 minutes
- Your first graded lab•3 minutes
10 readings•Total 78 minutes
- Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!•2 minutes
- [Optional] Practice with types, expressions, and printing•5 minutes
- Variable names•15 minutes
- [Optional] Practice with variables•10 minutes
- [Optional] Practice with lists and functions•10 minutes
- [Optional] Practice with comparisons and if statements•5 minutes
- [Optional] Practice with branching code•5 minutes
- [Optional] Practice with loops•5 minutes
- Python cheat sheet•20 minutes
- Module 1 lecture notes•1 minute
4 assignments•Total 230 minutes
- Module 1 quiz•30 minutes
- Lesson 1 quiz•10 minutes
- Lesson 2 quiz•10 minutes
- Lesson 3 quiz•180 minutes
1 programming assignment•Total 80 minutes
- Retail sales analysis•80 minutes
3 ungraded labs•Total 130 minutes
- Module 1 lecture code•10 minutes
- Air quality in New York City - Data Exploration•60 minutes
- Air quality in New York City - Looping Through Data•60 minutes
This module introduces essential data analysis techniques using Python and the pandas library. You will learn how to import and work with data efficiently, leveraging DataFrames and Series to manipulate, filter, and analyze datasets. The module covers fundamental concepts such as vectorization for performance optimization, distinguishing between attributes and methods, and performing descriptive statistics. Additionally, you will explore data visualization techniques and segmentation methods to extract meaningful insights from structured data.
What's included
19 videos9 readings4 assignments1 programming assignment4 ungraded labs
19 videos•Total 71 minutes
- Module 2 introduction•1 minute
- Beyond lists•4 minutes
- Importing modules•4 minutes
- Pandas•2 minutes
- Reading CSV into Python•5 minutes
- DataFrames•4 minutes
- Attributes and methods•2 minutes
- Selecting columns•5 minutes
- Counts, sums, & histograms•4 minutes
- Sorting•3 minutes
- Sorting by multiple columns•3 minutes
- Filtering•5 minutes
- Filtering by multiple conditions•2 minutes
- Selecting rows•5 minutes
- Central tendency, variability, and skewness•5 minutes
- Categorical data•5 minutes
- Correlation•6 minutes
- Segmentation by one feature•3 minutes
- Segmentation by multiple features•5 minutes
9 readings•Total 101 minutes
- About the 2016 New Coder Survey data set•5 minutes
- [Optional] Practice with DataFrames•20 minutes
- Dictionaries and NumPy arrays•20 minutes
- [Optional] Practice with sorting•10 minutes
- [Optional] Practice with filtering•10 minutes
- Selection in Pandas•10 minutes
- [Optional] Practice with descriptive statistics•15 minutes
- Python Cheat Sheet•10 minutes
- Module 2 lecture notes•1 minute
4 assignments•Total 60 minutes
- Module 2 quiz•30 minutes
- Lesson 1 quiz•10 minutes
- Lesson 2 quiz•10 minutes
- Lesson 3 quiz•10 minutes
1 programming assignment•Total 80 minutes
- Retail sales - Expanding your analysis•80 minutes
4 ungraded labs•Total 120 minutes
- Module 2 lecture code•30 minutes
- Practice Lab: Buenos Aires Subway - Data structures•30 minutes
- Practice Lab: Buenos Aires Subway - Sorting and filtering•30 minutes
- Practice Lab: Buenos Aires subway - Descriptive statistics•30 minutes
This module focuses on data visualization using Python, covering essential tools and techniques for creating effective visuals. You will learn to generate visualizations directly from pandas DataFrames and Series, as well as use popular libraries like matplotlib and Seaborn to develop custom plots. The module explores various visualization types, from basic line graphs and bar charts to advanced distribution and categorical plots. Additionally, you will learn how to enhance readability through styling, annotations, and design choices to highlight trends, patterns, and anomalies in data.
What's included
18 videos4 readings4 assignments1 programming assignment4 ungraded labs
18 videos•Total 75 minutes
- Module 3 introduction•1 minute
- Plotting with matplotlib•6 minutes
- Colors, grids, & saving plots•4 minutes
- Text & annotations•5 minutes
- Ticks & spines•5 minutes
- Grouped column charts•6 minutes
- Stacked column charts •3 minutes
- Scatter plots•4 minutes
- Method chaining•4 minutes
- Plotting with Seaborn•5 minutes
- Themes & palettes•3 minutes
- Box plots•5 minutes
- Histograms•4 minutes
- Other charts•5 minutes
- Combining charts•4 minutes
- Matplotlib subplots•4 minutes
- Looping with subplots•3 minutes
- Seaborn pairplot•3 minutes
4 readings•Total 41 minutes
- Reading Documentation•15 minutes
- [Optional] Practice with method chaining•15 minutes
- Python Cheat Sheet•10 minutes
- Module 3 lecture notes•1 minute
4 assignments•Total 80 minutes
- Module 3 quiz•30 minutes
- Lesson 1 quiz•10 minutes
- Lesson 2 quiz•10 minutes
- Lesson 3 quiz•30 minutes
1 programming assignment•Total 80 minutes
- Exploring Australia's coral reefs•80 minutes
4 ungraded labs•Total 90 minutes
- Module 3 lecture code•30 minutes
- Practice Lab: Flight delays and cancellations - Matplotlib Charts•20 minutes
- Practice Lab: Flight delays and cancellations - Plotting with Seaborn•20 minutes
- Practice Lab: Flight delays and cancellations - Histograms and rugplots•20 minutes
This module introduces statistical inference and regression modeling using Python. You will learn to construct confidence intervals, perform hypothesis testing with t-tests, and simulate data using NumPy. The module covers both simple and multiple linear regression, guiding you through model development, interpretation of key metrics (such as R-squared, p-values, and coefficients), and prediction of new data points. Additionally, you will explore methods to encode categorical variables, evaluate model performance using error metrics, and refine regression models with the help of Large Language Models (LLMs).
What's included
20 videos6 readings4 assignments1 programming assignment4 ungraded labs
20 videos•Total 89 minutes
- Module 4 introduction•2 minutes
- Confidence intervals•6 minutes
- One-sample t-tests•5 minutes
- Two-sample t-tests•4 minutes
- Simulation: uniform•5 minutes
- Simulation: normal•4 minutes
- What is linear regression?•6 minutes
- Choosing an independent variable•3 minutes
- Training the model•5 minutes
- Interpreting the output of a regression model•5 minutes
- Prediction•5 minutes
- Multiple linear regression•4 minutes
- Training a multiple linear regression model •5 minutes
- Interpreting multiple linear regression•4 minutes
- Encoding categorical data•4 minutes
- Modeling with categorical data•5 minutes
- Prediction: Multiple Linear Regression•3 minutes
- Evaluating your model•5 minutes
- LLMs for model iteration•6 minutes
- The linear regression process•3 minutes
6 readings•Total 56 minutes
- [Optional] Practice with confidence intervals•10 minutes
- [Optional] Practice with simulating random outcomes•15 minutes
- Nonlinear transformations•10 minutes
- Interaction features•10 minutes
- Python Cheat Sheet•10 minutes
- Module 4 lecture notes•1 minute
4 assignments•Total 60 minutes
- Module 4 quiz•30 minutes
- Lesson 1 quiz•10 minutes
- Lesson 2 quiz•10 minutes
- Lesson 3 quiz•10 minutes
1 programming assignment•Total 80 minutes
- Analyzing Car CO₂ Emissions•80 minutes
4 ungraded labs•Total 120 minutes
- Module 4 lecture code•30 minutes
- Practice Lab: London housing prices - Confidence intervals and hypothesis testing•30 minutes
- Practice Lab: London housing prices - Linear regression•30 minutes
- Practice Lab: London housing prices - Regression with categorical data•30 minutes
This module explores working with time series data in Python, focusing on DateTime objects, indexing, and visualization. You will learn to manipulate time-based data, apply descriptive statistics, and segment time series by key date features. The module covers resampling and reshaping techniques, as well as using simple and multiple linear regression to model trends and seasonality. Additionally, you will evaluate forecasting models using appropriate error metrics to assess their performance.
What's included
14 videos5 readings4 assignments2 programming assignments5 ungraded labs
14 videos•Total 61 minutes
- Module 5 introduction•1 minute
- DateTimes•5 minutes
- Using DateTimes as indices•4 minutes
- Line charts•4 minutes
- Formatting date axis labels•6 minutes
- Moving average•5 minutes
- Percent change•5 minutes
- Segmentation•5 minutes
- Multiple line charts: reshaping•6 minutes
- Resampling•5 minutes
- Forecasting with the trend•6 minutes
- Forecasting with seasonality•5 minutes
- Error metrics for forecasting•4 minutes
- Your next steps•1 minute
5 readings•Total 36 minutes
- [Optional] Practice with Datetimes•10 minutes
- [Optional] Practice with reshaping (pivoting) dataframes•10 minutes
- Python Cheat Sheet•10 minutes
- Module 5 lecture notes•1 minute
- Acknowledgments•5 minutes
4 assignments•Total 50 minutes
- Module 5 quiz•30 minutes
- Lesson 1 quiz•10 minutes
- Lesson 2 quiz•5 minutes
- Lesson 3 quiz•5 minutes
2 programming assignments•Total 160 minutes
- Analyzing Chlorophyll levels in Australian Coral Reefs•80 minutes
- Capstone: Loan Interest Rates•80 minutes
5 ungraded labs•Total 160 minutes
- Module 5 lecture code•30 minutes
- Practice Lab: Flight delays and cancellations - Plotting the time series•20 minutes
- Practice Lab: Flight delays and cancellations - Working with time series data•20 minutes
- Practice Lab: Flight delays and cancellations - Linear regression with time series•30 minutes
- (Optional) Installing Python on your computer•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Data Analysis
- Status: PreviewS
Simplilearn
Course
- Status: Free TrialC
Coursera
Specialization
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
80%
- 4 stars
12%
- 3 stars
0%
- 2 stars
4%
- 1 star
4%
Showing 3 of 25
Reviewed on Jun 22, 2025
Provides clear instructions, easy-to-follow tutorials, and lots of resources.
Reviewed on Jun 16, 2025
I love the way the course is structured and how Python is introduced using real-world use-cases.
Reviewed on Sep 4, 2025
The course was very detailed and got useful support whenever the concepts were hard to grasp.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
