Probability & Statistics for Machine Learning & Data Science
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Probability & Statistics for Machine Learning & Data Science
This course is part of Mathematics for Machine Learning and Data Science Specialization
Instructor: Luis Serrano
114,344 already enrolled
Ask Coursera
689 reviews
Recommended experience
689 reviews
Recommended experience
What you'll learn
Describe and quantify the uncertainty inherent in predictions made by machine learning models
Visually and intuitively understand the properties of commonly used probability distributions in machine learning and data science
Apply common statistical methods like maximum likelihood estimation (MLE) and maximum a priori estimation (MAP) to machine learning problems
Assess the performance of machine learning models using interval estimates and margin of errors
Skills you'll gain
- Sampling (Statistics)
- Statistical Hypothesis Testing
- Bayesian Statistics
- Statistics
- Statistical Visualization
- Statistical Machine Learning
- Data Science
- Histogram
- Probability Distribution
- Statistical Analysis
- Probability & Statistics
- Probability
- Box Plots
- Statistical Methods
- Exploratory Data Analysis
- Descriptive Statistics
- Statistical Inference
- Correlation Analysis
- A/B Testing
Details to know
8 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
Newly updated for 2024! Mathematics for Machine Learning and Data Science is a foundational online program created by DeepLearning.AI and taught by Luis Serrano. In machine learning, you apply math concepts through programming. And so, in this specialization, youβll apply the math concepts you learn using Python programming in hands-on lab exercises. As a learner in this program, you'll need basic to intermediate Python programming skills to be successful.
After completing this course, you will be able to: β’ Describe and quantify the uncertainty inherent in predictions made by machine learning models, using the concepts of probability, random variables, and probability distributions. β’ Visually and intuitively understand the properties of commonly used probability distributions in machine learning and data science like Bernoulli, Binomial, and Gaussian distributions β’ Apply common statistical methods like maximum likelihood estimation (MLE) and maximum a priori estimation (MAP) to machine learning problems β’ Assess the performance of machine learning models using interval estimates and margin of errors β’ Apply concepts of statistical hypothesis testing to commonly used tests in data science like AB testing β’ Perform Exploratory Data Analysis on a dataset to find, validate, and quantify patterns. Many machine learning engineers and data scientists need help with mathematics, and even experienced practitioners can feel held back by a lack of math skills. This Specialization uses innovative pedagogy in mathematics to help you learn quickly and intuitively, with courses that use easy-to-follow visualizations to help you see how the math behind machine learning actually works. We recommend you have a high school level of mathematics (functions, basic algebra) and familiarity with programming (data structures, loops, functions, conditional statements, debugging). Assignments and labs are written in Python but the course introduces all the machine learning libraries youβll use.
In this week, you will learn about probability of events and various rules of probability to correctly do arithmetic with probabilities. You will learn the concept of conditional probability and the key idea behind Bayes theorem. In lesson 2, we generalize the concept of probability of events to probability distribution over random variables. You will learn about some common probability distributions like the Binomial distribution and the Normal distribution.
What's included
30 videos9 readings2 assignments1 programming assignment4 ungraded labs
30 videosβ’Total 151 minutes
- Course Introductionβ’5 minutes
- A note on programming experienceβ’1 minute
- What is Probability?β’6 minutes
- What is Probability? - Dice Exampleβ’1 minute
- Complement of Probabilityβ’4 minutes
- Sum of Probabilities (Disjoint Events)β’5 minutes
- Sum of Probabilities (Joint Events)β’8 minutes
- Independenceβ’7 minutes
- Birthday problemβ’5 minutes
- Conditional Probability - Part 1β’8 minutes
- Conditional Probability - Part 2β’7 minutes
- Bayes Theorem - Intuitionβ’6 minutes
- Bayes Theorem - Mathematical Formulaβ’6 minutes
- Bayes Theorem - Spam exampleβ’5 minutes
- Bayes Theorem - Prior and Posteriorβ’3 minutes
- Bayes Theorem - The Naive Bayes Modelβ’5 minutes
- Probability in Machine Learningβ’6 minutes
- Random Variablesβ’7 minutes
- Probability Distributions (Discrete)β’4 minutes
- Binomial Distributionβ’6 minutes
- (Optional) Binomial Coefficientβ’5 minutes
- Bernoulli Distributionβ’1 minute
- Probability Distributions (Continuous)β’5 minutes
- Probability Density Functionβ’6 minutes
- Cumulative Distribution Functionβ’8 minutes
- Uniform Distributionβ’5 minutes
- Normal Distributionβ’8 minutes
- (Optional) Chi-Squared Distributionβ’3 minutes
- Sampling from a Distributionβ’5 minutes
- Week 1 - Conclusionβ’0 minutes
9 readingsβ’Total 92 minutes
- Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!β’2 minutes
- Check your knowledgeβ’10 minutes
- Learning Python: Recommended Resourcesβ’10 minutes
- Interactive Tool: Repeated Experimentsβ’15 minutes
- Interactive Tool: Relationship between PMF/PDF and CDF of some distributionsβ’15 minutes
- (Optional) Common Coursera Labs Operationsβ’10 minutes
- (Optional) Assignment Troubleshooting Tipsβ’10 minutes
- (Optional) Partial Grading for Assignmentsβ’10 minutes
- Week 1 - Slidesβ’10 minutes
2 assignmentsβ’Total 45 minutes
- Week 1 - Summative quizβ’30 minutes
- Week 1 - Practice Quizβ’15 minutes
1 programming assignmentβ’Total 240 minutes
- Naive Bayesβ’240 minutes
4 ungraded labsβ’Total 240 minutes
- Four Birthday Problemsβ’60 minutes
- Monty Hall Problemβ’60 minutes
- Exploratory Data Analysis - Intro to Pandasβ’60 minutes
- Exploratory Data Analysis - Exploring Your Dataβ’60 minutes
This week you will learn about different measures to describe probability distributions as well as any dataset. These include measures of central tendency (mean, median, and mode), variance, skewness, and kurtosis. The concept of the expected value of a random variable is introduced to help you understand each of these measures. You will also learn about some visual tools to describe data and distributions. In lesson 2, you will learn about the probability distribution of two or more random variables using concepts like joint distribution, marginal distribution, and conditional distribution. You will end the week by learning about covariance: a generalization of variance to two or more random variables.
What's included
27 videos2 readings2 assignments1 programming assignment3 ungraded labs
27 videosβ’Total 141 minutes
- Expected Valueβ’11 minutes
- Other measures of central tendency: median and modeβ’6 minutes
- Expected value of a Functionβ’4 minutes
- Sum of expectationsβ’7 minutes
- Varianceβ’11 minutes
- Standard Deviationβ’4 minutes
- Sum of Gaussiansβ’3 minutes
- Standardizing a Distributionβ’4 minutes
- Skewness and Kurtosis: Moments of a Distributionβ’2 minutes
- Skewness and Kurtosis - Skewnessβ’8 minutes
- Skewness and Kurtosis - Kurtosisβ’7 minutes
- Quantiles and Box-Plotsβ’3 minutes
- Visualizing data: Box-Plotsβ’3 minutes
- Visualizing data: Kernel density estimationβ’2 minutes
- Visualizing data: Violin Plotsβ’1 minute
- Visualizing data: QQ plotsβ’2 minutes
- Joint Distribution (Discrete) - Part 1β’5 minutes
- Joint Distribution (Discrete) - Part 2β’5 minutes
- Joint Distribution (Continuous)β’5 minutes
- Marginal and Conditional Distributionβ’7 minutes
- Conditional Distributionβ’5 minutes
- Covariance of a Datasetβ’10 minutes
- Covariance of a Probability Distributionβ’11 minutes
- Covariance Matrixβ’2 minutes
- Correlation Coefficientβ’5 minutes
- Multivariate Gaussian Distributionβ’6 minutes
- Week 2 - Conclusionβ’0 minutes
2 readingsβ’Total 25 minutes
- Interactive Tool: Mean, median and standard deviationβ’15 minutes
- Week 2 - Slidesβ’10 minutes
2 assignmentsβ’Total 60 minutes
- Week 2 - Summative Quizβ’30 minutes
- Week 2 - Practice Quizβ’30 minutes
1 programming assignmentβ’Total 100 minutes
- Loaded Diceβ’100 minutes
3 ungraded labsβ’Total 180 minutes
- Summary statistics and visualization of data setsβ’60 minutes
- Exploratory Data Analysis - Data Visualization and Summary Statisticsβ’60 minutes
- Simulating Dice Rolls with Numpy (helper for the assignment, not necessary and not graded)β’60 minutes
This week shifts its focus from probability to statistics. You will start by learning the concept of a sample and a population and two fundamental results from statistics that concern samples and population: the law of large numbers and the central limit theorem. In lesson 2, you will learn the first and the simplest method of estimation in statistics: point estimation. You will see how maximum likelihood estimation, the most common point estimation method, works and how regularization helps prevent overfitting. You'll then learn how Bayesian Statistics incorporates the concept of prior beliefs into the way data is evaluated and conclusions are reached.
What's included
20 videos3 readings2 assignments2 ungraded labs
20 videosβ’Total 99 minutes
- Population and Sampleβ’6 minutes
- Sample Meanβ’3 minutes
- Sample Proportionβ’2 minutes
- Sample Varianceβ’11 minutes
- Law of Large Numbersβ’4 minutes
- Central Limit Theorem - Discrete Random Variableβ’3 minutes
- Central Limit Theorem - Continuous Random Variableβ’8 minutes
- Point Estimationβ’1 minute
- Maximum Likelihood Estimation: Motivation β’3 minutes
- MLE: Bernoulli Exampleβ’5 minutes
- MLE: Gaussian Exampleβ’6 minutes
- MLE: Linear Regressionβ’6 minutes
- Regularizationβ’3 minutes
- Back to "Bayesics"β’3 minutes
- Bayesian Statistics - Frequentist vs. Bayesianβ’3 minutes
- Bayesian Statistics - MAPβ’5 minutes
- Bayesian Statistics - Updating Priorsβ’9 minutes
- Bayesian Statistics - Full Worked Exampleβ’11 minutes
- Relationship between MAP, MLE and Regularizationβ’6 minutes
- Week 3 - Conclusionβ’0 minutes
3 readingsβ’Total 35 minutes
- MLE for Gaussian populationβ’10 minutes
- Interactive Tool: Likelihood Functionsβ’15 minutes
- Week 3 - Slidesβ’10 minutes
2 assignmentsβ’Total 90 minutes
- Week 3 - Summative Quizβ’60 minutes
- Week 3 - Practice Quizβ’30 minutes
2 ungraded labsβ’Total 120 minutes
- Sampling data from different distribution and studying the distribution of sample meanβ’60 minutes
- Exploratory Data Analysis - Linear Regressionβ’60 minutes
This week you will learn another estimation method called interval estimation. The most common interval estimates are confidence intervals and you will see how they are calculated and how to correctly interpret them. In lesson 2, you will learn about hypothesis testing where estimates are formulated as a hypothesis and then tested in the presence of available evidence or a sample of data. You will learn the concept of p-value that helps in making a decision about a hypothesis test and also learn some common tests like the t-test, two-sample t-test, and the paired t-test. You will end the week with an interesting application of hypothesis testing in data science: A/B testing.
What's included
22 videos8 readings2 assignments1 programming assignment1 ungraded lab
22 videosβ’Total 112 minutes
- Confidence Intervals - Overviewβ’10 minutes
- Confidence Intervals - Changing the Intervalβ’9 minutes
- Confidence Intervals - Margin of Errorβ’10 minutes
- Confidence Intervals - Calculation Stepsβ’1 minute
- Confidence Intervals - Exampleβ’2 minutes
- Calculating Sample Sizeβ’3 minutes
- Difference Between Confidence and Probabilityβ’2 minutes
- Unknown Standard Deviationβ’5 minutes
- Confidence Intervals for Proportionβ’4 minutes
- Defining Hypothesesβ’3 minutes
- Type I and Type II errorsβ’5 minutes
- Right-Tailed, Left-Tailed, and Two-Tailed Testsβ’8 minutes
- p-Valueβ’8 minutes
- Critical Valuesβ’5 minutes
- Power of a Testβ’6 minutes
- Interpreting Resultsβ’3 minutes
- t-Distributionβ’4 minutes
- t-Testsβ’4 minutes
- Two Sample t-Testβ’6 minutes
- Paired t-Testβ’5 minutes
- ML Application: A/B Testingβ’9 minutes
- Week 4 - Conclusionβ’0 minutes
8 readingsβ’Total 72 minutes
- Interactive Tool: Confidence Intervalsβ’15 minutes
- Test for proportions β’10 minutes
- Two sample test for proportionsβ’10 minutes
- [IMPORTANT] Reminder about end of access to Lab Notebooksβ’2 minutes
- Week 4 - Slidesβ’10 minutes
- Acknowledgmentsβ’10 minutes
- (Optional) Opportunity to Mentor Other Learnersβ’5 minutes
- Referencesβ’10 minutes
2 assignmentsβ’Total 45 minutes
- Week 4 - Summative Quizβ’30 minutes
- Week 4 - Practice Quizβ’15 minutes
1 programming assignmentβ’Total 100 minutes
- A/B Testingβ’100 minutes
1 ungraded labβ’Total 60 minutes
- Exploratory Data Analysis - Confidence Intervals and Hypothesis Testingβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Algorithms
- Status: Free TrialD
DeepLearning.AI
Specialization
- Status: Free TrialD
DeepLearning.AI
Course
- Status: Free TrialD
DeepLearning.AI
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
76.92%
- 4 stars
14.51%
- 3 stars
4.06%
- 2 stars
1.74%
- 1 star
2.75%
Showing 3 of 689
Reviewed on Nov 12, 2023
Very good course! Highly recommended to those who are just starting to learn mathematics for machine learning
Reviewed on May 21, 2025
It was very helpful course. It starts from the bare minimum but gradually you get to the point where you find yourself in Statistopia ???. Big applaud and thanks to Luis and also DeepLearning.AI
Reviewed on Jun 17, 2024
Very thorough and easy to comprehend approach to learning statistical and probability theory which is important foundational knowledge, not just in ML but any field of data analytics!
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
