Statistical Inference and Hypothesis Testing in Data Science Applications

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

👁 University of Colorado Boulder

Statistical Inference and Hypothesis Testing in Data Science Applications

This course is part of Data Science Foundations: Statistical Inference Specialization

👁 Jem Corcoran

Instructor: Jem Corcoran

8,303 already enrolled

Included with

•

Learn more

6 modules

Gain insight into a topic and learn the fundamentals.

4.6

55 reviews

Intermediate level

Recommended experience

Flexible schedule

4 weeks at 10 hours a week

Learn at your own pace

6 modules

Gain insight into a topic and learn the fundamentals.

4.6

55 reviews

Intermediate level

Recommended experience

Flexible schedule

4 weeks at 10 hours a week

Learn at your own pace

What you'll learn

Define a composite hypothesis and the level of significance for a test with a composite null hypothesis.
Define a test statistic, level of significance, and the rejection region for a hypothesis test. Give the form of a rejection region.
Perform tests concerning a true population variance.
Compute the sampling distributions for the sample mean and sample minimum of the exponential distribution.

Skills you'll gain

Details to know

👁 Image

Shareable certificate

Add to your LinkedIn profile

Assessments

6 assignments

Taught in English

94%

Most learners liked this course

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

👁 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Data Science Foundations: Statistical Inference Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

👁 Image

There are 6 modules in this course

This course will focus on theory and implementation of hypothesis testing, especially as it relates to applications in data science. Students will learn to use hypothesis tests to make informed decisions from data. Special attention will be given to the general logic of hypothesis testing, error and error rates, power, simulation, and the correct computation and interpretation of p-values. Attention will also be given to the misuse of testing concepts, especially p-values, and the ethical implications of such misuse.

This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others. With performance-based admissions and no application process, the MS-DS is ideal for individuals with a broad range of undergraduate education and/or professional experience in computer science, information science, mathematics, and statistics. Learn more about the MS-DS program at https://www.coursera.org/degrees/master-of-science-data-science-boulder.

Welcome to the course! This module contains logistical information to get you started!

What's included

4 readings1 discussion prompt1 ungraded lab

4 readings•Total 31 minutes

Course Updates and Accessibility Support•1 minute
Earn Academic Credit for your Work!•10 minutes
Course Support•10 minutes
Course Resources•10 minutes

1 discussion prompt•Total 10 minutes

Introduce Yourself•10 minutes

1 ungraded lab•Total 60 minutes

Introduction to Jupyter Notebooks and R•60 minutes

In this module, we will define a hypothesis test and develop the intuition behind designing a test. We will learn the language of hypothesis testing, which includes definitions of a null hypothesis, an alternative hypothesis, and the level of significance of a test. We will walk through a very simple test.

What's included

6 videos11 readings1 assignment1 programming assignment2 ungraded labs

6 videos•Total 70 minutes

What is Hypothesis Testing?•4 minutes
Types of Hypotheses•15 minutes
Normal Computations•24 minutes
Errors in Hypothesis Testing•8 minutes
Test Statistics and Significance•15 minutes
A First Test•5 minutes

11 readings•Total 105 minutes

What is Hypothesis Testing?•5 minutes
Types of Hypotheses•10 minutes
Video Slides for Types of Hypotheses•10 minutes
Normal Computations•10 minutes
Video Slides for Normal Computations•10 minutes
Errors in Hypothesis Testing•10 minutes
Video Slides for Errors in Hypothesis Testing•10 minutes
Test Statistics and Significance•10 minutes
Video Slides for Test Statistics and Level of Significance•10 minutes
A First Test•10 minutes
Video Slides for A First Test•10 minutes

1 assignment•Total 30 minutes

Introduction to Hypothesis Testing•30 minutes

1 programming assignment•Total 180 minutes

Intro to Hypothesis Testing Lab•180 minutes

2 ungraded labs•Total 120 minutes

An Introduction to R and Jupyter Notebooks•60 minutes
Visualizing Errors in Hypothesis Testing•60 minutes

In this module, we will expand the lessons of Module 1 to composite hypotheses for both one and two-tailed tests. We will define the “power function” for a test and discuss its interpretation and how it can lead to the idea of a “uniformly most powerful” test. We will discuss and interpret “p-values” as an alternate approach to hypothesis testing.

What's included

7 videos7 readings1 assignment1 programming assignment1 ungraded lab

7 videos•Total 125 minutes

Composite Hypotheses and Level of Significance•16 minutes
One-Tailed Tests•20 minutes
Power Functions•13 minutes
Hypothesis Testing with P-Values•22 minutes
Two Tailed Tests•13 minutes
CLT: A Brief Review•17 minutes
Hypothesis Tests for Proportions•24 minutes

7 readings•Total 70 minutes

Video Slides for Composite Hypotheses and Level of Significance•10 minutes
Video Slides for One-Tailed Tests•10 minutes
Video Slides for Power Functions•10 minutes
Video Slides for Hypothesis Testing with P-Values•10 minutes
Video Slides for Two-Tailed Tests•10 minutes
Video Slides for CLT: A Brief Review•10 minutes
Video Slides for Hypothesis Tests for Proportions•10 minutes

1 assignment•Total 30 minutes

Constructing Tests•30 minutes

1 programming assignment•Total 180 minutes

The Basics of Hypothesis Testing•180 minutes

1 ungraded lab•Total 60 minutes

Distributions of P-Values•60 minutes

In this module, we will learn about the chi-squared and t distributions and their relationships to sampling distributions. We will learn to identify when hypothesis tests based on these distributions are appropriate. We will review the concept of sample variance and derive the “t-test”. Additionally, we will derive our first two-sample test and apply it to make some decisions about real data.

What's included

7 videos7 readings1 assignment1 programming assignment1 ungraded lab

7 videos•Total 140 minutes

The t and Chi-Squared Distributions•41 minutes
The Sample Variance for the Normal Distribution•24 minutes
t-Tests•19 minutes
Two Sample Tests for Means•15 minutes
Two Sample t-Tests for a Difference of Means•18 minutes
Welch's t-Test and Paired Data•14 minutes
Comparing Population Proportions•9 minutes

7 readings•Total 70 minutes

Video Slides for the t and Chi-Squared Distributions•10 minutes
Video Slides for the Sample Variance and the Normal Distribution•10 minutes
Video Slides for t-Tests•10 minutes
Video Slides for Two Sample Tests for Means•10 minutes
Video Slides for Differences in Population Means•10 minutes
Video Slides for Welch's Test and Paired Data•10 minutes
Video Slides for Comparing Population Proportions•10 minutes

1 assignment•Total 30 minutes

More Hypothesis Tests!•30 minutes

1 programming assignment•Total 180 minutes

t-Tests•180 minutes

1 ungraded lab•Total 60 minutes

t-Tests and Two Sample Tests•60 minutes

In this module, we will consider some problems where the assumption of an underlying normal distribution is not appropriate and will expand our ability to construct hypothesis tests for this case. We will define the concept of a “uniformly most powerful” (UMP) test, whether or not such a test exists for specific problems, and we will revisit some of our earlier tests from Modules 1 and 2 through the UMP lens. We will also introduce the F-distribution and its role in testing whether or not two population variances are equal.

What's included

6 videos6 readings2 assignments

6 videos•Total 118 minutes

Properties of the Exponential Distribution•13 minutes
Two Tests•28 minutes
Best Tests•23 minutes
UMP Tests•10 minutes
A Test for the Variance of the Normal Distribution•12 minutes
The F-Distribution and a Ratio of Variances•31 minutes

6 readings•Total 60 minutes

Video Slides for Properties of the Exponential Distribution•10 minutes
Video Slides for Two Hypothesis Tests for the Exponential•10 minutes
Video Slides for Best Tests•10 minutes
Video Slides for UMP Tests•10 minutes
Video Slides for a Normal Variance Test•10 minutes
Video Slides for an F-Distribution and a Ratio of Variances•10 minutes

2 assignments•Total 60 minutes

Best Tests and Some General Skills•30 minutes
Uniformly Most Powerful Tests and F-Tests•30 minutes

In this module, we develop a formal approach to hypothesis testing, based on a “likelihood ratio” that can be more generally applied than any of the tests we have discussed so far. We will pay special attention to the large sample properties of the likelihood ratio, especially Wilks’ Theorem, that will allow us to come up with approximate (but easy) tests when we have a large sample size. We will close the course with two chi-squared tests that can be used to test whether the distributional assumptions we have been making throughout this course are valid.

What's included

5 videos5 readings1 assignment1 programming assignment1 ungraded lab

5 videos•Total 93 minutes

MLEs•23 minutes
The GRLT•15 minutes
Wilks' Theorem•12 minutes
Chi-Squared Goodness of Fit Test•23 minutes
Independence and Homogeneity•19 minutes

5 readings•Total 50 minutes

Video Slides for MLEs•10 minutes
Video Slides for the GLRT•10 minutes
Video Slides for Wilks' Theorem•10 minutes
Video Slides for Chi-Squared Goodness of Fit Test•10 minutes
Video Slides for Independence and Homogeneity•10 minutes

1 assignment•Total 30 minutes

Adventures in GLRTs•30 minutes

1 programming assignment•Total 180 minutes

Chi-Squared Tests and Mo•180 minutes

1 ungraded lab•Total 60 minutes

Exploring Wilks' Theorem•60 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹

Instructor

Instructor ratings

4.9 (14 ratings)

👁 Jem Corcoran

Jem Corcoran

University of Colorado Boulder

7 Courses•46,424 learners

Offered by

👁 Image

University of Colorado Boulder

Explore more from Probability and Statistics

👁 Image
U
University of Colorado Boulder
Introduction to Bayesian Statistics for Data Science
Course
👁 Image
Status: Free Trial
U
University of Colorado Boulder
Data Science Foundations: Statistical Inference
Specialization
👁 Image
Status: Free Trial
C
Coursera
Run Inference & Hypothesis Tests
Course
👁 Image
Status: Free Trial
U
University of Colorado Boulder
Data Science as a Field
Course

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

👁 Image

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

👁 Image

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
78.18%
4 stars
12.72%
3 stars
3.63%
2 stars
0%
1 star
5.45%

Showing 3 of 55

Reviewed on Jul 6, 2023

coursera classes can be rough and maybe even a little bit buggy it's loaded with good knowlede tho. the professor is great!

Reviewed on Jun 17, 2024

The Teacher is awesome. The course content is also very interesting. A nice trip

Reviewed on Jul 27, 2022

Loved the material. Content looks quite convincing and well explained!

View more reviews

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

URL: https://www.coursera.org/learn/statistical-inference-and-hypothesis-testing-in-data-science-applications