Statistics and Data Analysis with R
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Statistics and Data Analysis with R
This course is part of Statistics and Applied Data Analysis Specialization
Instructor: Charlie Nuttelman
1,610 already enrolled
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Use statistical functions in RStudio to solve problems related to discrete and continuous probability distributions.
Create simple linear, polynomial, and multilinear regression models in RStudio and use those models to make predictions.
Perform one-sample and two-sample hypothesis tests and create confidence and prediction intervals on various statistics.
Skills you'll gain
- Statistical Modeling
- Probability & Statistics
- Statistics
- Statistical Inference
- Data Analysis
- Tidyverse (R Package)
- Statistical Hypothesis Testing
- Statistical Programming
- Data Import/Export
- Data Wrangling
- Descriptive Statistics
- Data Manipulation
- Plot (Graphics)
- Regression Analysis
- Statistical Analysis
- Probability Distribution
- Descriptive Analytics
- Statistical Methods
Tools you'll learn
Details to know
6 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 6 modules in this course
This course is the third course in a 3-part specialization entitled "Statistics and Applied Data Analysis." The course is meant for those familiar with statistics but unfamiliar with the programming language R.
The purpose of this course is to teach learners how to use the popular open-source (and thus, free) integrated development environment RStudio to perform basic and complex statistical calculations. After an introduction to basic calculations, vector, matrices, data frames, and how to import data from common file types (.xlsx, .csv, .txt), learners are taught how to solve probability and counting problems in R, followed by discrete and continuous probability distribution calculations, one-sample hypothesis tests, and two-sample hypothesis tests (comparisons). Finally, participants will learn how to create regression models in R and perform analysis of variance (ANOVA). One of the most beneficial aspect of the course are the programming assignments, which are completed online in the R programming language in Jupyter notebooks.
Welcome to "Statistics and Data Analysis with R"! In this week, you will be introduced to R and RStudio and will learn how to install and navigate RStudio. You will then learn how to perform basic calculations, use script files, create and work with vectors and matrices, and install and load add-on packages. Finally, you will learn all about data frames and tibbles, how to import data from external files (.xlsx, .csv, and .txt files), and how to work with built-in and user-defined functions. When you are ready, you must pass the Week 1 Graded Quiz in order to access the Week 2 Starter Files and Cheat Sheet. You will need access to these items in order to complete Module 2. You must also pass Assignment 1, which counts towards the final grade in the course.
What's included
14 videos5 readings1 assignment1 programming assignment2 discussion prompts
14 videosβ’Total 93 minutes
- Welcome to the Course!β’4 minutes
- How the Course Worksβ’3 minutes
- Introduction to R and RStudioβ’4 minutes
- Basic Calculations in Rβ’9 minutes
- Using Script Filesβ’8 minutes
- Vectors and Matrices (Part 1)β’8 minutes
- Vectors and Matrices (Part 2)β’11 minutes
- How to Install and Load Packagesβ’6 minutes
- Data Frames and Tibblesβ’10 minutes
- Additional Data Frame Examplesβ’6 minutes
- Importing Data Into RStudioβ’10 minutes
- How to Use Built-In Functionsβ’6 minutes
- User-Defined Functionsβ’5 minutes
- How Programming Assignments Workβ’3 minutes
5 readingsβ’Total 36 minutes
- Course Updates and Accessibility Supportβ’1 minute
- The Importance of a Course Certificate and the Future of Higher Educationβ’10 minutes
- Week 1 Starter Files and Cheat Sheetβ’10 minutes
- Installation Linksβ’5 minutes
- Week 2 Starter Files and Cheat Sheetβ’10 minutes
1 assignmentβ’Total 30 minutes
- Week 1 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 60 minutes
- Assignment 1β’60 minutes
2 discussion promptsβ’Total 20 minutes
- What About You?β’10 minutes
- (OPTIONAL) Week 1 Discussionβ’10 minutes
In Week 2, you'll learn how to calculate common descriptive statistics in R, how to calculate conditional statistics, and how to present data in a graphical manner (scatter plots, column plots, and pie plots). You'll also learn how to create boxplots and probability plots in R and how to analyze the normality of the data using the Anderson-Darling statistic. Week 2 has 9 screencasts with many in-video questions to test your understanding of the material and help you learn. The week ends with a hands-on Assignment 2, which you will complete in a Jupyter notebook in the programming language R and that counts towards your final grade in the course. When you are ready, you must pass the Week 2 Graded Quiz in order to access the Week 3 Starter Files and Cheat Sheet. You will need access to these items in order to complete Module 3. Best of luck to you this week! As always, if you have questions or issues, please initiate a discussion thread and either myself or someone else will chime in with some help.
What's included
9 videos1 reading1 assignment1 programming assignment1 discussion prompt
9 videosβ’Total 69 minutes
- Descriptive Statistics (Part 1)β’8 minutes
- Descriptive Statistics (Part 2)β’10 minutes
- Conditional Statisticsβ’5 minutes
- Scatter Plots (Part 1)β’10 minutes
- Scatter Plots (Part 2)β’7 minutes
- Histogramsβ’6 minutes
- Column and Pie Plotsβ’7 minutes
- Box Plotsβ’8 minutes
- Probability Plots and the AD Statisticβ’9 minutes
1 readingβ’Total 10 minutes
- Week 3 Starter Files and Cheat Sheetβ’10 minutes
1 assignmentβ’Total 30 minutes
- Week 2 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 60 minutes
- Assignment 2β’60 minutes
1 discussion promptβ’Total 10 minutes
- (OPTIONAL) Week 2 Discussionβ’10 minutes
In Week 3, you'll learn all about probability and counting rules in R, including how to calculate combinations and permutations, how to calculate probabilities associated with common discrete probability distributions (binomial, geometric, negative binomial, hypergeometric, Poisson distributions), and how to calculate probabilities associated with common continuous probability distributions (uniform, normal, T, chi-squared, and F distributions) in R. You will also perform inverse normal distribution calculations and their associated z-values (standardization). Week 3 has 14 screencasts with many in-video questions to test your understanding of the material and help you learn. The week ends with Assignment 3 in which you will perform several calculations in a Jupyter notebook. Assignment 3 counts towards your final grade in the course. When you are ready, you must pass the Week 3 Graded Quiz in order to access the Week 4 Starter Files and Cheat Sheet. You will need access to these items in order to complete Module 4. Best of luck to you this week! As always, if you have questions or issues, please initiate a discussion thread and either myself or someone else will chime in with some help.
What's included
16 videos1 reading1 assignment1 programming assignment1 discussion prompt
16 videosβ’Total 129 minutes
- Permutations and Combinationsβ’11 minutes
- The Binomial Distributionβ’8 minutes
- The Geometric Distributionβ’8 minutes
- The Negative Binomial Distributionβ’7 minutes
- The Hypergeometric Distributionβ’6 minutes
- The Poisson Distributionβ’8 minutes
- The Multinomial Distributionβ’7 minutes
- The Uniform Distributionβ’9 minutes
- The Normal Distributionβ’7 minutes
- Inverse Normal Distribution Calculationsβ’8 minutes
- Standardizing and Z-Valuesβ’11 minutes
- (OPTIONAL REVIEW) Variance Known or Unknown?β’5 minutes
- (OPTIONAL REVIEW) Sampling Distribution vs. Population Distribution β’9 minutes
- The T Distributionβ’11 minutes
- The Chi-Squared Distributionβ’11 minutes
- The F Distributionβ’5 minutes
1 readingβ’Total 10 minutes
- Week 4 Starter Files and Cheat Sheetβ’10 minutes
1 assignmentβ’Total 30 minutes
- Week 3 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 60 minutes
- Assignment 3β’60 minutes
1 discussion promptβ’Total 10 minutes
- (OPTIONAL) Week 3 Discussionβ’10 minutes
In Week 4, you'll learn all about how to calculate one-sample statistics in R. You will begin the week by learning how to calculate confidence and prediction intervals on the mean, variance, and binomial proportion. Then, you will learn how to perform hypothesis tests on the mean, variance, and a binomial proportion. You will also learn how to calculate the power and probability of a type II error in R, which is related to sample size considerations, which you will also explore. Week 4 has 10 screencasts with many in-video questions to test your understanding of the material and help you learn. I encourage you to download and make use of the Week 4 Cheat Sheet (for those who purchase a Course Certificate) as this will help distill the challenging concepts and R functions that are found in this week's material. Week 4 concludes with Assignment 4, which you will complete in the R programming language in a Jupyter notebook and that counts towards your final grade in the course. When you are ready, you must pass the Week 4 Graded Quiz in order to access the Week 5 Starter Files and Cheat Sheet. You will need access to these items in order to complete Module 5. Quiz 4 requires you to perform statistical calculations in R, so be sure to prepare accordingly.
What's included
12 videos1 reading1 assignment1 programming assignment1 discussion prompt
12 videosβ’Total 111 minutes
- Confidence Interval on the Mean, Variance Knownβ’12 minutes
- Confidence Interval on the Mean, Variance Unknownβ’10 minutes
- Prediction Interval on a Future Observationβ’10 minutes
- Hypothesis Tests on the Mean, Variance Knownβ’11 minutes
- Hypothesis Tests on the Mean, Variance Unknownβ’10 minutes
- (OPTIONAL REVIEW) Type I and Type II Errorsβ’9 minutes
- (OPTIONAL REVIEW) Power of the Testβ’4 minutes
- Type II Error and Power of the Testβ’10 minutes
- Choice of Sample Sizeβ’10 minutes
- Confidence Interval on the Varianceβ’8 minutes
- Hypothesis Tests on the Varianceβ’11 minutes
- Hypothesis Tests on a Binomial Proportionβ’5 minutes
1 readingβ’Total 10 minutes
- Week 5 Starter Files and Cheat Sheetβ’10 minutes
1 assignmentβ’Total 30 minutes
- Week 4 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 4,560 minutes
- Assignment 4β’4,560 minutes
1 discussion promptβ’Total 10 minutes
- (OPTIONAL) Week 4 Discussionβ’10 minutes
In Week 5, you'll learn all about two-sample comparisons. You will calculate confidence intervals related to and hypothesis tests involving the comparison of means, comparison of variances, and comparison of binomial proportions. The type of test that is performed depends on whether variance is known or unknown, which you will also explore. Week 5 has 7 screencasts with many in-video questions to test your understanding of the material and help you learn. The week concludes with Assignment 5. When you are ready, you must pass Quiz 5 in order to continue in the course. You will also want to pay close attention to the Week 5 Cheat Sheet (available to learners who purchase a Course Certificate) as this will serve as a great reference for Assignment 5 and Quiz 5. When you are ready, you must pass the Week 5 Graded Quiz in order to access the Week 6 Starter Files and Cheat Sheet. You will need access to these items in order to complete Module 6. Quiz 5 requires you to perform statistical calculations in R, so be sure to prepare accordingly.
What's included
7 videos1 reading1 assignment1 programming assignment1 discussion prompt
7 videosβ’Total 50 minutes
- Comparison of Means, Variance Known (Part 1)β’6 minutes
- Comparison of Means, Variance Known (Part 2)β’9 minutes
- Comparison of Variances (F-Test), Part 1β’5 minutes
- Comparison of Variances (F-Test), Part 2β’7 minutes
- Comparison of Means, Variance Unknownβ’9 minutes
- Paired T Testsβ’8 minutes
- Comparison of Binomial Proportionsβ’7 minutes
1 readingβ’Total 10 minutes
- Week 6 Starter Files and Cheat Sheetβ’10 minutes
1 assignmentβ’Total 30 minutes
- Week 5 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 60 minutes
- Assignment 5β’60 minutes
1 discussion promptβ’Total 10 minutes
- (OPTIONAL) Week 5 Discussionβ’10 minutes
In Week 6, you'll learn all about creating simple linear, polynomial, and multilinear regression models, which basically are mathematical relationships between input variables (regressor variables) and an output variable (response). You will learn how to calculate confidence intervals on and perform hypothesis tests on model parameters and you will learn how to select the best possible regression model from several candidate models using backward elimination. Finally, you will learn how to perform analysis of variance (ANOVA) when you have more than two groups to compare. Week 6 has 9 screencasts with many in-video questions to test your understanding of the material and help you learn. The week concludes with Assignment 6. When you are ready, you must pass Quiz 6 in order to continue in the course. You will also want to pay close attention to the Week 6 Cheat Sheet (available to learners who purchase a Course Certificate) as this will serve as a great reference for Assignment 6 and Quiz 6. Quiz 6 requires you to perform statistical calculations in R, so be sure to prepare accordingly. Once you've completed Week 6, you'll be done with the course!
What's included
9 videos1 assignment1 programming assignment1 discussion prompt
9 videosβ’Total 91 minutes
- Simple Linear Regression (Part 1)β’13 minutes
- Simple Linear Regression (Part 2)β’13 minutes
- Residual Analysis (Part 1)β’8 minutes
- Residual Analysis (Part 2)β’7 minutes
- Polynomial Regressionβ’12 minutes
- Multilinear Regressionβ’8 minutes
- Model Building Techniques (Part 1)β’9 minutes
- Model Building Techniques (Part 2)β’9 minutes
- Analysis of Varianceβ’11 minutes
1 assignmentβ’Total 30 minutes
- Week 6 Graded Quizβ’30 minutes
1 programming assignmentβ’Total 60 minutes
- Assignment 6β’60 minutes
1 discussion promptβ’Total 10 minutes
- (OPTIONAL) Week 6 Discussionβ’10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Probability and Statistics
- Status: PreviewO
O.P. Jindal Global University
Course
- Status: Free Trial
- Status: Free Trial
Course
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
