Basic Statistics
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Basic Statistics
This course is part of Methods and Statistics in Social Sciences Specialization
Instructors: Matthijs Rooduijn
324,398 already enrolled
Included with
Ask Coursera
4,670 reviews
4,670 reviews
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 9 modules in this course
Understanding statistics is essential to understand research in the social and behavioral sciences. In this course you will learn the basics of statistics; not just how to calculate them, but also how to evaluate them. This course will also prepare you for the next course in the specialization - the course Inferential Statistics.
In the first part of the course we will discuss methods of descriptive statistics. You will learn what cases and variables are and how you can compute measures of central tendency (mean, median and mode) and dispersion (standard deviation and variance). Next, we discuss how to assess relationships between variables, and we introduce the concepts correlation and regression. The second part of the course is concerned with the basics of probability: calculating probabilities, probability distributions and sampling distributions. You need to know about these things in order to understand how inferential statistics work. The third part of the course consists of an introduction to methods of inferential statistics - methods that help us decide whether the patterns we see in our data are strong enough to draw conclusions about the underlying population we are interested in. We will discuss confidence intervals and significance tests. Normally, you would not only learn about all these statistical concepts, but you would also be trained to calculate and generate these statistics yourself using freely available statistical software. Due to technical issues we are currently unable to do so. We will try to offer this again soon.
In this module we'll consider the basics of statistics. But before we start, we'll give you a broad sense of what the course is about and how it's organized. Are you new to Coursera or still deciding whether this is the course for you? Then make sure to check out the 'Course introduction' and 'What to expect from this course' sections below, so you'll have the essential information you need to decide and to do well in this course! If you have any questions about the course format, deadlines or grading, you'll probably find the answers here. Are you a Coursera veteran and ready to get started? Then you might want to skip ahead to the first course topic: 'Exploring data'. You can always check the general information later. Veterans and newbies alike: Don't forget to introduce yourself in the 'meet and greet' forum!
What's included
1 video11 readings1 assignment
1 video•Total 4 minutes
- Welcome to Basic Statistics!•4 minutes
11 readings•Total 110 minutes
- Hi there!•10 minutes
- How to navigate this course•10 minutes
- How to contribute•10 minutes
- General info - What will I learn in this course?•10 minutes
- Course format - How is this course structured?•10 minutes
- Requirements - What resources do I need?•10 minutes
- Grading - How do I pass this course?•10 minutes
- Team - Who created this course?•10 minutes
- Honor Code - Integrity in this course•10 minutes
- Useful literature and documents•10 minutes
- Research on Feedback•10 minutes
1 assignment•Total 30 minutes
- Use of your data for research•30 minutes
In this first module, we’ll introduce the basic concepts of descriptive statistics. We’ll talk about cases and variables, and we’ll explain how you can order them in a so-called data matrix. We’ll discuss various levels of measurement and we’ll show you how you can present your data by means of tables and graphs. We’ll also introduce measures of central tendency (like mode, median and mean) and dispersion (like range, interquartile range, variance and standard deviation). We’ll not only tell you how to interpret them; we’ll also explain how you can compute them. Finally, we’ll tell you more about z-scores. In this module we’ll only discuss situations in which we analyze one single variable. This is what we call univariate analysis. In the next module we will also introduce studies in which more variables are involved.
What's included
8 videos4 readings1 assignment
8 videos•Total 53 minutes
- 1.01 Cases, variables and levels of measurement•8 minutes
- 1.02 Data matrix and frequency table•6 minutes
- 1.03 Graphs and shapes of distributions•7 minutes
- 1.04 Mode, median and mean•7 minutes
- 1.05 Range, interquartile range and box plot•8 minutes
- 1.06 Variance and standard deviation•5 minutes
- 1.07 Z-scores•5 minutes
- 1.08 Example•7 minutes
4 readings•Total 40 minutes
- Data and visualisation•10 minutes
- Measures of central tendency and dispersion•10 minutes
- Z-scores and example•10 minutes
- Transcripts - Exploring data•10 minutes
1 assignment•Total 30 minutes
- Exploring Data•30 minutes
In this second module we’ll look at bivariate analyses: studies with two variables. First we’ll introduce the concept of correlation. We’ll investigate contingency tables (when it comes to categorical variables) and scatterplots (regarding quantitative variables). We’ll also learn how to understand and compute one of the most frequently used measures of correlation: Pearson's r. In the next part of the module we’ll introduce the method of OLS regression analysis. We’ll explain how you (or the computer) can find the regression line and how you can describe this line by means of an equation. We’ll show you that you can assess how well the regression line fits your data by means of the so-called r-squared. We conclude the module with a discussion of why you should always be very careful when interpreting the results of a regression analysis.
What's included
8 videos6 readings1 assignment
8 videos•Total 49 minutes
- 2.01 Crosstabs and scatterplots•7 minutes
- 2.02 Pearson's r•7 minutes
- 2.03 Regression - Finding the line•4 minutes
- 2.04 Regression - Describing the line•8 minutes
- 2.05 Regression - How good is the line?•6 minutes
- 2.06 Correlation is not causation•5 minutes
- 2.07 Example contingency table•3 minutes
- 2.08 Example Pearson's r and regression•8 minutes
6 readings•Total 60 minutes
- Correlation•10 minutes
- Regression•10 minutes
- Reference•10 minutes
- Caveats and examples•10 minutes
- Reference•10 minutes
- Transcripts - Correlation and regression•10 minutes
1 assignment•Total 30 minutes
- Correlation and Regression•30 minutes
This module introduces concepts from probability theory and the rules for calculating with probabilities. This is not only useful for answering various kinds of applied statistical questions but also to understand the statistical analyses that will be introduced in subsequent modules. We start by describing randomness, and explain how random events surround us. Next, we provide an intuitive definition of probability through an example and relate this to the concepts of events, sample space and random trials. A graphical tool to understand these concepts is introduced here as well, the tree-diagram.Thereafter a number of concepts from set theory are explained and related to probability calculations. Here the relation is made to tree-diagrams again, as well as contingency tables. We end with a lesson where conditional probabilities, independence and Bayes rule are explained. All in all, this is quite a theoretical module on a topic that is not always easy to grasp. That's why we have included as many intuitive examples as possible.
What's included
11 videos5 readings1 assignment
11 videos•Total 64 minutes
- 3.01 Randomness•5 minutes
- 3.02 Probability•5 minutes
- 3.03 Sample space, event, probability of event and tree diagram•6 minutes
- 3.04 Quantifying probabilities with tree diagram•6 minutes
- 3.05 Basic set-theoretic concepts•6 minutes
- 3.06 Practice with sets•8 minutes
- 3.07 Union•5 minutes
- 3.08 Joint and marginal probabilities•6 minutes
- 3.09 Conditional probability•5 minutes
- 3.10 Independence between random events•5 minutes
- 3.11 More conditional probability, decision trees and Bayes' Law•8 minutes
5 readings•Total 50 minutes
- Probability & randomness•10 minutes
- Sample space, events & tree diagrams•10 minutes
- Probability & sets•10 minutes
- Conditional probability & independence•10 minutes
- Transcripts - Probability•10 minutes
1 assignment•Total 30 minutes
- Probability•30 minutes
Probability distributions form the core of many statistical calculations. They are used as mathematical models to represent some random phenomenon and subsequently answer statistical questions about that phenomenon. This module starts by explaining the basic properties of a probability distribution, highlighting how it quantifies a random variable and also pointing out how it differs between discrete and continuous random variables. Subsequently the cumulative probability distribution is introduced and its properties and usage are explained as well. In a next lecture it is shown how a random variable with its associated probability distribution can be characterized by statistics like a mean and variance, just like observational data. The effects of changing random variables by multiplication or addition on these statistics are explained as well.The lecture thereafter introduces the normal distribution, starting by explaining its functional form and some general properties. Next, the basic usage of the normal distribution to calculate probabilities is explained. And in a final lecture the binomial distribution, an important probability distribution for discrete data, is introduced and further explained. By the end of this module you have covered quite some ground and have a solid basis to answer the most frequently encountered statistical questions. Importantly, the fundamental knowledge about probability distributions that is presented here will also provide a solid basis to learn about inferential statistics in the next modules.
What's included
8 videos5 readings1 assignment
8 videos•Total 52 minutes
- 4.01 Random variables and probability distributions•7 minutes
- 4.02 Cumulative probability distributions•5 minutes
- 4.03 The mean of a random variable•5 minutes
- 4.04 Variance of a random variable•7 minutes
- 4.05 Functional form of the normal distribution•6 minutes
- 4.06 The normal distribution: probability calculations•5 minutes
- 4.07 The standard normal distribution•9 minutes
- 4.08 The binomial distribution•9 minutes
5 readings•Total 50 minutes
- Probability distributions•10 minutes
- Mean and variance of a random variable•10 minutes
- The normal distribution•10 minutes
- The binomial distribution•10 minutes
- Transcripts - Probability distributions•10 minutes
1 assignment•Total 30 minutes
- Probability distributions•30 minutes
Methods for summarizing sample data are called descriptive statistics. However, in most studies we’re not interested in samples, but in underlying populations. If we employ data obtained from a sample to draw conclusions about a wider population, we are using methods of inferential statistics. It is therefore of essential importance that you know how you should draw samples. In this module we’ll pay attention to good sampling methods as well as some poor practices. To draw conclusions about the population a sample is from, researchers make use of a probability distribution that is very important in the world of statistics: the sampling distribution. We’ll discuss sampling distributions in great detail and compare them to data distributions and population distributions. We’ll look at the sampling distribution of the sample mean and the sampling distribution of the sample proportion.
What's included
7 videos5 readings1 assignment
7 videos•Total 45 minutes
- 5.01 Sample and population•4 minutes
- 5.02 Sampling•8 minutes
- 5.03 The sampling distribution•7 minutes
- 5.04 The central limit theorem•7 minutes
- 5.05 Three distributions•7 minutes
- 5.06 Sampling distribution proportion•5 minutes
- 5.07 Example•7 minutes
5 readings•Total 50 minutes
- Sample and sampling•10 minutes
- Sampling distribution of sample mean and central limit theorem•10 minutes
- Reference•10 minutes
- Sampling distribution of sample proportion and example•10 minutes
- Transcripts - Sampling distributions•10 minutes
1 assignment•Total 30 minutes
- Sampling distributions•30 minutes
We can distinguish two types of statistical inference methods. We can: (1) estimate population parameters; and (2) test hypotheses about these parameters. In this module we’ll talk about the first type of inferential statistics: estimation by means of a confidence interval. A confidence interval is a range of numbers, which, most likely, contains the actual population value. The probability that the interval actually contains the population value is what we call the confidence level. In this module we’ll show you how you can construct confidence intervals for means and proportions and how you should interpret them. We’ll also pay attention to how you can decide how large your sample size should be.
What's included
7 videos4 readings1 assignment
7 videos•Total 40 minutes
- 6.01 Statistical inference•4 minutes
- 6.02 CI for mean with known population sd•6 minutes
- 6.03 CI for mean with unknown population sd•8 minutes
- 6.04 CI for proportion•6 minutes
- 6.05 Confidence levels•7 minutes
- 6.06 Choosing the sample size•6 minutes
- 6.07 Example•4 minutes
4 readings•Total 40 minutes
- Inference and confidence interval for mean•10 minutes
- Confidence interval for proportion and confidence levels•10 minutes
- Sample size and example•10 minutes
- Transcripts - Confidence intervals•10 minutes
1 assignment•Total 30 minutes
- Confidence intervals•30 minutes
In this module we’ll talk about statistical hypotheses. They form the main ingredients of the method of significance testing. An hypothesis is nothing more than an expectation about a population. When we conduct a significance test, we use (just like when we construct a confidence interval) sample data to draw inferences about population parameters. The significance test is, therefore, also a method of inferential statistics. We’ll show that each significance test is based on two hypotheses: the null hypothesis and the alternative hypothesis. When you do a significance test, you assume that the null hypothesis is true unless your data provide strong evidence against it. We’ll show you how you can conduct a significance test about a mean and how you can conduct a test about a proportion. We’ll also demonstrate that significance tests and confidence intervals are closely related. We conclude the module by arguing that you can make right and wrong decisions while doing a test. Wrong decisions are referred to as Type I and Type II errors.
What's included
7 videos4 readings1 assignment
7 videos•Total 39 minutes
- 7.01 Hypotheses•5 minutes
- 7.02 Test about proportion•8 minutes
- 7.03 Test about mean•5 minutes
- 7.04 Step-by-step plan•7 minutes
- 7.05 Significance test and confidence interval•5 minutes
- 7.06 Type I and Type II errors•5 minutes
- 7.07 Example•5 minutes
4 readings•Total 40 minutes
- Hypotheses and significance tests•10 minutes
- Step-by-step plan and confidence interval•10 minutes
- Type I and Type II errors and example•10 minutes
- Transcripts - Significance tests•10 minutes
1 assignment•Total 20 minutes
- Significance tests•20 minutes
This is the final module, where you can apply everything you've learned until now in the final exam. Please note that you can only take the final exam once a month, so make sure you are fully prepared to take the test. Please follow the honor code and do not communicate or confer with others while taking this exam. Good luck!
What's included
1 assignment
1 assignment•Total 30 minutes
- Final Exam•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Offered by
Explore more from Probability and Statistics
- Status: Free Trial
- Status: Free TrialD
Duke University
Course
Guided Project
- Status: Free TrialR
Rice University
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
74.23%
- 4 stars
18.65%
- 3 stars
4.36%
- 2 stars
1.13%
- 1 star
1.60%
Showing 3 of 4670
Reviewed on Sep 8, 2020
Thank You, @University_of_Amsterdam for this wonderful course. I have really benefited a lot from this course. Thank you, Dr. Matthijs Rooduijn for making this course so lively and interesting!!
Reviewed on Aug 27, 2017
Very Good course. I was pretty much satisfied. R-lab can be improved and better explanations to help us on the test could have been given (after not passing the first time, for example).
Reviewed on Mar 21, 2021
Excellent course on basic statistics. The instructors are good and have done much to make it simpler. However, I would have appreciated more worked out examples along with the transcripts.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
More questions
Financial aid available,
