Statistical Methods and Data Analysis
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Ask Coursera
What you'll learn
β’ Explain the basic statistical reasoning involved in data analysis.
β’ Explain the applications of data analysis with examples from published research.
β’ Execute the data analysis projects using R.
Skills you'll gain
- Statistical Visualization
- Statistics
- Statistical Methods
- Data Visualization
- Statistical Analysis
- Correlation Analysis
- Big Data
- Probability & Statistics
- Classification And Regression Tree (CART)
- Descriptive Statistics
- Probability
- Sampling (Statistics)
- Statistical Programming
- Regression Analysis
- Statistical Modeling
- Statistical Inference
- Probability Distribution
- Statistical Hypothesis Testing
Tools you'll learn
Details to know
29 assignments
See how employees at top companies are mastering in-demand skills
There are 8 modules in this course
Welcome to the Statistical Methods and Data Analysis course! This course serves as an introduction to the statistical and computation methods that have become indispensable tools for those pursuing careers in public policy. Alongside offering the necessary background in basic and applied statistics, the course will also introduce you to the powerful R programming interface.
For statistical methods, the course focuses on understanding and application of different concepts and tools of statistics. You will be introduced to data visualization, which is an indispensable tool in this period of vast information around us. In this course, you will learn how to summarize data using descriptive statistics. Next, you will learn about sampling and how to make inferences about the population from a sample via probability theory. You will understand the difference between experimental and observational data. You will learn how to analyze experimental data using tests of significance. For observational data, you will learn correlation analysis and regression analysis. Finally, the course will help you to learn about regression with big data. For programming skills, you will learn R as a programming language for statistical computing. R is a free software environment for statistical computing and has lots of support information available on the Internet. At the end of this course, you will have the confidence of executing your research project using R.
Statistical methods, by definition, are tools for identifying patterns in large datasets. This module takes the first step towards statistical analysis by exploring various strategies for visualizing data, an increasingly important skill in todayβs era of big data. This module explains the different forms of data, types of plots, and charts used to depict the different forms of data. In addition, the module focuses on different visualization techniques appropriate for big data.
What's included
10 videos3 readings4 assignments
10 videosβ’Total 92 minutes
- Statistical Methods and Data Analysisβ’5 minutes
- Nominal, Ordinal, and Interval-Ratio Measurementβ’12 minutes
- Bar Plot, Pie Chart, and Histogramβ’10 minutes
- Stacked Bar Chart, Multiple Bar Chart, and Scatter Plotβ’9 minutes
- Box Plot, Violin Plot, and Ridge plotβ’13 minutes
- Big Dataβ’10 minutes
- Word Frequency Plot and Word Cloudβ’8 minutes
- Sentiment Analysisβ’7 minutes
- N-Gram Plotβ’9 minutes
- Twitter Analysisβ’9 minutes
3 readingsβ’Total 30 minutes
- Course Overviewβ’10 minutes
- Pre-reading materialβ’10 minutes
- Essential Reading Material β Data Visualizationβ’10 minutes
4 assignmentsβ’Total 18 minutes
- Practice Quizβ’2 minutes
- Practice Quizβ’2 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’10 minutes
While data visualization gives us a βfirst cutβ in the empirical world, knowing what the data βlooks likeβ will not take us far towards identifying relationships between variables β the focal point of policymaking. At the minimum, identification requires that the researcher be able to summarize large amounts of information in the form of descriptive statistics. This module explains the measures of central tendency and dispersion for ungrouped data and for grouped data. The measures of central tendency and dispersion for ungrouped data include mean, median, mode, standard deviation, skewness, and kurtosis. The means of central tendency and dispersion for grouped data include grouped mean, grouped standard deviation, grouped mode, and grouped median.
What's included
8 videos1 reading3 assignments
8 videosβ’Total 63 minutes
- Measures of Central Tendency and Dispersion: Introductionβ’5 minutes
- Mode and Variation Ratioβ’9 minutes
- Median and Quartile Rangeβ’8 minutes
- Mean and Standard Deviationβ’7 minutes
- Skewness and Kurtosisβ’12 minutes
- Time-Series Dataβ’9 minutes
- Grouped Mean and Standard Deviationβ’7 minutes
- Grouped Mode and Medianβ’6 minutes
1 readingβ’Total 10 minutes
- Essential Reading Material β Descriptive Statisticsβ’10 minutes
3 assignmentsβ’Total 46 minutes
- Graded Quizβ’30 minutes
- Practice Quizβ’12 minutes
- Practice Quizβ’4 minutes
Except in the rarest of cases when data on the entire population is available for all attributes of interest to the researcher, social scientists must draw inferences about a population from a sample drawn from that population. This module focuses on the statistical reasoning involved in studying the uncertainty attached to sample statistics. For making inferences about the population from a sample, the module explains the fundamentals of probability theory. In addition, the module explains the concepts of random variables and function of random variables. Finally, the module covers the concepts and applications of the binomial and normal distributions.
What's included
9 videos1 reading5 assignments
9 videosβ’Total 65 minutes
- Compound Eventsβ’11 minutes
- Axioms of Probability and Addition Ruleβ’7 minutes
- Independence and Multiplication Ruleβ’8 minutes
- Bayesβ Theoremβ’9 minutes
- Introduction to Random Variablesβ’4 minutes
- Function of Random Variablesβ’6 minutes
- Binomial Distributionβ’7 minutes
- Normal Distributionβ’6 minutes
- Normal Approximationβ’7 minutes
1 readingβ’Total 10 minutes
- Essential Reading Material β Probability Distributionsβ’10 minutes
5 assignmentsβ’Total 78 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’6 minutes
This module discusses the various strategies available to researchers for drawing samples from a population and the first principles involved in determining sample size. The module explains the sampling strategies for sampling from a population. In addition, the module explains how to measure the accuracy of sample estimates. Finally, the module focuses on statistical inference. The goal of statistical inference is to make a statement about something that is not observed based on something that is observed, within a certain level of uncertainty. The module will discuss the Central Limit Theorem (CLT) and the concept of the confidence interval, which allow us to make such statements.
What's included
6 videos1 reading4 assignments
6 videosβ’Total 51 minutes
- Sampling Strategiesβ’14 minutes
- Parameters and Statisticsβ’6 minutes
- Accuracy of Sample Percentageβ’8 minutes
- Accuracy of Sample Meanβ’6 minutes
- Central Limit Theoremβ’8 minutes
- Confidence Intervalβ’9 minutes
1 readingβ’Total 10 minutes
- Essential Reading Material β Samplingβ’10 minutes
4 assignmentsβ’Total 72 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’4 minutes
This module introduces the critical distinction between experimental data and observational data. In addition, the module explores statistical inference in the context of experimental data using tests of significance. You will also learn about observational data and the problem of confounding, controlled experiment, and natural experiment. The module focuses on the concepts and methods for analyzing statistical significance, including analytical framework, one sample t-test, two sample t-test, and ANOVA.
What's included
8 videos4 readings3 assignments
8 videosβ’Total 77 minutes
- Observational Data and the Problem of Confoundingβ’6 minutes
- Controlled Experimentβ’12 minutes
- Natural Experimentβ’12 minutes
- Analytical Frameworkβ’8 minutes
- One Sample T-Testβ’11 minutes
- Two Sample T-Testβ’8 minutes
- Anova Testβ’8 minutes
- Experiments and Statistical Significanceβ’12 minutes
4 readingsβ’Total 40 minutes
- Essential Reading Material β Tests of Significanceβ’10 minutes
- Recommended Reading Material β Observational Data and Experimentsβ’10 minutes
- Essential Reading Material β Tests of Significanceβ’10 minutes
- Recommended Reading Material β Concepts and Methods for Analyzing Statistical Significance β’10 minutes
3 assignmentsβ’Total 76 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’6 minutes
- Practice Quizβ’10 minutes
This module introduces the foundational model for statistical inference with observational data, namely, the ordinary least squares (OLS) regression, paying particular attention to the conditions under which the OLS estimator is the best linear unbiased estimator (BLUE). You will learn about the concept of association, which helps to understand the relationship between two variables. You will also learn about the measures of association appropriate for each variable type: lambda coefficient for nominal variables, gamma coefficient for ordinal variables, and correlation coefficient for interval-ratio variables. Finally, the module focuses on regression analysis by explaining bivariate OLS and multivariate OLS.
What's included
8 videos2 readings4 assignments
8 videosβ’Total 82 minutes
- Measures of Associationβ’5 minutes
- Lambda Coefficientβ’12 minutes
- Gamma Coefficientβ’11 minutes
- Correlation Coefficientβ’6 minutes
- Bivariate OLSβ’12 minutes
- Multivariate OLSβ’15 minutes
- BLUE Assumptionsβ’7 minutes
- Extensions of Basic OLSβ’13 minutes
2 readingsβ’Total 20 minutes
- Essential Reading Material β Correlation and Regressionβ’10 minutes
- Recommended Reading Material β OLS Assumptions and Extensionsβ’10 minutes
4 assignmentsβ’Total 76 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’8 minutes
- Practice Quizβ’4 minutes
- Practice Quizβ’4 minutes
This module focuses on advanced modeling strategies in settings where the best linear unbiased estimator (BLUE) assumptions are violated. You will learn about how to get valid ordinary least squares (OLS) estimates when one or the other key assumption on regression errors for OLS estimates to be BLUE is violated. In particular, you will learn how to detect and correct OLS estimates for reverse causality, heteroscedasticity, and serial correlation. Next, under violations of BLUE assumptions on model and variable specification, you will learn how to model nominal and ordinal dependent variables.
What's included
5 videos4 readings3 assignments
5 videosβ’Total 68 minutes
- Reverse Causalityβ’17 minutes
- Heteroscedasticityβ’12 minutes
- Serial Correlationβ’10 minutes
- Nominal Dependent Variableβ’18 minutes
- Ordinal Dependent Variableβ’10 minutes
4 readingsβ’Total 40 minutes
- Essential Reading Material β Measurement Error, Complex Residual Structures, and Limited Dependent Variablesβ’10 minutes
- Recommended Reading Material β Violations of BLUE Assumptions: Errorsβ’10 minutes
- Essential Reading Material β Measurement Error, Complex Residual Structures, and Limited Dependent Variablesβ’10 minutes
- Recommended Reading Material β Violations of BLUE Assumptions: Model and Variable Specificationβ’10 minutes
3 assignmentsβ’Total 70 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’6 minutes
- Practice Quizβ’4 minutes
Running regression models on large-scale datasets with millions of observations and thousands of variables can be a daunting task. This module examines the strategies for building regression models when dealing with such datasets. For conducting big data regression analysis with nominal dependent variables, you will learn the concepts of decision tree, pruning, cross-validation, and random forest. You will also learn about the penalized regression approach, which is useful for running big data regressions when the dependent variable is an interval-ratio variable.
What's included
5 videos2 readings3 assignments
5 videosβ’Total 40 minutes
- Decision Treeβ’11 minutes
- Pruningβ’8 minutes
- Cross-validationβ’8 minutes
- Random Forestβ’7 minutes
- Penalized Regressionβ’7 minutes
2 readingsβ’Total 20 minutes
- Essential Reading Material β Variable Selectionβ’10 minutes
- Course Wrap- Upβ’10 minutes
3 assignmentsβ’Total 70 minutes
- Graded Quizβ’60 minutes
- Practice Quizβ’8 minutes
- Practice Quizβ’2 minutes
Build toward a degree
This course is part of the following degree program(s) offered by O.P. Jindal Global University. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.ΒΉ
Instructor
Offered by
Explore more from Data Analysis
- Status: Free TrialU
University of Colorado Boulder
Course
- Status: Free Trial
Course
- Status: Free TrialU
University of Pittsburgh
Course
- Status: Free TrialM
Macquarie University
Course
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
