Understanding and Visualizing Data with Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Understanding and Visualizing Data with Python
This course is part of Statistics with Python Specialization
155,682 already enrolled
Included with
Ask Coursera
2,730 reviews
Recommended experience
2,730 reviews
Recommended experience
What you'll learn
Properly identify various data types and understand the different uses for each
Create data visualizations and numerical summaries with Python
Communicate statistical ideas clearly and concisely to a broad audience
Identify appropriate analytic techniques for probability and non-probability samples
Skills you'll gain
- Statistical Visualization
- Data Visualization
- Matplotlib
- Statistical Methods
- Data Collection
- Data Visualization Software
- Descriptive Statistics
- Data Analysis
- Statistical Programming
- Statistical Inference
- Statistical Analysis
- Probability & Statistics
- Plot (Graphics)
- Exploratory Data Analysis
- Sampling (Statistics)
- Statistics
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
In this course, learners will be introduced to the field of statistics, including where data come from, study design, data management, and exploring and visualizing data. Learners will identify different types of data, and learn how to visualize, analyze, and interpret summaries for both univariate and multivariate data. Learners will also be introduced to the differences between probability and non-probability sampling from larger populations, the idea of how sample estimates vary, and how inferences can be made about larger populations based on probability sampling.
At the end of each week, learners will apply the statistical concepts they’ve learned using Python within the course environment. During these lab-based sessions, learners will discover the different uses of Python as a tool, including the Numpy, Pandas, Statsmodels, Matplotlib, and Seaborn libraries. Tutorial videos are provided to walk learners through the creation of visualizations and data management, all within Python. This course utilizes the Jupyter Notebook environment within Coursera.
In the first week of the course, we will review a course outline and discover the various concepts and objectives to be mastered in the weeks to come. You will get an introduction to the field of statistics and explore a variety of perspectives the field has to offer. We will identify numerous types of data that exist and observe where they can be found in everyday life. You will delve into basic Python functionality, along with an introduction to Jupyter Notebook. All of the course information on grading, prerequisites, and expectations are on the course syllabus and you can find more information on our Course Resources page.
What's included
11 videos7 readings2 assignments1 discussion prompt5 ungraded labs
11 videos•Total 114 minutes
- Welcome to the Course!•3 minutes
- Understanding and Visualizing Data Guidelines•3 minutes
- What is Statistics?•10 minutes
- Interview: Perspectives on Statistics in Real Life•29 minutes
- (Cool Stuff in) Data•9 minutes
- Where Do Data Come From?•13 minutes
- Variable Types•6 minutes
- Study Design•6 minutes
- Optional: Introduction to Jupyter Notebooks•10 minutes
- Optional: Data Types in Python•12 minutes
- Optional: Introduction to Libraries and Data Management•13 minutes
7 readings•Total 62 minutes
- Syllabus•10 minutes
- Meet the Course Team!•10 minutes
- About Our Datasets•2 minutes
- Help Us Learn More About You!•10 minutes
- Resource: This is Statistics•10 minutes
- Let's Play with Data!•10 minutes
- Data management and manipulation•10 minutes
2 assignments•Total 40 minutes
- Assessment: Different Data Types•10 minutes
- Practice Quiz - Variable Types •30 minutes
1 discussion prompt•Total 10 minutes
- Discussion: Three Guiding Questions•10 minutes
5 ungraded labs•Total 60 minutes
- Introduction to Jupyter Notebooks•0 minutes
- Data Types in Python•0 minutes
- Introduction to Libraries and Data Management•0 minutes
- Continued Data Basics•30 minutes
- Deeper Dive into Data Management & Python Resources•30 minutes
In the second week of this course, we will be looking at graphical and numerical interpretations for one variable (univariate data). In particular, we will be creating and analyzing histograms, box plots, and numerical summaries of our data in order to give a basis of analysis for quantitative data and bar charts and pie charts for categorical data. A few key interpretations will be made about our numerical summaries such as mean, IQR, and standard deviation. An assessment is included at the end of the week concerning numerical summaries and interpretations of these summaries.
What's included
6 videos3 readings3 assignments1 discussion prompt6 ungraded labs
6 videos•Total 45 minutes
- Categorical Data: Tables, Bar Charts & Pie Charts•4 minutes
- Quantitative Data: Histograms•13 minutes
- Quantitative Data: Numerical Summaries•9 minutes
- Standard Score (Empirical Rule)•7 minutes
- Quantitative Data: Boxplots•7 minutes
- Demo: Interactive Histogram & Boxplot•5 minutes
3 readings•Total 30 minutes
- What's Going on in This Graph?•10 minutes
- Modern Infographics•10 minutes
- Optional: Link to a Graphics Gallery•10 minutes
3 assignments•Total 35 minutes
- Assessment: Numerical Summaries•10 minutes
- Python Assessment: Univariate Analysis•10 minutes
- Practice Quiz: Summarizing Graphs in Words •15 minutes
1 discussion prompt•Total 10 minutes
- What is There? What isn't There?•10 minutes
6 ungraded labs•Total 180 minutes
- Python Libraries and an Introduction to Graphing•0 minutes
- Tables, Histograms, and Boxplots in Python•0 minutes
- Case Study of Univariate Data Analyses using NHANES Data•30 minutes
- More Practice: Univariate Analysis Using NHANES•60 minutes
- More Practice: Univariate Analysis Using NHANES (Solutions)•60 minutes
- Univariate Analysis: Assessment Notebook•30 minutes
In the third week of this course on looking at data, we’ll introduce key ideas for examining research questions that require looking at more than one variable. In particular, we will consider both numerically and visually how different variables interact, how summaries can appear deceiving if you don’t properly account for interactions, and differences between quantitative and categorical variables. This week’s assignment will consist of a writing assignment along with reviewing those of your peers.
What's included
4 videos2 readings2 assignments1 peer review1 discussion prompt6 ungraded labs
4 videos•Total 22 minutes
- Looking at Associations with Multivariate Categorical Data•10 minutes
- Looking at Associations with Multivariate Quantitative Data•8 minutes
- Demo: Interactive Scatterplot•3 minutes
- Introduction to Pizza Assignment•3 minutes
2 readings•Total 20 minutes
- Pitfall: Simpson's Paradox•10 minutes
- Modern Ways to Visualize Data•10 minutes
2 assignments•Total 25 minutes
- Python Assessment: Multivariate Analysis•15 minutes
- Practice Quiz: Multivariate Data •10 minutes
1 peer review•Total 60 minutes
- Pizza Study Design Assignment•60 minutes
1 discussion prompt•Total 15 minutes
- Discussion: Find Your Own Example•15 minutes
6 ungraded labs•Total 120 minutes
- Multivariate Data Selection•0 minutes
- Multivariate Distributions•0 minutes
- Unit Testing•0 minutes
- Case Study of Multivariate Analyses in NHANES•30 minutes
- More Practice: Multivariate Analyses with NHANES•60 minutes
- Multivariate Analysis: Assessment Notebook•30 minutes
In this week, you’ll spend more time thinking about where data come from. The highest-quality statistical analyses of data will always incorporate information about the process used to generate the data, or features of the data collection design. You’ll be exposed to important concepts related to sampling from larger populations, including probability and non-probability sampling, and how we can make inferences about larger populations based on well-designed samples. You’ll also learn about the concept of a sampling distribution, and how estimation of the variance of that distribution plays a critical role in making statements about populations. Finally, you’ll learn about the importance of reading the documentation for a given data set; a key step in looking at data is also looking at the available documentation for that data set, which describes how the data were generated.
What's included
12 videos10 readings2 assignments4 ungraded labs
12 videos•Total 174 minutes
- Sampling from Well-Defined Populations•17 minutes
- Probability Sampling: Part I•11 minutes
- Probability Sampling: Part II•16 minutes
- Non-Probability Sampling: Part I•11 minutes
- Non-Probability Sampling: Part II•10 minutes
- Sampling Variance & Sampling Distributions: Part I•15 minutes
- Sampling Variance & Sampling Distributions: Part II•7 minutes
- Demo: Interactive Sampling Distribution•22 minutes
- Beyond Means: Sampling Distributions of Other Common Statistics•10 minutes
- Making Population Inference Based on Only One Sample•14 minutes
- Inference for Non-Probability Samples•17 minutes
- Complex Samples•24 minutes
10 readings•Total 95 minutes
- Building on Visualization Concepts•5 minutes
- More on SRS Probabilities of Inclusion•10 minutes
- Potential Pitfalls of Non-Probability Sampling: A Case Study•10 minutes
- Cluster Sampling and Design Effects•10 minutes
- Resource: Seeing Theory•10 minutes
- Article: Jerzy Neyman on Population Inference•10 minutes
- Preventing Bad/Biased Samples•10 minutes
- Optional: Deeper Dive Reference•10 minutes
- Course Feedback•10 minutes
- Keep Learning with Michigan Online•10 minutes
2 assignments•Total 30 minutes
- Assessment: Distinguishing Between Probability & Non-Probability Samples•10 minutes
- Generating Random Data and Samples•20 minutes
4 ungraded labs•Total 30 minutes
- Sampling from a Biased Population•0 minutes
- Randomness and Reproducibility•0 minutes
- The Empirical Rule of Distribution•0 minutes
- Illustrating sampling distributions using NHANES•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Offered by
Explore more from Data Analysis
- Status: Free Trial
Course
- Status: Preview
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
75.86%
- 4 stars
18.46%
- 3 stars
3.55%
- 2 stars
0.98%
- 1 star
1.13%
Showing 3 of 2730
Reviewed on Jun 2, 2020
Never have I come across a course half as interactive as this and it was a much needed confidence booster for a beginner like me. I look forward to completing the specialization : )
Reviewed on Mar 2, 2021
20 studying hours that helps me getting back to speed on manipulating the quantitative data in Pandas with different query conditions, powerful statistics and Sampling Distributions.
Reviewed on Jan 19, 2021
This was a quick way of understanding the basics. I liked how detailed and basic the learning instructions were. Anyone, even those without a statistics background can begin from here
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
More questions
Financial aid available,
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
