VOOZH about

URL: https://www.coursera.org/learn/understanding-visualization-data

⇱ Understanding and Visualizing Data with Python | Coursera


Understanding and Visualizing Data with Python

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Understanding and Visualizing Data with Python

155,682 already enrolled

Included with

Ask Coursera

Gain insight into a topic and learn the fundamentals.
4.7

2,730 reviews

Beginner level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
96%
Most learners liked this course

Gain insight into a topic and learn the fundamentals.
4.7

2,730 reviews

Beginner level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
96%
Most learners liked this course

What you'll learn

  • Properly identify various data types and understand the different uses for each

  • Create data visualizations and numerical summaries with Python

  • Communicate statistical ideas clearly and concisely to a broad audience

  • Identify appropriate analytic techniques for probability and non-probability samples

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

9 assignments¹

AI Graded see disclaimer
Taught in English

Build your subject-matter expertise

This course is part of the Statistics with Python Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 4 modules in this course

In this course, learners will be introduced to the field of statistics, including where data come from, study design, data management, and exploring and visualizing data. Learners will identify different types of data, and learn how to visualize, analyze, and interpret summaries for both univariate and multivariate data. Learners will also be introduced to the differences between probability and non-probability sampling from larger populations, the idea of how sample estimates vary, and how inferences can be made about larger populations based on probability sampling.

At the end of each week, learners will apply the statistical concepts they’ve learned using Python within the course environment. During these lab-based sessions, learners will discover the different uses of Python as a tool, including the Numpy, Pandas, Statsmodels, Matplotlib, and Seaborn libraries. Tutorial videos are provided to walk learners through the creation of visualizations and data management, all within Python. This course utilizes the Jupyter Notebook environment within Coursera.

In the first week of the course, we will review a course outline and discover the various concepts and objectives to be mastered in the weeks to come. You will get an introduction to the field of statistics and explore a variety of perspectives the field has to offer. We will identify numerous types of data that exist and observe where they can be found in everyday life. You will delve into basic Python functionality, along with an introduction to Jupyter Notebook. All of the course information on grading, prerequisites, and expectations are on the course syllabus and you can find more information on our Course Resources page.

What's included

11 videos7 readings2 assignments1 discussion prompt5 ungraded labs

11 videosTotal 114 minutes
  • Welcome to the Course!3 minutes
  • Understanding and Visualizing Data Guidelines3 minutes
  • What is Statistics?10 minutes
  • Interview: Perspectives on Statistics in Real Life29 minutes
  • (Cool Stuff in) Data9 minutes
  • Where Do Data Come From?13 minutes
  • Variable Types6 minutes
  • Study Design6 minutes
  • Optional: Introduction to Jupyter Notebooks10 minutes
  • Optional: Data Types in Python12 minutes
  • Optional: Introduction to Libraries and Data Management13 minutes
7 readingsTotal 62 minutes
  • Syllabus10 minutes
  • Meet the Course Team!10 minutes
  • About Our Datasets2 minutes
  • Help Us Learn More About You!10 minutes
  • Resource: This is Statistics10 minutes
  • Let's Play with Data!10 minutes
  • Data management and manipulation10 minutes
2 assignmentsTotal 40 minutes
  • Assessment: Different Data Types10 minutes
  • Practice Quiz - Variable Types 30 minutes
1 discussion promptTotal 10 minutes
  • Discussion: Three Guiding Questions10 minutes
5 ungraded labsTotal 60 minutes
  • Introduction to Jupyter Notebooks0 minutes
  • Data Types in Python0 minutes
  • Introduction to Libraries and Data Management0 minutes
  • Continued Data Basics30 minutes
  • Deeper Dive into Data Management & Python Resources30 minutes

In the second week of this course, we will be looking at graphical and numerical interpretations for one variable (univariate data). In particular, we will be creating and analyzing histograms, box plots, and numerical summaries of our data in order to give a basis of analysis for quantitative data and bar charts and pie charts for categorical data. A few key interpretations will be made about our numerical summaries such as mean, IQR, and standard deviation. An assessment is included at the end of the week concerning numerical summaries and interpretations of these summaries.

What's included

6 videos3 readings3 assignments1 discussion prompt6 ungraded labs

6 videosTotal 45 minutes
  • Categorical Data: Tables, Bar Charts & Pie Charts4 minutes
  • Quantitative Data: Histograms13 minutes
  • Quantitative Data: Numerical Summaries9 minutes
  • Standard Score (Empirical Rule)7 minutes
  • Quantitative Data: Boxplots7 minutes
  • Demo: Interactive Histogram & Boxplot5 minutes
3 readingsTotal 30 minutes
  • What's Going on in This Graph?10 minutes
  • Modern Infographics10 minutes
  • Optional: Link to a Graphics Gallery10 minutes
3 assignmentsTotal 35 minutes
  • Assessment: Numerical Summaries10 minutes
  • Python Assessment: Univariate Analysis10 minutes
  • Practice Quiz: Summarizing Graphs in Words 15 minutes
1 discussion promptTotal 10 minutes
  • What is There? What isn't There?10 minutes
6 ungraded labsTotal 180 minutes
  • Python Libraries and an Introduction to Graphing0 minutes
  • Tables, Histograms, and Boxplots in Python0 minutes
  • Case Study of Univariate Data Analyses using NHANES Data30 minutes
  • More Practice: Univariate Analysis Using NHANES60 minutes
  • More Practice: Univariate Analysis Using NHANES (Solutions)60 minutes
  • Univariate Analysis: Assessment Notebook30 minutes

In the third week of this course on looking at data, we’ll introduce key ideas for examining research questions that require looking at more than one variable. In particular, we will consider both numerically and visually how different variables interact, how summaries can appear deceiving if you don’t properly account for interactions, and differences between quantitative and categorical variables. This week’s assignment will consist of a writing assignment along with reviewing those of your peers.

What's included

4 videos2 readings2 assignments1 peer review1 discussion prompt6 ungraded labs

4 videosTotal 22 minutes
  • Looking at Associations with Multivariate Categorical Data10 minutes
  • Looking at Associations with Multivariate Quantitative Data8 minutes
  • Demo: Interactive Scatterplot3 minutes
  • Introduction to Pizza Assignment3 minutes
2 readingsTotal 20 minutes
  • Pitfall: Simpson's Paradox10 minutes
  • Modern Ways to Visualize Data10 minutes
2 assignmentsTotal 25 minutes
  • Python Assessment: Multivariate Analysis15 minutes
  • Practice Quiz: Multivariate Data 10 minutes
1 peer reviewTotal 60 minutes
  • Pizza Study Design Assignment60 minutes
1 discussion promptTotal 15 minutes
  • Discussion: Find Your Own Example15 minutes
6 ungraded labsTotal 120 minutes
  • Multivariate Data Selection0 minutes
  • Multivariate Distributions0 minutes
  • Unit Testing0 minutes
  • Case Study of Multivariate Analyses in NHANES30 minutes
  • More Practice: Multivariate Analyses with NHANES60 minutes
  • Multivariate Analysis: Assessment Notebook30 minutes

In this week, you’ll spend more time thinking about where data come from. The highest-quality statistical analyses of data will always incorporate information about the process used to generate the data, or features of the data collection design. You’ll be exposed to important concepts related to sampling from larger populations, including probability and non-probability sampling, and how we can make inferences about larger populations based on well-designed samples. You’ll also learn about the concept of a sampling distribution, and how estimation of the variance of that distribution plays a critical role in making statements about populations. Finally, you’ll learn about the importance of reading the documentation for a given data set; a key step in looking at data is also looking at the available documentation for that data set, which describes how the data were generated.

What's included

12 videos10 readings2 assignments4 ungraded labs

12 videosTotal 174 minutes
  • Sampling from Well-Defined Populations17 minutes
  • Probability Sampling: Part I11 minutes
  • Probability Sampling: Part II16 minutes
  • Non-Probability Sampling: Part I11 minutes
  • Non-Probability Sampling: Part II10 minutes
  • Sampling Variance & Sampling Distributions: Part I15 minutes
  • Sampling Variance & Sampling Distributions: Part II7 minutes
  • Demo: Interactive Sampling Distribution22 minutes
  • Beyond Means: Sampling Distributions of Other Common Statistics10 minutes
  • Making Population Inference Based on Only One Sample14 minutes
  • Inference for Non-Probability Samples17 minutes
  • Complex Samples24 minutes
10 readingsTotal 95 minutes
  • Building on Visualization Concepts5 minutes
  • More on SRS Probabilities of Inclusion10 minutes
  • Potential Pitfalls of Non-Probability Sampling: A Case Study10 minutes
  • Cluster Sampling and Design Effects10 minutes
  • Resource: Seeing Theory10 minutes
  • Article: Jerzy Neyman on Population Inference10 minutes
  • Preventing Bad/Biased Samples10 minutes
  • Optional: Deeper Dive Reference10 minutes
  • Course Feedback10 minutes
  • Keep Learning with Michigan Online10 minutes
2 assignmentsTotal 30 minutes
  • Assessment: Distinguishing Between Probability & Non-Probability Samples10 minutes
  • Generating Random Data and Samples20 minutes
4 ungraded labsTotal 30 minutes
  • Sampling from a Biased Population0 minutes
  • Randomness and Reproducibility0 minutes
  • The Empirical Rule of Distribution0 minutes
  • Illustrating sampling distributions using NHANES30 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.7 (588 ratings)
University of Michigan
3 Courses172,304 learners

Explore more from Data Analysis

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    75.86%

  • 4 stars

    18.46%

  • 3 stars

    3.55%

  • 2 stars

    0.98%

  • 1 star

    1.13%

Showing 3 of 2730

MR
·

Reviewed on Jun 2, 2020

Never have I come across a course half as interactive as this and it was a much needed confidence booster for a beginner like me. I look forward to completing the specialization : )

AS
·

Reviewed on Mar 2, 2021

20 studying hours that helps me getting back to speed on manipulating the quantitative data in Pandas with different query conditions, powerful statistics and Sampling Distributions.

VM
·

Reviewed on Jan 19, 2021

This was a quick way of understanding the basics. I liked how detailed and basic the learning instructions were. Anyone, even those without a statistics background can begin from here

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.