VOOZH about

URL: https://www.coursera.org/learn/tidyverse-data-wrangling

⇱ Wrangling Data in the Tidyverse | Coursera


Wrangling Data in the Tidyverse

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Gain insight into a topic and learn the fundamentals.
4.5

33 reviews

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
4.5

33 reviews

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Apply Tidyverse functions to transform non-tidy data to tidy data

  • Conduct basic exploratory data analysis

  • Conduct analyses of text data

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

7 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Tidyverse Skills for Data Science in R Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

Data never arrive in the condition that you need them in order to do effective data analysis. Data need to be re-shaped, re-arranged, and re-formatted, so that they can be visualized or be inputted into a machine learning algorithm. This course addresses the problem of wrangling your data so that you can bring them under control and analyze them effectively. The key goal in data wrangling is transforming non-tidy data into tidy data.

This course covers many of the critical details about handling tidy and non-tidy data in R such as converting from wide to long formats, manipulating tables with the dplyr package, understanding different R data types, processing text data with regular expressions, and conducting basic exploratory data analyses. Investing the time to learn these data wrangling techniques will make your analyses more efficient, more reproducible, and more understandable to your data science team. In this specialization we assume familiarity with the R programming language. If you are not yet familiar with R, we suggest you first complete R Programming before returning to complete this course.

Data never arrive in the condition that you need them in order to do effective data analysis. Data need to be re-shaped, re-arranged, and re-formatted, so that they can be visualized or be inputted into a machine learning algorithm. This module addresses the problem of wrangling your data so that you can bring them under control and analyze them effectively. The key goal in data wrangling is transforming non-tidy data into tidy data.

What's included

19 readings2 assignments

19 readingsβ€’Total 155 minutes
  • About This Courseβ€’3 minutes
  • Tidy Data Reviewβ€’2 minutes
  • Reshaping Dataβ€’2 minutes
  • Wide Dataβ€’5 minutes
  • Long Dataβ€’5 minutes
  • Reshaping Dataβ€’30 minutes
  • Data Wranglingβ€’0 minutes
  • R Packagesβ€’15 minutes
  • The Pipe Operatorβ€’15 minutes
  • Filtering Dataβ€’20 minutes
  • Reorderingβ€’15 minutes
  • Creating New Columnsβ€’5 minutes
  • Separating Columnsβ€’5 minutes
  • Merging Columnsβ€’5 minutes
  • Cleaning Column Namesβ€’5 minutes
  • Combining Data Across Data Framesβ€’5 minutes
  • Grouping Dataβ€’5 minutes
  • Summarizing Dataβ€’3 minutes
  • Operations Across Columnsβ€’10 minutes
2 assignmentsβ€’Total 60 minutes
  • Reshaping Data Quizβ€’30 minutes
  • Data Wrangling Quizβ€’30 minutes

In R, categorical data are handled as factors. By definition, categorical data are limited in that they have a set number of possible values they can take. For example, there are 12 months in a calendar year. In a month variable, each observation is limited to taking one of these twelve values. Thus, with a limited number of possible values, month is a categorical variable. Categorical data, which will be referred to as factors for the rest of this lesson, are regularly found in data. Learning how to work with this type of variable effectively will be incredibly helpful.

What's included

14 readings2 assignments

14 readingsβ€’Total 75 minutes
  • Working with Factorsβ€’5 minutes
  • Factor Reviewβ€’5 minutes
  • Manually Changing the Labels of Factor Levels: fct_releve()β€’5 minutes
  • Keeping the Order of the Factor Levels: fct_inorder()β€’5 minutes
  • Advanced Factoringβ€’5 minutes
  • Re-ordering Factor Levels by Frequency: fct_infreq()β€’5 minutes
  • Reversing Order Levels: fct_rev()β€’5 minutes
  • Re-ordering Factor Levels by Another Variable: fct_reorder()β€’5 minutes
  • Combining Several Levels into One: fct_recode()β€’5 minutes
  • Converting Numeric Levels to factors: ifelse() + factor()β€’5 minutes
  • Dates and Times Basicsβ€’5 minutes
  • Creating Dates and Date-Time Objectsβ€’10 minutes
  • Working with Datesβ€’5 minutes
  • Time Spansβ€’5 minutes
2 assignmentsβ€’Total 60 minutes
  • Working With Factors Quizβ€’30 minutes
  • Working With Dates Quizβ€’30 minutes

Working with text data is increasingly common in data science projects. Text manipulation is often needed to clean up messy datasets and to create numerical measurements out of text input. In addition, often the text themselves are the data and this module covers tools to extract information from the text.

What's included

13 readings2 assignments

13 readingsβ€’Total 135 minutes
  • Working with Stringsβ€’5 minutes
  • stringrβ€’5 minutes
  • String Basicsβ€’15 minutes
  • Regular Expressionsβ€’3 minutes
  • glueβ€’15 minutes
  • Tidy Text Formatβ€’15 minutes
  • Sentiment Analysisβ€’15 minutes
  • Word and Document Frequencyβ€’30 minutes
  • Functional Programmingβ€’5 minutes
  • For Loops vs. Functionalsβ€’2 minutes
  • map Functionsβ€’5 minutes
  • Multiple Vectorsβ€’15 minutes
  • Anonymous Functionsβ€’5 minutes
2 assignmentsβ€’Total 60 minutes
  • Working With Strings Quizβ€’30 minutes
  • Functional Programming Quizβ€’30 minutes

The goal of an exploratory analysis is to examine, or explore the data and find relationships that weren’t previously known. Exploratory analyses explore how different measures might be related to each other but do not confirm that relationship as causal, i.e., one variable causing another. You’ve probably heard the phrase β€œCorrelation does not imply causation,” and exploratory analyses lie at the root of this saying. Just because you observe a relationship between two variables during exploratory analysis, it does not mean that one necessarily causes the other.

What's included

2 readings

2 readingsβ€’Total 35 minutes
  • Exploratory Data Analysisβ€’10 minutes
  • General Principles of EDAβ€’25 minutes

Now we will demonstrate how to import data using our case study examples. When working through the steps of the case studies, you can use either RStudio on your own computer or Coursera lab spaces provided for each case study.

What's included

11 readings2 ungraded labs

11 readingsβ€’Total 180 minutes
  • Case Studiesβ€’10 minutes
  • Healthcare Coverage Dataβ€’20 minutes
  • Healthcare Spending Dataβ€’20 minutes
  • Join the Dataβ€’30 minutes
  • Census Dataβ€’15 minutes
  • Violent Crimeβ€’15 minutes
  • Brady Scoresβ€’15 minutes
  • The Counted Fatal Shootingsβ€’15 minutes
  • Unemployment Dataβ€’15 minutes
  • Population Density: 2015β€’15 minutes
  • Firearm Ownershipβ€’10 minutes
2 ungraded labsβ€’Total 20 minutes
  • Case Study #1: Health Expendituresβ€’10 minutes
  • Case Study #2: Firearmsβ€’10 minutes

In this project, you will practice data exploration and data wrangling with the tidyverse using consumer complaint data from the Consumer Financial Protection Bureau (CFPB).

What's included

1 reading1 assignment

1 readingβ€’Total 5 minutes
  • Important information before you start the projectβ€’5 minutes
1 assignmentβ€’Total 60 minutes
  • Wrangling Data in the Tidyverse Course Projectβ€’60 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.6 (9 ratings)
Johns Hopkins University
5 Coursesβ€’7,093 learners
Johns Hopkins University
5 Coursesβ€’7,093 learners
Johns Hopkins University
37 Coursesβ€’1,689,638 learners

Explore more from Data Analysis

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    69.69%

  • 4 stars

    18.18%

  • 3 stars

    9.09%

  • 2 stars

    3.03%

  • 1 star

    0%

Showing 3 of 33

AN
Β·

Reviewed on Apr 18, 2022

G​reat course to get yourself acquanted with data wrangling in Tidyverse.

LV
Β·

Reviewed on Apr 24, 2021

Excellent course! I've learned so many useful R techniques/codes!

SM
Β·

Reviewed on Oct 1, 2021

Great course with clearly understandable lectures.

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,