VOOZH about

URL: https://www.coursera.org/learn/data-manipulation-and-cleaning-in-r

⇱ Data Manipulation and Cleaning in R | Coursera


Data Manipulation and Cleaning in R

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Data Manipulation and Cleaning in R

This course is part of multiple programs.

Included with

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

Transform raw data into valuable insights using R's powerful tidyverse tools. This beginner-friendly course introduces you to essential data cleaning and manipulation techniques, making complex data tasks approachable and practical. Learn how to clean messy data, handle missing values, and prepare datasets for analysis using Microsoft's development environment and AI assistance.

Through hands-on practice, you'll master fundamental data cleaning skills while building confidence in: - Organizing and structuring data effectively - Handling common data issues - Working with different data formats - Using AI tools to enhance your workflow - Creating reproducible data cleaning processes Each concept is taught step-by-step with extensive examples and guided practice, ensuring you build a strong foundation in data manipulation skills.

In this module, you'll get hands-on experience with dplyr, the powerhouse package for data manipulation in R. We'll work with real retail sales data as you learn to filter, arrange, and transform your data with ease. By the end of this module, you'll be confidently writing clean, efficient code using the pipe operator and essential dplyr functions that professional data analysts use daily.

What's included

4 videos9 readings2 assignments3 ungraded labs

4 videosTotal 24 minutes
  • Welcome to Data Manipulation and Cleaning in R2 minutes
  • Understanding Tidy Data3 minutes
  • Hands-on with dplyr Basics10 minutes
  • Data Transformation in Action9 minutes
9 readingsTotal 185 minutes
  • Course Navigation and General Requirements30 minutes
  • Creating Your GitHub Folder for This Course 30 minutes
  • Getting Started with dplyr15 minutes
  • Essential dplyr Functions Guide15 minutes
  • Using R in your Visual Studio Code Lab 15 minutes
  • Connecting Copilot in your Visual Studio Code Labs10 minutes
  • Understanding Data Transformation30 minutes
  • Advanced Data Transformation Techniques30 minutes
  • Module Overview10 minutes
2 assignmentsTotal 60 minutes
  • dplyr Fundamentals Assessment30 minutes
  • Data Transformation Post Lab Assessment30 minutes
3 ungraded labsTotal 180 minutes
  • Hands-On Activity: Practice with Retail Data60 minutes
  • Hands-On Activity: Data Transformation60 minutes
  • Hands-On Activity: Advanced Data Transformation Practice60 minutes

Data rarely comes in the perfect format we need - and that's exactly what we'll tackle in this module. Using tidyr, you'll learn to reshape data like a pro, converting between wide and long formats, and handling complex data structures. Through practical exercises with regional sales data, you'll master the tools needed to transform messy data into clean, analysis-ready formats.

What's included

3 videos4 readings2 assignments2 ungraded labs

3 videosTotal 17 minutes
  • Understanding Data Formats3 minutes
  • Reshaping Data with tidyr7 minutes
  • Implementing Column Operations7 minutes
4 readingsTotal 70 minutes
  • Wide and Long Data Formats Guide15 minutes
  • Introduction to Column Operations15 minutes
  • Advanced Column Manipulation30 minutes
  • Module Overview10 minutes
2 assignmentsTotal 60 minutes
  • Data Format Fundamentals30 minutes
  • Column Operations Assessment30 minutes
2 ungraded labsTotal 120 minutes
  • Data Format Transformation Practice60 minutes
  • Column Operations Practice60 minutes

Text data can be particularly challenging. In this module, you'll work with stringr to clean and standardize text data effectively. Using real product descriptions and customer data, you'll learn pattern matching and advanced string manipulation techniques that make text data cleaning a breeze. You'll see how combining stringr with dplyr creates robust solutions for complex data cleaning challenges.

What's included

2 videos7 readings2 assignments3 ungraded labs

2 videosTotal 19 minutes
  • String Functions in Action9 minutes
  • Advanced String Functions Demo10 minutes
7 readingsTotal 130 minutes
  • Getting Started with String Manipulation30 minutes
  • Regular Expressions Guide15 minutes
  • Quantifiers and Patterns in Regex20 minutes
  • Advanced String Operations Overview30 minutes
  • Normalizing and Transforming Strings 10 minutes
  • Integration with dplyr15 minutes
  • Module Overview10 minutes
2 assignmentsTotal 60 minutes
  • String Manipulation Basics30 minutes
  • Advanced String Operations Post Lab Assessment30 minutes
3 ungraded labsTotal 180 minutes
  • Hands-On Activity: String Functions in Action60 minutes
  • Advanced String Operations Practice60 minutes
  • String Manipulation Practice60 minutes

In this module, you'll learn approaches to handling missing values, outliers, and duplicates. Working with actual order and inventory data, you'll develop strategies for maintaining data quality. You'll discover how modern AI tools can help automate your cleaning processes, making your work more efficient and consistent.

What's included

4 videos8 readings3 assignments3 ungraded labs

4 videosTotal 26 minutes
  • Impact of Missing Data3 minutes
  • Understanding Missing Values in R7 minutes
  • Outlier and Duplicate Detection Demo7 minutes
  • AI-Assisted Data Cleaning Demo9 minutes
8 readingsTotal 185 minutes
  • Missing Value Handling Guide30 minutes
  • Advanced Imputation Methods for Comprehensive Data Analysis (Optional)10 minutes
  • Missing Value Detection and Treatment10 minutes
  • Understanding Outliers and Duplicates30 minutes
  • Statistical Methods Guide30 minutes
  • Introduction to AI in Data Cleaning30 minutes
  • AI Tool Best Practices30 minutes
  • Module Summary15 minutes
3 assignmentsTotal 90 minutes
  • Missing Value Concepts30 minutes
  • Outlier and Duplicate Handling30 minutes
  • AI Tools Assessment30 minutes
3 ungraded labsTotal 180 minutes
  • Missing Value Practice60 minutes
  • Outlier and Duplicate Detection Practice60 minutes
  • AI-Assisted Data Cleaning Practice60 minutes

The comprehensive project simulates a real-world data cleaning scenario where you'll act as a data specialist tasked with standardizing a critical organizational dataset. You'll apply all the key skills learned throughout the course in a structured, step-by-step approach.

What's included

5 readings1 programming assignment2 ungraded labs

5 readingsTotal 110 minutes
  • Project Overview30 minutes
  • Sample Project: Project Implementation10 minutes
  • [Solution Reference] Final Project10 minutes
  • Module Overview30 minutes
  • Course Overview and Next Steps30 minutes
1 programming assignmentTotal 120 minutes
  • Final Project Implementation 120 minutes
2 ungraded labsTotal 105 minutes
  • Sample Project 1 - Practice60 minutes
  • Sample Project 2 - Practice45 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

343 Courses2,617,428 learners

Explore more from Software Development

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Financial aid available,