VOOZH about

URL: https://www.coursera.org/learn/packt-clustering-and-classification-with-machine-learning-in-r-x895m

⇱ Clustering and Classification with Machine Learning in R | Coursera


Clustering and Classification with Machine Learning in R

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Clustering and Classification with Machine Learning in R

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Perform basic data pre-processing and wrangling in R Studio.

  • Implement and analyze unsupervised clustering techniques, such as K-means clustering.

  • Implement supervised learning techniques and classification methods, such as Random Forests.

  • Utilize dimensional reduction techniques (PCA) and feature selection.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

12 assignments

Taught in English

There are 10 modules in this course

Updated in May 2025.

This course now features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. This course is a complete guide to supervised and unsupervised learning using R, covering practical data science comprehensively. Companies globally use R to analyze vast data, and mastering it can enhance your career. Unlike other courses, this one provides in-depth knowledge of R's machine learning features, from data reading and cleaning to implementing and evaluating algorithms. -You'll explore topics such as R framework, data structures, pre-processing, machine learning, model building, and selection. -Emphasizing real data, you'll use packages like Caret and understand unsupervised learning, dimension reduction, and supervised learning. -You'll read data, pre-process in R Studio, implement K-means clustering, PCA, Random Forests, and evaluate models. Ideal for students starting with R Studio data science, those wanting to apply unsupervised learning to real data, and anyone with R experience aiming to enhance practical skills. Prior exposure to common machine learning terms would be needed.

In this module, we will introduce the course, outlining the fundamental concepts of clustering and classification in machine learning. We will also guide you through the installation and setup of R and R Studio, ensuring you are prepared to dive into the practical aspects of the course.

What's included

2 videos1 reading1 assignment

2 videosβ€’Total 11 minutes
  • Welcome to Clustering & Classification with Machine Learning in Rβ€’5 minutes
  • Installing R and R Studioβ€’7 minutes
1 readingβ€’Total 10 minutes
  • Full Course Resourcesβ€’10 minutes
1 assignmentβ€’Total 15 minutes
  • Introduction to the Course Assessmentβ€’15 minutes

In this module, we will explore the different methods to import data into R from various sources. You will learn to read data from CSV and Excel files, unzipped folders, online CSVs, Google Sheets, HTML tables, and databases, setting the foundation for data manipulation and analysis.

What's included

7 videos1 assignment

7 videosβ€’Total 40 minutes
  • Read in CSV & Excel Dataβ€’10 minutes
  • Read in Unzipped Folderβ€’3 minutes
  • Read in Online CSVβ€’4 minutes
  • Read in Googlesheetsβ€’4 minutes
  • Read in Data from Online HTML Tables-Part 1β€’4 minutes
  • Read in Data from Online HTML Tables-Part 2β€’6 minutes
  • Read Data from a Databaseβ€’8 minutes
1 assignmentβ€’Total 15 minutes
  • Read in Data from Different Sources in R Assessmentβ€’15 minutes

In this module, we will delve into data cleaning and preprocessing, ensuring your data is ready for analysis. You will learn to summarize and explore data using the dplyr package and create visualizations with ggplot2. Additionally, we'll cover methods to evaluate associations between variables and test for correlation.

What's included

11 videos1 assignment

11 videosβ€’Total 99 minutes
  • Remove Missing Valuesβ€’17 minutes
  • More Data Cleaningβ€’8 minutes
  • Introduction to dplyr for Data Summarizing-Part 1β€’6 minutes
  • Introduction to dplyr for Data Summarizing-Part 2β€’5 minutes
  • Exploratory Data Analysis (EDA): Basic Visualizations with Rβ€’19 minutes
  • More Exploratory Data Analysis with xdaβ€’4 minutes
  • Data Exploration & Visualization With dplyr & ggplot2β€’6 minutes
  • Associations Between Quantitative Variables- Theoryβ€’4 minutes
  • Testing for Correlationβ€’20 minutes
  • Evaluate the Relation Between Nominal Variablesβ€’6 minutes
  • Cramer's V for Examining the Strength of Association Between Nominal Variableβ€’4 minutes
1 assignmentβ€’Total 15 minutes
  • Data Pre-processing and Visualization Assessmentβ€’15 minutes

In this module, we will explore the differences between machine learning and traditional statistical analysis, providing a theoretical overview of machine learning. You will gain a foundational understanding of machine learning concepts and their relevance to data science.

What's included

2 videos1 assignment

2 videosβ€’Total 11 minutes
  • How is Machine Learning Different from Statistical Data Analysis?β€’6 minutes
  • What is Machine Learning (ML) About? Some Theoretical Pointersβ€’6 minutes
1 assignmentβ€’Total 15 minutes
  • Machine Learning for Data Science Assessmentβ€’15 minutes

In this module, we will cover unsupervised learning techniques, focusing on clustering algorithms. You will learn to implement and evaluate different clustering methods, including K-Means, Fuzzy K-Means, DBSCAN, and more. We'll also discuss how to select the best algorithm for your specific data needs.

What's included

12 videos1 assignment

12 videosβ€’Total 91 minutes
  • K-Means Clusteringβ€’15 minutes
  • Other Ways of Selecting Cluster Numbersβ€’3 minutes
  • Fuzzy K-Means Clusteringβ€’18 minutes
  • Weighted k-meansβ€’6 minutes
  • Partitioning Around Meloids (PAM)β€’7 minutes
  • Hierarchical Clustering in Rβ€’14 minutes
  • Expectation-Maximization (EM) in Rβ€’6 minutes
  • DBSCAN Clustering in Rβ€’5 minutes
  • Cluster a Mixed Datasetβ€’4 minutes
  • Should We Even Do Clustering?β€’3 minutes
  • Assess Clustering Performanceβ€’6 minutes
  • Which Clustering Algorithm to Choose?β€’4 minutes
1 assignmentβ€’Total 15 minutes
  • Unsupervised Learning in R Assessmentβ€’15 minutes

In this module, we will explore techniques for reducing the dimensionality of your data. You will learn the theoretical aspects of dimension reduction and how to apply methods such as PCA, Multidimensional Scaling, and SVD in R to simplify your datasets while preserving essential information.

What's included

5 videos1 assignment

5 videosβ€’Total 27 minutes
  • Dimension Reduction-theoryβ€’3 minutes
  • Principal Component Analysis (PCA)β€’13 minutes
  • More on PCAβ€’4 minutes
  • Multidimensional Scalingβ€’3 minutes
  • Singular Value Decomposition (SVD)β€’3 minutes
1 assignmentβ€’Total 15 minutes
  • Feature/Dimension Reduction Assessmentβ€’15 minutes

In this module, we will focus on feature selection techniques to identify the most relevant predictors for your models. You will learn to remove correlated variables and use methods like LASSO regression, FSelector, and Boruta analysis to select important features, enhancing your model's performance.

What's included

4 videos1 assignment

4 videosβ€’Total 39 minutes
  • Removing Highly Correlated Predictor Variablesβ€’17 minutes
  • Variable Selection Using LASSO Regressionβ€’4 minutes
  • Variable Selection with FSelectorβ€’14 minutes
  • Boruta Analysis for Feature Selectionβ€’5 minutes
1 assignmentβ€’Total 15 minutes
  • Feature Selection to Select the Most Relevant Predictors Assessmentβ€’15 minutes

In this module, we will introduce the fundamental concepts of supervised learning. You will learn how to preprocess data for supervised learning and gain insights into various types of supervised learning problems, preparing you for more advanced classification and regression techniques.

What's included

2 videos1 assignment

2 videosβ€’Total 16 minutes
  • Some Basic Supervised Learning Conceptsβ€’10 minutes
  • Pre-processing for Supervised Learningβ€’6 minutes
1 assignmentβ€’Total 15 minutes
  • Supervised Learning Theory Assessmentβ€’15 minutes

In this module, we will delve into classification techniques in supervised learning. You will learn to implement logistic regression, Decision Trees, Random Forests, and Support Vector Machines (SVM). We will also cover methods to evaluate classification accuracy and understand variable importance in your models.

What's included

18 videos1 assignment

18 videosβ€’Total 128 minutes
  • What are GLMs?β€’5 minutes
  • Logistic Regression Models as Binary Classifiersβ€’9 minutes
  • Binary Classifier with PCAβ€’6 minutes
  • Some Pointers on Evaluating Accuracyβ€’10 minutes
  • Obtain Binary Classification Accuracy Metricsβ€’8 minutes
  • More on Binary Accuracy Measuresβ€’4 minutes
  • Linear Discriminant Analysisβ€’13 minutes
  • Our Multi-class Classification Problemβ€’6 minutes
  • Classification Treesβ€’12 minutes
  • More on Classification Tree Visualizationβ€’9 minutes
  • Classification with Party Packageβ€’5 minutes
  • Decision Treesβ€’9 minutes
  • Random Forest (RF) Classificationβ€’8 minutes
  • Examine Individual Variable Importance for Random Forestsβ€’4 minutes
  • GBM Classificationβ€’8 minutes
  • Support Vector Machines (SVM) for Classificationβ€’4 minutes
  • More SVM for Classificationβ€’4 minutes
  • Variable Importance in SVM Modelling with rminerβ€’3 minutes
1 assignmentβ€’Total 15 minutes
  • Supervised Learning: Classification Assessmentβ€’15 minutes

In this module, we will provide additional lectures focusing on advanced clustering methods. You will learn about Fuzzy C-Means Clustering, understanding its theoretical underpinnings and practical applications in R, further enhancing your clustering analysis skills.

What's included

1 video3 assignments

1 videoβ€’Total 6 minutes
  • Fuzzy C-Means Clusteringβ€’6 minutes
3 assignmentsβ€’Total 90 minutes
  • Full Course Practice Assessmentβ€’15 minutes
  • Additional Lectures Assessmentβ€’15 minutes
  • Full Course Assessmentβ€’60 minutes

Instructor

Packt
1,926 Coursesβ€’560,010 learners

Explore more from Machine Learning

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

This course is currently available only to learners who have paid or received financial aid, when available.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,