Clustering and Classification with Machine Learning in R
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Clustering and Classification with Machine Learning in R
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Perform basic data pre-processing and wrangling in R Studio.
Implement and analyze unsupervised clustering techniques, such as K-means clustering.
Implement supervised learning techniques and classification methods, such as Random Forests.
Utilize dimensional reduction techniques (PCA) and feature selection.
Skills you'll gain
- Statistical Programming
- Feature Engineering
- Data Science
- Machine Learning Methods
- Dimensionality Reduction
- Applied Machine Learning
- Data Preprocessing
- Unsupervised Learning
- Data Wrangling
- Statistical Machine Learning
- Classification And Regression Tree (CART)
- Machine Learning Software
- Supervised Learning
- Machine Learning
- Machine Learning Algorithms
- Tidyverse (R Package)
Tools you'll learn
Details to know
12 assignments
See how employees at top companies are mastering in-demand skills
There are 10 modules in this course
Updated in May 2025.
This course now features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. This course is a complete guide to supervised and unsupervised learning using R, covering practical data science comprehensively. Companies globally use R to analyze vast data, and mastering it can enhance your career. Unlike other courses, this one provides in-depth knowledge of R's machine learning features, from data reading and cleaning to implementing and evaluating algorithms. -You'll explore topics such as R framework, data structures, pre-processing, machine learning, model building, and selection. -Emphasizing real data, you'll use packages like Caret and understand unsupervised learning, dimension reduction, and supervised learning. -You'll read data, pre-process in R Studio, implement K-means clustering, PCA, Random Forests, and evaluate models. Ideal for students starting with R Studio data science, those wanting to apply unsupervised learning to real data, and anyone with R experience aiming to enhance practical skills. Prior exposure to common machine learning terms would be needed.
In this module, we will introduce the course, outlining the fundamental concepts of clustering and classification in machine learning. We will also guide you through the installation and setup of R and R Studio, ensuring you are prepared to dive into the practical aspects of the course.
What's included
2 videos1 reading1 assignment
2 videosβ’Total 11 minutes
- Welcome to Clustering & Classification with Machine Learning in Rβ’5 minutes
- Installing R and R Studioβ’7 minutes
1 readingβ’Total 10 minutes
- Full Course Resourcesβ’10 minutes
1 assignmentβ’Total 15 minutes
- Introduction to the Course Assessmentβ’15 minutes
In this module, we will explore the different methods to import data into R from various sources. You will learn to read data from CSV and Excel files, unzipped folders, online CSVs, Google Sheets, HTML tables, and databases, setting the foundation for data manipulation and analysis.
What's included
7 videos1 assignment
7 videosβ’Total 40 minutes
- Read in CSV & Excel Dataβ’10 minutes
- Read in Unzipped Folderβ’3 minutes
- Read in Online CSVβ’4 minutes
- Read in Googlesheetsβ’4 minutes
- Read in Data from Online HTML Tables-Part 1β’4 minutes
- Read in Data from Online HTML Tables-Part 2β’6 minutes
- Read Data from a Databaseβ’8 minutes
1 assignmentβ’Total 15 minutes
- Read in Data from Different Sources in R Assessmentβ’15 minutes
In this module, we will delve into data cleaning and preprocessing, ensuring your data is ready for analysis. You will learn to summarize and explore data using the dplyr package and create visualizations with ggplot2. Additionally, we'll cover methods to evaluate associations between variables and test for correlation.
What's included
11 videos1 assignment
11 videosβ’Total 99 minutes
- Remove Missing Valuesβ’17 minutes
- More Data Cleaningβ’8 minutes
- Introduction to dplyr for Data Summarizing-Part 1β’6 minutes
- Introduction to dplyr for Data Summarizing-Part 2β’5 minutes
- Exploratory Data Analysis (EDA): Basic Visualizations with Rβ’19 minutes
- More Exploratory Data Analysis with xdaβ’4 minutes
- Data Exploration & Visualization With dplyr & ggplot2β’6 minutes
- Associations Between Quantitative Variables- Theoryβ’4 minutes
- Testing for Correlationβ’20 minutes
- Evaluate the Relation Between Nominal Variablesβ’6 minutes
- Cramer's V for Examining the Strength of Association Between Nominal Variableβ’4 minutes
1 assignmentβ’Total 15 minutes
- Data Pre-processing and Visualization Assessmentβ’15 minutes
In this module, we will explore the differences between machine learning and traditional statistical analysis, providing a theoretical overview of machine learning. You will gain a foundational understanding of machine learning concepts and their relevance to data science.
What's included
2 videos1 assignment
2 videosβ’Total 11 minutes
- How is Machine Learning Different from Statistical Data Analysis?β’6 minutes
- What is Machine Learning (ML) About? Some Theoretical Pointersβ’6 minutes
1 assignmentβ’Total 15 minutes
- Machine Learning for Data Science Assessmentβ’15 minutes
In this module, we will cover unsupervised learning techniques, focusing on clustering algorithms. You will learn to implement and evaluate different clustering methods, including K-Means, Fuzzy K-Means, DBSCAN, and more. We'll also discuss how to select the best algorithm for your specific data needs.
What's included
12 videos1 assignment
12 videosβ’Total 91 minutes
- K-Means Clusteringβ’15 minutes
- Other Ways of Selecting Cluster Numbersβ’3 minutes
- Fuzzy K-Means Clusteringβ’18 minutes
- Weighted k-meansβ’6 minutes
- Partitioning Around Meloids (PAM)β’7 minutes
- Hierarchical Clustering in Rβ’14 minutes
- Expectation-Maximization (EM) in Rβ’6 minutes
- DBSCAN Clustering in Rβ’5 minutes
- Cluster a Mixed Datasetβ’4 minutes
- Should We Even Do Clustering?β’3 minutes
- Assess Clustering Performanceβ’6 minutes
- Which Clustering Algorithm to Choose?β’4 minutes
1 assignmentβ’Total 15 minutes
- Unsupervised Learning in R Assessmentβ’15 minutes
In this module, we will explore techniques for reducing the dimensionality of your data. You will learn the theoretical aspects of dimension reduction and how to apply methods such as PCA, Multidimensional Scaling, and SVD in R to simplify your datasets while preserving essential information.
What's included
5 videos1 assignment
5 videosβ’Total 27 minutes
- Dimension Reduction-theoryβ’3 minutes
- Principal Component Analysis (PCA)β’13 minutes
- More on PCAβ’4 minutes
- Multidimensional Scalingβ’3 minutes
- Singular Value Decomposition (SVD)β’3 minutes
1 assignmentβ’Total 15 minutes
- Feature/Dimension Reduction Assessmentβ’15 minutes
In this module, we will focus on feature selection techniques to identify the most relevant predictors for your models. You will learn to remove correlated variables and use methods like LASSO regression, FSelector, and Boruta analysis to select important features, enhancing your model's performance.
What's included
4 videos1 assignment
4 videosβ’Total 39 minutes
- Removing Highly Correlated Predictor Variablesβ’17 minutes
- Variable Selection Using LASSO Regressionβ’4 minutes
- Variable Selection with FSelectorβ’14 minutes
- Boruta Analysis for Feature Selectionβ’5 minutes
1 assignmentβ’Total 15 minutes
- Feature Selection to Select the Most Relevant Predictors Assessmentβ’15 minutes
In this module, we will introduce the fundamental concepts of supervised learning. You will learn how to preprocess data for supervised learning and gain insights into various types of supervised learning problems, preparing you for more advanced classification and regression techniques.
What's included
2 videos1 assignment
2 videosβ’Total 16 minutes
- Some Basic Supervised Learning Conceptsβ’10 minutes
- Pre-processing for Supervised Learningβ’6 minutes
1 assignmentβ’Total 15 minutes
- Supervised Learning Theory Assessmentβ’15 minutes
In this module, we will delve into classification techniques in supervised learning. You will learn to implement logistic regression, Decision Trees, Random Forests, and Support Vector Machines (SVM). We will also cover methods to evaluate classification accuracy and understand variable importance in your models.
What's included
18 videos1 assignment
18 videosβ’Total 128 minutes
- What are GLMs?β’5 minutes
- Logistic Regression Models as Binary Classifiersβ’9 minutes
- Binary Classifier with PCAβ’6 minutes
- Some Pointers on Evaluating Accuracyβ’10 minutes
- Obtain Binary Classification Accuracy Metricsβ’8 minutes
- More on Binary Accuracy Measuresβ’4 minutes
- Linear Discriminant Analysisβ’13 minutes
- Our Multi-class Classification Problemβ’6 minutes
- Classification Treesβ’12 minutes
- More on Classification Tree Visualizationβ’9 minutes
- Classification with Party Packageβ’5 minutes
- Decision Treesβ’9 minutes
- Random Forest (RF) Classificationβ’8 minutes
- Examine Individual Variable Importance for Random Forestsβ’4 minutes
- GBM Classificationβ’8 minutes
- Support Vector Machines (SVM) for Classificationβ’4 minutes
- More SVM for Classificationβ’4 minutes
- Variable Importance in SVM Modelling with rminerβ’3 minutes
1 assignmentβ’Total 15 minutes
- Supervised Learning: Classification Assessmentβ’15 minutes
In this module, we will provide additional lectures focusing on advanced clustering methods. You will learn about Fuzzy C-Means Clustering, understanding its theoretical underpinnings and practical applications in R, further enhancing your clustering analysis skills.
What's included
1 video3 assignments
1 videoβ’Total 6 minutes
- Fuzzy C-Means Clusteringβ’6 minutes
3 assignmentsβ’Total 90 minutes
- Full Course Practice Assessmentβ’15 minutes
- Additional Lectures Assessmentβ’15 minutes
- Full Course Assessmentβ’60 minutes
Instructor
Offered by
Explore more from Machine Learning
- Status: Preview
- P
Packt
Course
- Status: Free Trial
Specialization
- Status: Free Trial
Course
Why people choose Coursera for their career
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. Youβll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. Youβll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
