VOOZH about

URL: https://www.coursera.org/learn/data-science-k-means-clustering-python

⇱ Foundations of Data Science: K-Means Clustering in Python | Coursera


Foundations of Data Science: K-Means Clustering in Python

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Gain insight into a topic and learn the fundamentals.
4.6

735 reviews

Beginner level

Recommended experience

Flexible schedule
3 weeks at 10 hours a week
Learn at your own pace
95%
Most learners liked this course

Gain insight into a topic and learn the fundamentals.
4.6

735 reviews

Beginner level

Recommended experience

Flexible schedule
3 weeks at 10 hours a week
Learn at your own pace
95%
Most learners liked this course

What you'll learn

  • Define and explain the key concepts of data clustering

  • Demonstrate understanding of the key constructs and features of the Python language.

  • Implement in Python the principle steps of the K-means algorithm.

  • Design and execute a whole data clustering workflow and interpret the outputs.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

39 assignmentsΒΉ

AI Graded see disclaimer
Taught in English

There are 5 modules in this course

Organisations all around the world are using data to predict behaviours and extract valuable real-world insights to inform decisions. Managing and analysing big data has become an essential part of modern finance, retail, marketing, social science, development and research, medicine and government.

This MOOC, designed by an academic team from Goldsmiths, University of London, will quickly introduce you to the core concepts of Data Science to prepare you for intermediate and advanced Data Science courses. It focuses on the basic mathematics, statistics and programming skills that are necessary for typical data analysis tasks. You will consider these fundamental concepts on an example data clustering task, and you will use this example to learn basic programming skills that are necessary for mastering Data Science techniques. During the course, you will be asked to do a series of mathematical and programming exercises and a small data clustering project for a given dataset.

This week we will introduce you to the course and to the team who will be guiding you through the course over the next 5 weeks. The aim of this week's material is to gently introduce you to Data Science through some real-world examples of where Data Science is used, and also by highlighting some of the main concepts involved.

What's included

9 videos4 assignments3 discussion prompts

9 videosβ€’Total 22 minutes
  • Welcome and Introductionβ€’3 minutes
  • Introduction to Data Scienceβ€’3 minutes
  • What is Data?β€’2 minutes
  • Types of Dataβ€’1 minute
  • Machine Learningβ€’4 minutes
  • Supervised vs Unsupervised Learningβ€’3 minutes
  • K-Means Clusteringβ€’4 minutes
  • Preparing your Dataβ€’2 minutes
  • A Real World Datasetβ€’1 minute
4 assignmentsβ€’Total 100 minutes
  • Week 1 Summative Assessmentβ€’40 minutes
  • Types of Data – Review Informationβ€’15 minutes
  • Supervised vs Unsupervised – Review Informationβ€’15 minutes
  • K-Means Clustering – Review Informationβ€’30 minutes
3 discussion promptsβ€’Total 270 minutes
  • Welcome!β€’30 minutes
  • Examples of Dataβ€’120 minutes
  • Machine Learning in the Newsβ€’120 minutes

What's included

11 videos4 readings10 assignments1 peer review1 ungraded lab

11 videosβ€’Total 37 minutes
  • 2.0: Week 2 Introductionβ€’1 minute
  • 2.1 – Introduction to Mathematical Concepts of Data Clusteringβ€’2 minutes
  • 2.2 – Mean of One Dimensional Listsβ€’2 minutes
  • 2.3 – Variance and Standard Deviationβ€’4 minutes
  • 2.4 Jupyter Notebooksβ€’6 minutes
  • 2.5 Variablesβ€’4 minutes
  • 2.6 Listsβ€’5 minutes
  • 2.7 Computing the Meanβ€’3 minutes
  • 2.8 Better Lists: NumPyβ€’4 minutes
  • 2.9 Computing the Standard Deviationβ€’6 minutes
  • Week 2 Conclusionβ€’1 minute
4 readingsβ€’Total 50 minutes
  • Population vs Sample, Biasβ€’10 minutes
  • Variability, Standard Deviation and Biasβ€’10 minutes
  • Python Style Guideβ€’10 minutes
  • Numpy and Array Creationβ€’20 minutes
10 assignmentsβ€’Total 122 minutes
  • Week 2 Summative Assessmentβ€’40 minutes
  • Population vs Sample – Review Informationβ€’5 minutes
  • Mean of One Dimensional Lists – Review Informationβ€’3 minutes
  • Variance and Standard Deviation – Review Informationβ€’4 minutes
  • Jupyter Notebooks – Review Informationβ€’20 minutes
  • Variables – Review Informationβ€’10 minutes
  • Lists – Review Informationβ€’10 minutes
  • Computing the Mean – Review Informationβ€’10 minutes
  • Better Lists – Review Informationβ€’10 minutes
  • Computing the Standard Deviation – Review Informationβ€’10 minutes
1 peer reviewβ€’Total 30 minutes
  • Use Jupyter Notebooksβ€’30 minutes
1 ungraded labβ€’Total 15 minutes
  • Jupyter Notebook Environmentβ€’15 minutes

What's included

16 videos10 readings15 assignments

16 videosβ€’Total 53 minutes
  • Week 3 Introductionβ€’1 minute
  • 3.1 Multidimensional Data Points and Featuresβ€’2 minutes
  • 3.2 Multidimensional Meanβ€’3 minutes
  • 3.3 Dispersion: Multidimensional Variablesβ€’3 minutes
  • 3.4 Distance Metricsβ€’5 minutes
  • 3.5 Normalisationβ€’1 minute
  • 3.6 Outliersβ€’1 minute
  • 3.7 Basic Plottingβ€’3 minutes
  • 3.7a Storing 2D Coordinates in a Single Data Structureβ€’6 minutes
  • 3.8 Multidimensional Meanβ€’5 minutes
  • 3.9 Adding Graphical Overlaysβ€’6 minutes
  • 3.10 Calculating the Distance to the Meanβ€’4 minutes
  • 3.11 List Comprehensionβ€’4 minutes
  • 3.12 Normalisation in Pythonβ€’6 minutes
  • 3.13 Outliers and Plotting Normalised Dataβ€’3 minutes
  • Week 3 Conclusionβ€’1 minute
10 readingsβ€’Total 120 minutes
  • Multidimensional Data Points and Features Recapβ€’10 minutes
  • Multidimensional Mean Recapβ€’10 minutes
  • Multidimensional Variables Recapβ€’10 minutes
  • Distance Metrics Recapβ€’10 minutes
  • Normalisation Recapβ€’10 minutes
  • Note on Matplotlibβ€’10 minutes
  • Matplotlib Scatter Plot Documentationβ€’20 minutes
  • Matplotlib Patches Documentationβ€’10 minutes
  • List Comprehension Documentationβ€’20 minutes
  • 3.12 Errataβ€’10 minutes
15 assignmentsβ€’Total 290 minutes
  • Week 3 Summative Assessmentβ€’25 minutes
  • Multidimensional Data Points and Features – Review Informationβ€’3 minutes
  • Multidimensional Mean – Review Informationβ€’3 minutes
  • Dispersion: Multidimensional Variables – Review Informationβ€’5 minutes
  • Distance Metrics – Review Informationβ€’6 minutes
  • Normalisation – Review Informationβ€’3 minutes
  • Outliers – Review Informationβ€’30 minutes
  • Basic Plotting – Review Informationβ€’5 minutes
  • Storing 2D Coordinates – Review Informationβ€’30 minutes
  • Multidimensional Mean – Review Informationβ€’30 minutes
  • Adding Graphical Overlays – Review Informationβ€’30 minutes
  • Calculating Distance – Review Informationβ€’30 minutes
  • List Comprehension – Review Informationβ€’30 minutes
  • Normalisation in Python – Review Informationβ€’30 minutes
  • Outliers – Review Informationβ€’30 minutes

What's included

8 videos6 readings7 assignments1 peer review

8 videosβ€’Total 37 minutes
  • Week 4 Introductionβ€’1 minute
  • 4.1: Using the Pandas Library to Read csv Filesβ€’5 minutes
  • 4.1a: Sorting and Filtering Data Using Pandasβ€’8 minutes
  • 4.1b: Labelling Points on a Graphβ€’4 minutes
  • 4.1c: Labelling all the Points on a Graphβ€’3 minutes
  • 4.2: Eyeballing the Dataβ€’6 minutes
  • 4.3: Using K-Means to Interpret the Dataβ€’9 minutes
  • Week 4: Conclusionβ€’1 minute
6 readingsβ€’Total 60 minutes
  • Week 4 Code Resourcesβ€’5 minutes
  • Pandas Read_CSV Functionβ€’15 minutes
  • More Pandas Library Documentationβ€’10 minutes
  • The Pyplot Text Functionβ€’10 minutes
  • For Loops in Pythonβ€’10 minutes
  • Documentation for sklearn.cluster.KMeansβ€’10 minutes
7 assignmentsβ€’Total 75 minutes
  • Week 4 Summative Assessmentβ€’40 minutes
  • Using the Pandas Library to Read csv Files – Review Informationβ€’5 minutes
  • Sorting and Filtering Data Using Pandas – Review Informationβ€’10 minutes
  • Labelling Points on a Graph – Review Informationβ€’5 minutes
  • Labelling all the Points on a Graph – Review Informationβ€’5 minutes
  • Eyeballing the Data – Review Informationβ€’5 minutes
  • Using K-Means to Interpret the Data – Review Informationβ€’5 minutes
1 peer reviewβ€’Total 60 minutes
  • Create a Labelled Plot of the Happiness Dataβ€’60 minutes

What's included

9 videos3 readings3 assignments3 peer reviews5 discussion prompts

9 videosβ€’Total 30 minutes
  • Introduction to Week 5β€’1 minute
  • 5.1 Can a Machine Detect Fake Notes?β€’2 minutes
  • 5.2 Working for a Clientβ€’5 minutes
  • 5.3 How to Organize Work on Your Projectβ€’4 minutes
  • 5.4 Dealing With Difficultiesβ€’3 minutes
  • 5.5 No Data no Data Science: Introduction of the Datasetβ€’5 minutes
  • 5.6 Modellingβ€’5 minutes
  • 5.7 Presenting the Project Resultsβ€’3 minutes
  • 5.8 Concluding Remarksβ€’1 minute
3 readingsβ€’Total 25 minutes
  • Week 5 Code Resource – the Dataset for our Projectβ€’10 minutes
  • Saving plt.scatter Outputs as Figuresβ€’10 minutes
  • Additional Recommended Reading for Week 5β€’5 minutes
3 assignmentsβ€’Total 44 minutes
  • Week 5 Summative Assessmentβ€’30 minutes
  • How Would You Help? – Review Informationβ€’10 minutes
  • Python – Review Informationβ€’4 minutes
3 peer reviewsβ€’Total 180 minutes
  • Exploratory Data Analysisβ€’60 minutes
  • Clusteringβ€’60 minutes
  • Your Reportβ€’60 minutes
5 discussion promptsβ€’Total 130 minutes
  • What Is Required to Train a Machine to Detect Fake Notes?β€’40 minutes
  • Your Project Planβ€’60 minutes
  • Self-reflectionβ€’10 minutes
  • Tips for Other Learnersβ€’10 minutes
  • Do You have Data Science Plans?β€’10 minutes

Instructors

Instructor ratings
4.6 (311 ratings)
University of London
24 Coursesβ€’437,754 learners
University of London
1 Courseβ€’76,961 learners

Explore more from Machine Learning

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    72.65%

  • 4 stars

    20%

  • 3 stars

    4.35%

  • 2 stars

    1.08%

  • 1 star

    1.90%

Showing 3 of 735

MP
Β·

Reviewed on Jan 2, 2022

T​his course was excellent. Though I already knew the concepts in this course and the programing skills, it has inspired me to 100% take the MSC provided. Excellent lectures. Just Excellent.

Y
Β·

Reviewed on Jul 26, 2021

Very informative course. You can learn how to cluster and clisfy data and how to write down report on the given statistical analysis.

MM
Β·

Reviewed on Jun 28, 2020

Very interesting course! The lecturers explain concepts thoroughly which makes the concepts easy to understand even for people without much knowledge in Data Science

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,

ΒΉ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.