Machine Learning With Big Data
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Machine Learning With Big Data
This course is part of Big Data Specialization
Instructors: Mai Nguyen
79,300 already enrolled
Included with
Learn more
Ask Coursera
2,505 reviews
2,505 reviews
Skills you'll gain
- Model Evaluation
- Exploratory Data Analysis
- Data Transformation
- Descriptive Statistics
- Big Data
- Machine Learning Algorithms
- Data Mining
- Data Preprocessing
- Machine Learning
- Statistical Analysis
- Data Wrangling
- Machine Learning Methods
- Applied Machine Learning
- Model Training
- Data Analysis
- Unsupervised Learning
- Machine Learning Software
- Regression Analysis
Tools you'll learn
Details to know
11 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 7 modules in this course
Want to make sense of the volumes of data you have collected? Need to incorporate data-driven decisions into your process? This course provides an overview of machine learning techniques to explore, analyze, and leverage data. You will be introduced to tools and algorithms you can use to create machine learning models that learn from data, and to scale those models up to big data problems.
At the end of the course, you will be able to: β’ Design an approach to leverage data using the steps in the machine learning process. β’ Apply machine learning techniques to explore and prepare data for modeling. β’ Identify the type of machine learning problem in order to apply the appropriate set of techniques. β’ Construct models that learn from data using widely available open source tools. β’ Analyze big data problems using scalable machine learning algorithms on Spark. Software Requirements: Cloudera VM, KNIME, Spark
What's included
2 videos2 discussion prompts
2 videosβ’Total 14 minutes
- Welcome to Machine Learning With Big Dataβ’4 minutes
- Summary of Big Data Integration and Processingβ’11 minutes
2 discussion promptsβ’Total 20 minutes
- Getting to Know You: Tell us about yourself and why you are taking this course.β’10 minutes
- Discussion Forum for Course Content Issuesβ’10 minutes
What's included
7 videos6 readings1 assignment1 discussion prompt
7 videosβ’Total 45 minutes
- Machine Learning Overviewβ’8 minutes
- Categories Of Machine Learning Techniquesβ’8 minutes
- Machine Learning Processβ’3 minutes
- Goals and Activities in the Machine Learning Processβ’11 minutes
- CRISP-DMβ’5 minutes
- Scaling Up Machine Learning Algorithmsβ’5 minutes
- Tools Used in this Courseβ’5 minutes
6 readingsβ’Total 125 minutes
- Slides: Machine Learning Overview and Applicationsβ’25 minutes
- Downloading and Installing Docker Desktop Instructionsβ’10 minutes
- Instroduction to Jupyter Notebooksβ’10 minutes
- Downloading Hands-On Materialsβ’10 minutes
- Basic terminal shell commandsβ’10 minutes
- Downloading, Installing and Using KNIMEβ’60 minutes
1 assignmentβ’Total 20 minutes
- Machine Learning Overviewβ’20 minutes
1 discussion promptβ’Total 10 minutes
- Machine Learning in Everyday Lifeβ’10 minutes
What's included
6 videos4 readings2 assignments1 discussion prompt
6 videosβ’Total 39 minutes
- Data Terminologyβ’5 minutes
- Data Explorationβ’4 minutes
- Data Exploration through Summary Statisticsβ’8 minutes
- Data Exploration through Plotsβ’8 minutes
- Exploring Data with KNIME Plotsβ’9 minutes
- Data Exploration in Sparkβ’5 minutes
4 readingsβ’Total 70 minutes
- Slides: Data Exploration Overview and Terminologyβ’10 minutes
- Description of Daily Weather Datasetβ’10 minutes
- Exploring Data with KNIME Plotsβ’40 minutes
- Data Exploration in Sparkβ’10 minutes
2 assignmentsβ’Total 40 minutes
- Data Explorationβ’20 minutes
- Data Exploration in KNIME and Spark Quizβ’20 minutes
1 discussion promptβ’Total 10 minutes
- What's Wrong with Pie Charts?β’10 minutes
What's included
8 videos3 readings2 assignments2 discussion prompts
8 videosβ’Total 40 minutes
- Data Preparationβ’3 minutes
- Data Qualityβ’4 minutes
- Addressing Data Quality Issuesβ’5 minutes
- Feature Selectionβ’5 minutes
- Feature Transformationβ’5 minutes
- Dimensionality Reductionβ’7 minutes
- Handling Missing Values in KNIMEβ’5 minutes
- Handling Missing Values in Sparkβ’4 minutes
3 readingsβ’Total 60 minutes
- Slides: Data Preparation for Machine Learningβ’30 minutes
- Handling Missing Values in KNIMEβ’20 minutes
- Handling Missing Values in Sparkβ’10 minutes
2 assignmentsβ’Total 45 minutes
- Data Preparationβ’25 minutes
- Handling Missing Values in KNIME and Spark Quizβ’20 minutes
2 discussion promptsβ’Total 20 minutes
- Quality Issues with Real Dataβ’10 minutes
- Domain Knowledge in Data Preparationβ’10 minutes
What's included
8 videos5 readings2 assignments1 discussion prompt
8 videosβ’Total 60 minutes
- Classificationβ’4 minutes
- Building and Applying a Classification Modelβ’6 minutes
- Classification Algorithmsβ’3 minutes
- k-Nearest Neighborsβ’5 minutes
- Decision Treesβ’13 minutes
- NaΓ―ve Bayesβ’14 minutes
- Classification using Decision Tree in KNIMEβ’8 minutes
- Classification in Sparkβ’7 minutes
5 readingsβ’Total 130 minutes
- Slides: What is Classification?β’10 minutes
- Slides: Classification Algorithmsβ’10 minutes
- Classification using Decision Tree in KNIMEβ’45 minutes
- Interpreting a Decision Tree in KNIMEβ’20 minutes
- Classification in Sparkβ’45 minutes
2 assignmentsβ’Total 36 minutes
- Classificationβ’20 minutes
- Classification in KNIME and Spark Quizβ’16 minutes
1 discussion promptβ’Total 10 minutes
- Why Exclude Relative Humidity?β’10 minutes
What's included
7 videos6 readings2 assignments1 discussion prompt
7 videosβ’Total 44 minutes
- Generalization and Overfittingβ’5 minutes
- Overfitting in Decision Treesβ’4 minutes
- Using a Validation Setβ’10 minutes
- Metrics to Evaluate Model Performanceβ’10 minutes
- Confusion Matrixβ’7 minutes
- Evaluation of Decision Tree in KNIMEβ’5 minutes
- Evaluation of Decision Tree in Sparkβ’3 minutes
6 readingsβ’Total 90 minutes
- Slides: Overfitting: What is it and how would you prevent it?β’10 minutes
- Slides: Model evaluation metrics and methodsβ’10 minutes
- Evaluation of Decision Tree in KNIMEβ’30 minutes
- Completed KNIME Workflowsβ’10 minutes
- Evaluation of Decision Tree in Sparkβ’20 minutes
- Comparing Classification Results for KNIME and Sparkβ’10 minutes
2 assignmentsβ’Total 50 minutes
- Model Evaluationβ’20 minutes
- Model Evaluation in KNIME and Spark Quizβ’30 minutes
1 discussion promptβ’Total 10 minutes
- Model Interpretability vs. Accuracyβ’10 minutes
What's included
8 videos5 readings2 assignments2 discussion prompts
8 videosβ’Total 48 minutes
- Regression Overviewβ’4 minutes
- Linear Regressionβ’4 minutes
- Cluster Analysisβ’8 minutes
- k-Means Clusteringβ’8 minutes
- Association Analysisβ’5 minutes
- Association Analysis in Detailβ’9 minutes
- Machine Learning With Big Data - Final Remarksβ’1 minute
- Cluster Analysis in Sparkβ’8 minutes
5 readingsβ’Total 100 minutes
- Slides: Regressionβ’10 minutes
- Slides: Cluster Analysisβ’10 minutes
- Slides: Association Analysisβ’10 minutes
- Description of Minute Weather Datasetβ’10 minutes
- Cluster Analysis in Sparkβ’60 minutes
2 assignmentsβ’Total 50 minutes
- Regression, Cluster Analysis, & Association Analysisβ’30 minutes
- Cluster Analysis in Spark Quizβ’20 minutes
2 discussion promptsβ’Total 20 minutes
- Clustering Applicationsβ’10 minutes
- Applications of Association Analysisβ’10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Explore more from Machine Learning
- Status: Free Trial
Course
- Status: Free TrialE
Edureka
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
70.81%
- 4 stars
23.43%
- 3 stars
4.03%
- 2 stars
0.99%
- 1 star
0.71%
Showing 3 of 2505
Reviewed on Oct 30, 2018
Reasonable overview. The VM environment is a major challenge for my hardware. Takes more time to make it work than it should. I am wondering if a cloud solution e.g. GCP would be better.
Reviewed on Dec 26, 2019
The fact that the assignments are graded means that thereβs incentive to work on them, solve problems, and ask questions. Traditional online courses donβt offer that incentive.
Reviewed on Jul 30, 2020
I am so much pleased with this course. Very much convinced by the presentation, way of speech, and the script of the Instructor. I am so excited to learn more about Machine Learning.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
