VOOZH about

URL: https://www.coursera.org/learn/training-evaluating-and-monitoring-machine-learning-models

⇱ Training, Evaluating, and Monitoring Machine Learning Models | Coursera


Training, Evaluating, and Monitoring Machine Learning Models

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Training, Evaluating, and Monitoring Machine Learning Models

Included with

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

9 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

9 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Train machine learning models and analyze training dynamics using logs and loss curves

  • Evaluate model performance using metrics, confusion matrices, and statistical analysis

  • Design monitoring strategies to detect model drift and maintain model reliability

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

March 2026

Assessments

10 assignments¹

AI Graded see disclaimer
Taught in English

Build your subject-matter expertise

This course is part of the Machine Learning Made Easy for Software Engineers Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 10 modules in this course

Building machine learning models is only the first step. To create reliable ML systems, engineers must evaluate model performance, diagnose prediction errors, and monitor deployed models over time. In this course, you'll learn how to train, evaluate, and monitor machine learning models using practical engineering techniques.

You’ll begin by exploring model training strategies that improve convergence and performance. You’ll analyze training logs, loss curves, and class imbalance effects to understand how models learn and where they struggle. Next, you’ll learn how to evaluate machine learning models using appropriate performance metrics. You’ll analyze confusion matrices and residual patterns to identify systematic prediction errors and assess the statistical significance of model improvements. Finally, you’ll focus on monitoring machine learning models in production environments. You’ll apply validation techniques, analyze A/B testing results, and monitor model behavior over time to detect performance drift and trigger retraining workflows. Through a hands-on project, you'll design a model evaluation and monitoring framework that helps ensure machine learning systems remain accurate and reliable after deployment.

You will apply batch and mini-batch training procedures to optimize model convergence.

What's included

3 videos1 reading1 assignment

3 videosTotal 13 minutes
  • Introduction and Welcome4 minutes
  • Why Mini-Batches Improve Training Stability5 minutes
  • How Schedulers Influence Convergence4 minutes
1 readingTotal 6 minutes
  • Batch vs Mini-Batch: What Changes in Practice6 minutes
1 assignmentTotal 15 minutes
  • Hands-On Activity: Train a PyTorch Model with Mini-Batches and Scheduler15 minutes

You will analyze training logs and loss curves to diagnose common model training issues.

What's included

2 videos1 reading1 ungraded lab

2 videosTotal 5 minutes
  • Reading Loss Curves Like an Analyst3 minutes
  • Spotting Instability Using Training Logs2 minutes
1 readingTotal 6 minutes
  • Common Training Issues and How Logs Reveal Them6 minutes
1 ungraded labTotal 60 minutes
  • Fix Overfitting by Analyzing Divergence Patterns60 minutes

You will evaluate the impact of class-imbalance techniques on model performance.

What's included

1 video1 reading2 assignments

1 videoTotal 3 minutes
  • Choosing Class-Imbalance Methods with Confidence3 minutes
1 readingTotal 7 minutes
  • How Balanced Data Shapes Your Model’s F1 Score7 minutes
2 assignmentsTotal 37 minutes
  • Graded Quiz: Assessing Training, Diagnostics, and Imbalance Methods25 minutes
  • Hands-On Activity: Compare F1 Scores Using Class-Weights and SMOTE12 minutes

You will apply appropriate performance metrics to evaluate machine learning models.

What's included

2 videos1 reading1 assignment

2 videosTotal 10 minutes
  • Why Metrics Matter in Model Evaluation?4 minutes
  • RMSE vs. MAE for Regression Models6 minutes
1 readingTotal 10 minutes
  • Reflecting on Model Performance Metrics 10 minutes
1 assignmentTotal 15 minutes
  • Hands-On Activity: Metric Matching Exercise15 minutes

You will analyze confusion matrices and residual plots to identify systematic model prediction errors.

What's included

2 videos1 reading1 assignment

2 videosTotal 9 minutes
  • Looking Inside the Confusion Matrix5 minutes
  • Residual Plots for Regression Diagnostics4 minutes
1 readingTotal 10 minutes
  • Diagnosing Systematic Model Errors with Confusion Matrices and Residual Plots 10 minutes
1 assignmentTotal 15 minutes
  • Hands-On Activity: Spam Filter Failure Analysis15 minutes

You will evaluate the statistical significance of differences in metrics.

What's included

2 videos1 reading1 assignment1 ungraded lab

2 videosTotal 10 minutes
  • Why Statistical Significance Matters in Model Comparison4 minutes
  • Bootstrapping Metrics Step by Step6 minutes
1 readingTotal 10 minutes
  • Evaluating Statistical Significance in Automated Model Monitoring 10 minutes
1 assignmentTotal 20 minutes
  • Graded Quiz: Interpreting Metrics and Model Improvements20 minutes
1 ungraded labTotal 60 minutes
  • End-to-End Model Evaluation Practice60 minutes

You will apply validation techniques to assess model performance on unseen data.

What's included

2 videos1 reading1 assignment

2 videosTotal 6 minutes
  • Why Validation Is a Release Gate3 minutes
  • Hold-Out Sets and Evaluation Metrics in Practice3 minutes
1 readingTotal 10 minutes
  • Designing a Validation Checklist for Release Candidates10 minutes
1 assignmentTotal 15 minutes
  • Hands-On Activity: Validate a Release Candidate Model15 minutes

You will analyze A/B test or shadow deployment results to compare new model performance against a baseline.

What's included

2 videos1 reading1 assignment

2 videosTotal 8 minutes
  • From Offline Metrics to Online Impact4 minutes
  • A/B Tests vs. Shadow Deployments Explained4 minutes
1 readingTotal 10 minutes
  • Comparing Models Using A/B Testing and Shadow Deployments 10 minutes
1 assignmentTotal 15 minutes
  • Hands-On Activity: Analyze Shadow Deployment Results15 minutes

You will evaluate model-drift indicators to trigger retraining workflows.

What's included

2 videos1 reading1 assignment1 ungraded lab

2 videosTotal 8 minutes
  • Why Models Drift in Production4 minutes
  • Using PSI for Ongoing Monitoring4 minutes
1 readingTotal 10 minutes
  • Automating Monitoring and Retraining Triggers10 minutes
1 assignmentTotal 20 minutes
  • Graded Quiz: Validate, Analyze, and Monitor ML Models20 minutes
1 ungraded labTotal 60 minutes
  • Build a Drift Monitoring Workflow60 minutes

In this project, you will design and implement a machine learning model evaluation and monitoring framework for a production system. A technology company has deployed a recommendation model that predicts user engagement with content, but its performance has become inconsistent due to potential data drift and evolving user behavior. Your task is to build an evaluation pipeline that compares model versions, analyzes prediction errors, and monitors performance stability over time. You will train baseline and improved models, analyze training logs and loss curves to verify convergence, evaluate class-imbalance handling techniques to ensure fair evaluation across classes, evaluate them using appropriate metrics, analyze errors with confusion matrices and residual plots, perform statistical comparisons, simulate monitoring scenarios such as A/B testing or shadow deployments, calculate drift indicators like Population Stability Index (PSI), and define conditions for model retraining. The final deliverable is a modular Python evaluation framework along with a written engineering explanation demonstrating how evaluation insights support reliable model deployment decisions.

What's included

2 readings1 assignment

2 readingsTotal 12 minutes
  • Why Model Evaluation and Monitoring Matter in Production ML Systems 6 minutes
  • Project Requirements6 minutes
1 assignmentTotal 70 minutes
  • End-to-End Model Evaluation & Monitoring Framework 70 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Explore more from Machine Learning

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

This course is designed for learners with some experience in programming and machine learning. It focuses on techniques used to evaluate and maintain ML models in real-world systems.

You'll learn how to use performance metrics, confusion matrices, residual analysis, and statistical evaluation techniques to assess model performance and diagnose prediction errors.

Models can degrade over time as data changes. Monitoring helps detect issues such as model drift or performance drops so teams can retrain or update models before problems affect users.

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.