Fine-Tuning and Evaluating Vision AI Models
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Fine-Tuning and Evaluating Vision AI Models
This course is part of Eyes on AI - Computer Vision Engineering Professional Certificate
Included with
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Apply transfer learning and learning-rate analysis to improve computer vision model accuracy
Evaluate model calibration, object detection metrics, and dataset annotation quality
Diagnose segmentation errors and refine model outputs using post-processing techniques
Skills you'll gain
- Data Quality
- Model Optimization
- Performance Metric
- Computer Vision
- Quality Assessment
- Image Analysis
- Statistical Machine Learning
- Applied Machine Learning
- Model Training
- Fine-tuning
- Transfer Learning
- Performance Measurement
- Quality Assurance
- Predictive Modeling
- Performance Analysis
- Model Evaluation
- Statistical Modeling
- Data Pipelines
Tools you'll learn
Details to know
March 2026
See how employees at top companies are mastering in-demand skills
Build your Machine Learning expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Coursera
There are 13 modules in this course
Building high-performing computer vision systems requires more than training a model—it requires careful evaluation, reliable predictions, and continuous refinement. In this course, you'll learn how to fine-tune and evaluate computer vision models used in real-world AI systems.
You'll begin by applying transfer learning techniques to improve model accuracy on domain-specific datasets and analyzing learning-rate schedules to understand training behavior. Next, you'll evaluate the calibration of classification models and apply post-hoc correction methods to improve prediction reliability. The course also explores data preparation and annotation practices for object detection. You'll analyze object-size distributions to configure anchor boxes and evaluate detector performance using standard metrics. Finally, you'll examine image segmentation models. You'll learn how to address class imbalance, analyze segmentation errors, and apply post-processing techniques to improve prediction quality. By the end of the course, you'll be able to evaluate, diagnose, and refine computer vision models across classification, detection, and segmentation tasks.
You’ll learn how to adapt a pre-trained ViT-B/16 model to a new domain using transfer learning. You’ll practice freezing and selectively unfreezing layers, explore how the model’s internal representations shift during fine-tuning, and document your choices in an experiment log. By the end, you’ll know how to unfreeze the final four transformer blocks, prepare your dataset effectively, and run a clean, reproducible training workflow that aligns with industry practice.
What's included
3 videos1 reading1 assignment
3 videos•Total 9 minutes
- Introduction and Welcome•3 minutes
- Why Transfer Learning Accelerates Vision Training•2 minutes
- Walkthrough: Unfreezing the Final Four Transformer Blocks in Keras•4 minutes
1 reading•Total 10 minutes
- How ViT-B/16 Learns Features and Why Layer Unfreezing Matters•10 minutes
1 assignment•Total 15 minutes
- Hands-On Activity: Fine-Tune ViT-B/16 for Retail Images and Log Experiment Decisions•15 minutes
You’ll explore how learning-rate schedules shape the trajectory of model training. You’ll compare cosine decay and the one-cycle policy, analyze their signatures in training curves, and choose the schedule that maximizes validation accuracy while reducing training time. By the end, you’ll be able to interpret LR curves, diagnose plateaus or instability, and make informed decisions about training efficiency.
What's included
2 videos1 reading2 assignments
2 videos•Total 3 minutes
- Why Learning-Rate Schedules Shape Convergence•2 minutes
- Visualizing LR Schedules & Training Curves in Keras•2 minutes
1 reading•Total 10 minutes
- Cosine versus One-Cycle Policies and Their Influence on Training•10 minutes
2 assignments•Total 35 minutes
- Graded Quiz: Optimize AI: Fine-Tune & Maximize Accuracy•20 minutes
- Hands-On Activity: Compare LR Schedules & Choose One That Improves Training Time•15 minutes
You’ll assess how well a model’s predicted probabilities match real outcomes using ECE and reliability diagrams. By the end, you’ll compute calibration metrics, diagnose over/under-confidence, and apply temperature scaling to improve trust in predictions.
What's included
3 videos1 reading1 assignment
3 videos•Total 9 minutes
- Introduction and Welcome•3 minutes
- Understanding Calibration: Metrics and Diagnostics•4 minutes
- Improving Calibration: Temperature Scaling in Practice•3 minutes
1 reading•Total 10 minutes
- How to Measure and Interpret Model Calibration•10 minutes
1 assignment•Total 15 minutes
- Hands-On Activity: Calibrate a Classification Model Using ECE and Temperature Scaling•15 minutes
You’ll design a serverless batch-inference workflow using AWS S3, Lambda, and DynamoDB. By the end, you will configure an end-to-end pipeline that runs a calibrated model, processes batch files, and stores predictions for analytics.
What's included
2 videos1 reading2 assignments
2 videos•Total 8 minutes
- Why Serverless Pipelines Matter for Scalable AI•4 minutes
- Common Pitfalls in Deploying ML Pipelines•4 minutes
1 reading•Total 10 minutes
- Designing Batch-Inference Workflows with AWS Lambda•10 minutes
2 assignments•Total 35 minutes
- Graded Quiz: Assess Financial Deals & Manage Risk•20 minutes
- Hands-On Activity: Deploy a Calibrated Batch-Inference Pipeline with AWS Lambda•15 minutes
You will walk through how annotation teams plan tasks, define rules, coach annotators, and measure dataset quality. You will practice reviewing examples, identifying inconsistencies, and applying a structured audit that produces a production-ready bounding-box dataset.
What's included
3 videos2 readings1 assignment
3 videos•Total 15 minutes
- Why Quality Annotation Shapes Model Accuracy•5 minutes
- Quality-Controlled Annotation: Rules and Edge Cases•5 minutes
- How Teams Run a CVAT Labeling Sprint•5 minutes
2 readings•Total 20 minutes
- Avoiding Common Bounding-Box Errors•10 minutes
- IoU Audits and Reviewer Checklists•10 minutes
1 assignment•Total 20 minutes
- Hands-On Activity: Audit and Correct 20 Bounding Boxes in a Mini Sprint•20 minutes
You will examine how bounding-box dimensions reveal object scales in a dataset. You will run clustering to generate three anchor sets and understand how these values shape model training and performance.
What's included
3 videos2 readings2 assignments
3 videos•Total 15 minutes
- Why Anchor Boxes Matter for Detection•4 minutes
- Understanding Box Dimensions and Object Scale•6 minutes
- Generate and Insert Anchors into YOLOv5 Config•5 minutes
2 readings•Total 20 minutes
- k-Means Clustering for Bounding-Box Dimensions•10 minutes
- Visualizing Anchor Fit and Diagnosing Mismatch•10 minutes
2 assignments•Total 35 minutes
- Graded Quiz: Bounding-Box Quality and Anchor Selection Check•20 minutes
- Hands-On Activity: Run k-Means and Propose Three Anchors•15 minutes
You will explore why evaluation metrics matter, what mAP represents, and how metric breakdowns guide improvement decisions. You will connect evaluation to real deployment KPIs, such as accuracy targets and latency constraints.
What's included
3 videos2 readings1 assignment
3 videos•Total 8 minutes
- Introduction and Welcome•3 minutes
- Why Evaluation Comes First in Real-Time Detection•3 minutes
- Interpreting mAP: What To Look For in Real Projects•2 minutes
2 readings•Total 20 minutes
- Core Detection Metrics: mAP, APsmall, Precision, Recall•10 minutes
- Diagnosing Low AP on Small Objects•10 minutes
1 assignment•Total 20 minutes
- Hands-On Activity: Compute mAP from Provided COCO-Format Predictions•20 minutes
You will explore the components of real-time detection, including model selection, preprocessing, inference optimization, tracking, and system-level constraints. You will evaluate trade-offs such as accuracy vs. speed, batch size vs. latency, and resolution vs. FPS.
What's included
3 videos2 readings2 assignments
3 videos•Total 10 minutes
- Choosing the Right Model for Real-Time Requirements•3 minutes
- Tracker Basics: DeepSORT, BYTETrack, OC-SORT•2 minutes
- Integrating YOLOv8 with DeepSORT in OpenCV•5 minutes
2 readings•Total 20 minutes
- Where Latency Comes From: IO, Inference, NMS, and Tracking•10 minutes
- Benchmarking FPS and Latency on Embedded Device•10 minutes
2 assignments•Total 40 minutes
- Graded Quiz: Build & Evaluate Real-Time Object Detectors•20 minutes
- Hands-On Activity: Build a YOLOv8 + DeepSORT Pipeline Loop•20 minutes
You will explore why class imbalance disrupts training and practice applying class-balancing strategies, including focal-dice hybrid loss, weighting, and sampling. You will work through a realistic low-foreground medical dataset scenario and monitor recall after 15 epochs.
What's included
3 videos1 reading1 assignment
3 videos•Total 16 minutes
- Welcome and Overview•4 minutes
- Why Imbalance Breaks Segmentation Models•8 minutes
- Implementing Focal-Dice Hybrid Loss•4 minutes
1 reading•Total 10 minutes
- Class-Balancing Options for Segmentation•10 minutes
1 assignment•Total 15 minutes
- Hands-On Activity: Apply Hybrid Loss and Inspect Recall at 15 Epochs•15 minutes
You will quantify segmentation errors that arise in real deployments. Using skimage.measure, you will evaluate predicted masks and identify issues such as over-segmentation of elongated objects. You will write error logs that highlight recurring patterns.
What's included
2 videos1 reading2 assignments
2 videos•Total 11 minutes
- Why We Analyze Beyond IoU•5 minutes
- Region Properties With skimage.measure•6 minutes
1 reading•Total 10 minutes
- Common Systematic Mask Errors•10 minutes
2 assignments•Total 35 minutes
- Graded Quiz: Balance and Analyze Image Segmentation•20 minutes
- Hands-On Activity: Diagnose Over-Segmentation Using Region Stats•15 minutes
You will learn how to evaluate segmentation results using metrics and visualizations. We explore IoU, Dice, class-wise breakdowns, and overlay inspections that reveal where and why your model struggles. You’ll practice generating and interpreting these outputs, just like teams diagnosing performance before deploying a model.
What's included
3 videos1 reading2 assignments
3 videos•Total 11 minutes
- Welcome and Why Segmentation Evaluation Matters•3 minutes
- Understanding IoU, Dice, and Class-Wise Metrics•4 minutes
- Heat Maps in Action: Seeing Class Performance•4 minutes
1 reading•Total 10 minutes
- How to Read Segmentation Outputs Like a Practitioner•10 minutes
2 assignments•Total 15 minutes
- Hands-On Activity: Build Your First Class-Wise IoU Table and Heat Map•10 minutes
- Practice Quiz: Segmentation Metrics & Diagnostics•5 minutes
You will design and test a lightweight refinement pipeline that improves segmentation quality. You will also explore CRFs, boundary smoothing, hole-filling, morphological filters, and noise cleanup. You will build a pipeline and measure before-and-after improvements.
What's included
3 videos1 reading3 assignments
3 videos•Total 13 minutes
- Why Post-Processing Is a Key Part of CV Pipelines•4 minutes
- Smoothing, Filtering, and Boundary Refinement Techniques•5 minutes
- Building a Step-by-Step Refinement Workflow•4 minutes
1 reading•Total 10 minutes
- How CRFs Add Structure: A Simple Guide•10 minutes
3 assignments•Total 50 minutes
- Graded Quiz: Evaluate and Refine a Segmentation Model•30 minutes
- Hands-On Activity: Add a CRF Refiner and Measure Improvements•15 minutes
- Practice Quiz: Refinement & CRF Improvements•5 minutes
Modern vision systems often combine multiple model components such as classification, object detection, and segmentation. Preparing these systems for production requires more than training individual models. Engineers must evaluate fine-tuning strategies, analyze model confidence behavior, assess detection performance against operational KPIs, and diagnose segmentation errors that may affect reliability. In this project, you will act as a computer vision engineer responsible for evaluating a multi-task vision system before deployment. You will analyze fine-tuning decisions, examine model calibration reliability, interpret detection metrics, diagnose segmentation weaknesses, and assess dataset quality before approving deployment readiness. The project integrates several core evaluation activities used in real-world vision engineering workflows. You will interpret training behavior to assess transfer learning strategies, analyze calibration metrics to improve prediction reliability, evaluate detection performance using task-specific KPIs, and diagnose segmentation errors through metric analysis and qualitative inspection. Rather than optimizing a single component, the project requires you to assess the entire vision pipeline and recommend coordinated improvements across tasks. Your final deliverable will be a Vision Model Evaluation & Refinement Report, a structured technical analysis that identifies weaknesses, prioritizes corrective actions, and justifies engineering decisions across classification, detection, and segmentation modules. This project mirrors real-world responsibilities of computer vision engineers who must evaluate multiple model components simultaneously and communicate a clear production-readiness recommendation to engineering and product stakeholders.
What's included
2 readings1 assignment
2 readings•Total 10 minutes
- Why This Project Matters•5 minutes
- Project Requirements•5 minutes
1 assignment•Total 60 minutes
- Vision Model Evaluation & Refinement Report•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Explore more from Machine Learning
- Status: Free TrialC
Coursera
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Frequently asked questions
This course is designed for learners with prior machine learning knowledge. Familiarity with neural networks and computer vision concepts will help you follow the evaluation and optimization techniques.
You'll learn how to fine-tune models, evaluate prediction reliability, analyze object detection performance, and diagnose segmentation errors to improve real-world vision systems.
The course focuses on evaluation and refinement tasks commonly performed by machine learning engineers, including model calibration, performance analysis, dataset quality checks, and system-level optimization.
More questions
Financial aid available,
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
