Advanced Computer Vision with TensorFlow

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

👁 DeepLearning.AI

Advanced Computer Vision with TensorFlow

This course is part of TensorFlow: Advanced Techniques Specialization

👁 Laurence Moroney

👁 Eddy Shyu

Instructors: Laurence Moroney

47,521 already enrolled

Ask Coursera

4 modules

Gain insight into a topic and learn the fundamentals.

4.7

533 reviews

Intermediate level

Recommended experience

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

94%

Most learners liked this course

4 modules

Gain insight into a topic and learn the fundamentals.

4.7

533 reviews

Intermediate level

Recommended experience

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

94%

Most learners liked this course

Skills you'll gain

Tools you'll learn

Details to know

👁 Image

Shareable certificate

Add to your LinkedIn profile

Assessments

4 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

👁 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the TensorFlow: Advanced Techniques Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

👁 Image

There are 4 modules in this course

In this course, you will:

a) Explore image classification, image segmentation, object localization, and object detection. Apply transfer learning to object localization and detection. b) Apply object detection models such as regional-CNN and ResNet-50, customize existing models, and build your own models to detect, localize, and label your own rubber duck images. c) Implement image segmentation using variations of the fully convolutional network (FCN) including U-Net and d) Mask-RCNN to identify and detect numbers, pets, zombies, and more. d) Identify which parts of an image are being used by your model to make its predictions using class activation maps and saliency maps and apply these ML interpretation methods to inspect and improve the design of a famous network, AlexNet. The DeepLearning.AI TensorFlow: Advanced Techniques Specialization introduces the features of TensorFlow that provide learners with more control over their model architecture and tools that help them create and train advanced ML models. This Specialization is for early and mid-career software and machine learning engineers with a foundational understanding of TensorFlow who are looking to expand their knowledge and skill set by learning advanced TensorFlow features to build powerful models.

Get a conceptual overview of image classification, object localization, object detection, and image segmentation. Also be able to describe multi-label classification, and distinguish between semantic segmentation and instance segmentation. In the rest of this course, you will apply TensorFlow to build object detection and image segmentation models.

What's included

10 videos4 readings1 assignment1 programming assignment3 ungraded labs

10 videos•Total 41 minutes

Welcome to Course 3•7 minutes
Classification and Object Detection Intro•5 minutes
Segmentation Intro•3 minutes
Why Transfer Learning?•5 minutes
What is Transfer Learning?•4 minutes
Options in Transfer Learning•3 minutes
Transfer Learning with ResNet50•4 minutes
ResNet50 in code•5 minutes
Network architecture for Object Localization•4 minutes
Evaluating Object Localization•2 minutes

4 readings•Total 15 minutes

Welcome to the course!•2 minutes
Prerequisite & References•10 minutes
Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!•2 minutes
Lecture Notes Week 1•1 minute

1 assignment•Total 30 minutes

Introduction and Concepts of Computer Vision•30 minutes

1 programming assignment•Total 60 minutes

Bird Boxes•60 minutes

3 ungraded labs•Total 105 minutes

Transfer Learning•30 minutes
Transfer Learning with ResNet 50•30 minutes
Image Classification and Object Localization•45 minutes

This week, you’ll get an overview of some popular object detection models, such as regional-CNN and ResNet-50. You’ll use object detection models that you’ll retrieve from TensorFlow Hub, download your own models and configure them for training, and also build your own models for object detection. By using transfer learning, you will train a model to detect and localize rubber duckies using just five training examples. You’ll also get to manually label your own rubber ducky images!

What's included

12 videos8 readings1 assignment1 programming assignment2 ungraded labs

12 videos•Total 45 minutes

Object Detection and Sliding Windows•6 minutes
R-CNN•4 minutes
Fast R-CNN•4 minutes
Faster R-CNN•2 minutes
Getting the Model from TensorFlow Hub•2 minutes
Running the Model on an Image•3 minutes
Installation and overview of APIs•4 minutes
Visualization with APIs•4 minutes
Loading a RetinaNet Model•5 minutes
Loading Weights•4 minutes
Data Prep and Training Overview•3 minutes
Custom Training Loop Code•5 minutes

8 readings•Total 111 minutes

References: Amazon Rekognition, PowerAI & DIGITS•10 minutes
Reference: R-CNN, Fast R-CNN •10 minutes
Reference: TensorFlow Hub•10 minutes
Read about the Object Detection API•10 minutes
Use the Object Detection API•30 minutes
Reference: RetinaNet, Model Garden•10 minutes
Eager Few Shot Object Detection•30 minutes
Lecture Notes Week 2•1 minute

1 assignment•Total 30 minutes

Object Detection•30 minutes

1 programming assignment•Total 60 minutes

Zombie Detector•60 minutes

2 ungraded labs•Total 60 minutes

Implement Simple Object Detection•30 minutes
Predicting Bounding Boxes for Object Detection•30 minutes

This week is all about image segmentation using variations of the fully convolutional neural network. With these networks, you can assign class labels to each pixel, and perform much more detailed identification of objects compared to bounding boxes. You’ll build the fully convolutional neural network, U-Net, and Mask R-CNN this week to identify and detect numbers, pets, and even zombies!

What's included

11 videos4 readings1 assignment1 programming assignment3 ungraded labs

11 videos•Total 45 minutes

Image Segmentation Overview•5 minutes
Popular Image Segmentation Architectures•4 minutes
FCN Architecture Details•6 minutes
Upsampling Methods•3 minutes
Encoder in Code•3 minutes
Decoder in Code•4 minutes
Evaluation with IoU and Dice Score•4 minutes
U-Net Overview•5 minutes
U-Net Code: Encoder•4 minutes
U-Net Code: Decoder•3 minutes
Instance Segmentation•3 minutes

4 readings•Total 31 minutes

References: FCN•10 minutes
Reference: CamVid •10 minutes
Reference: U-Net•10 minutes
Lecture Notes Week 3•1 minute

1 assignment•Total 30 minutes

Image Segmentation•30 minutes

1 programming assignment•Total 60 minutes

Image Segmentation of Handwritten Digits•60 minutes

3 ungraded labs•Total 120 minutes

Implement a Fully Convolutional Neural Network•45 minutes
Implement a UNet•45 minutes
Instance Segmentation Demo•30 minutes

This week, you’ll learn about the importance of model interpretability, which is the understanding of how your model arrives at its decisions. You’ll also implement class activation maps, saliency maps, and gradient-weighted class activation maps to identify which parts of an image are being used by your model to make its predictions. You’ll also see an example of how visualizing a model’s intermediate layer activations can help to improve the design of a famous network, AlexNet.

What's included

6 videos6 readings1 assignment1 programming assignment4 ungraded labs

6 videos•Total 30 minutes

Why Interpretation Matters?•6 minutes
Class Activation Maps•4 minutes
Fashion MNIST Class Activation Map code walkthrough•5 minutes
Saliency•5 minutes
GradCAM•5 minutes
ZFNet•5 minutes

6 readings•Total 43 minutes

Reference: GradCam•10 minutes
Reference: ZFNet•10 minutes
Lecture Notes Week 4•1 minute
[IMPORTANT] Reminder about end of access to Lab Notebooks•2 minutes
References •10 minutes
Acknowledgments•10 minutes

1 assignment•Total 30 minutes

Visualization and Interpretation•30 minutes

1 programming assignment•Total 60 minutes

Cats vs Dogs Saliency Maps•60 minutes

4 ungraded labs•Total 135 minutes

Class Activation Maps with Fashion MNIST (Lab #1)•45 minutes
Class Activation Maps "Cats vs Dogs" (Lab #2)•30 minutes
Saliency Maps (Lab #3)•30 minutes
GradCAM (Lab #4)•30 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings

4.8 (135 ratings)

👁 Laurence Moroney

Laurence Moroney

DeepLearning.AI

22 Courses•605,060 learners

Offered by

👁 Image

DeepLearning.AI

Explore more from Machine Learning

👁 Image
Status: Free Trial
D
DeepLearning.AI
Generative Deep Learning with TensorFlow
Course
👁 Image
Status: Free Trial
D
DeepLearning.AI
Convolutional Neural Networks in TensorFlow
Course
👁 Image
Status: Free Trial
D
DeepLearning.AI
Convolutional Neural Networks
Course
👁 Image
G
Google Cloud
Introduction to Computer Vision with TensorFlow
Project

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

👁 Image

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

👁 Image

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
81.42%
4 stars
13.13%
3 stars
4.12%
2 stars
0.18%
1 star
1.12%

Showing 3 of 533

Reviewed on Oct 26, 2022

course content was very informative.Learned the concepts with practical experience.Great Learning!!!!

Reviewed on Jul 14, 2021

This class was probably the most challenging so far, but I learned some valuable deep learning techniques.

Reviewed on Apr 17, 2021

One of the finest in depth course on computer vision. So much helpful if anyone wishes to dive into application oriented tasks of computer vision. Very much helpful for research also.

View more reviews

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

URL: https://www.coursera.org/learn/advanced-computer-vision-with-tensorflow