Advanced Computer Vision with TensorFlow
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Advanced Computer Vision with TensorFlow
This course is part of TensorFlow: Advanced Techniques Specialization
Instructors: Laurence Moroney
47,521 already enrolled
Ask Coursera
533 reviews
Recommended experience
533 reviews
Recommended experience
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
In this course, you will:
a) Explore image classification, image segmentation, object localization, and object detection. Apply transfer learning to object localization and detection. b) Apply object detection models such as regional-CNN and ResNet-50, customize existing models, and build your own models to detect, localize, and label your own rubber duck images. c) Implement image segmentation using variations of the fully convolutional network (FCN) including U-Net and d) Mask-RCNN to identify and detect numbers, pets, zombies, and more. d) Identify which parts of an image are being used by your model to make its predictions using class activation maps and saliency maps and apply these ML interpretation methods to inspect and improve the design of a famous network, AlexNet. The DeepLearning.AI TensorFlow: Advanced Techniques Specialization introduces the features of TensorFlow that provide learners with more control over their model architecture and tools that help them create and train advanced ML models. This Specialization is for early and mid-career software and machine learning engineers with a foundational understanding of TensorFlow who are looking to expand their knowledge and skill set by learning advanced TensorFlow features to build powerful models.
Get a conceptual overview of image classification, object localization, object detection, and image segmentation. Also be able to describe multi-label classification, and distinguish between semantic segmentation and instance segmentation. In the rest of this course, you will apply TensorFlow to build object detection and image segmentation models.
What's included
10 videos4 readings1 assignment1 programming assignment3 ungraded labs
10 videos•Total 41 minutes
- Welcome to Course 3•7 minutes
- Classification and Object Detection Intro•5 minutes
- Segmentation Intro•3 minutes
- Why Transfer Learning?•5 minutes
- What is Transfer Learning?•4 minutes
- Options in Transfer Learning•3 minutes
- Transfer Learning with ResNet50•4 minutes
- ResNet50 in code•5 minutes
- Network architecture for Object Localization•4 minutes
- Evaluating Object Localization•2 minutes
4 readings•Total 15 minutes
- Welcome to the course!•2 minutes
- Prerequisite & References•10 minutes
- Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!•2 minutes
- Lecture Notes Week 1•1 minute
1 assignment•Total 30 minutes
- Introduction and Concepts of Computer Vision•30 minutes
1 programming assignment•Total 60 minutes
- Bird Boxes•60 minutes
3 ungraded labs•Total 105 minutes
- Transfer Learning•30 minutes
- Transfer Learning with ResNet 50•30 minutes
- Image Classification and Object Localization•45 minutes
This week, you’ll get an overview of some popular object detection models, such as regional-CNN and ResNet-50. You’ll use object detection models that you’ll retrieve from TensorFlow Hub, download your own models and configure them for training, and also build your own models for object detection. By using transfer learning, you will train a model to detect and localize rubber duckies using just five training examples. You’ll also get to manually label your own rubber ducky images!
What's included
12 videos8 readings1 assignment1 programming assignment2 ungraded labs
12 videos•Total 45 minutes
- Object Detection and Sliding Windows•6 minutes
- R-CNN•4 minutes
- Fast R-CNN•4 minutes
- Faster R-CNN•2 minutes
- Getting the Model from TensorFlow Hub•2 minutes
- Running the Model on an Image•3 minutes
- Installation and overview of APIs•4 minutes
- Visualization with APIs•4 minutes
- Loading a RetinaNet Model•5 minutes
- Loading Weights•4 minutes
- Data Prep and Training Overview•3 minutes
- Custom Training Loop Code•5 minutes
8 readings•Total 111 minutes
- References: Amazon Rekognition, PowerAI & DIGITS•10 minutes
- Reference: R-CNN, Fast R-CNN •10 minutes
- Reference: TensorFlow Hub•10 minutes
- Read about the Object Detection API•10 minutes
- Use the Object Detection API•30 minutes
- Reference: RetinaNet, Model Garden•10 minutes
- Eager Few Shot Object Detection•30 minutes
- Lecture Notes Week 2•1 minute
1 assignment•Total 30 minutes
- Object Detection•30 minutes
1 programming assignment•Total 60 minutes
- Zombie Detector•60 minutes
2 ungraded labs•Total 60 minutes
- Implement Simple Object Detection•30 minutes
- Predicting Bounding Boxes for Object Detection•30 minutes
This week is all about image segmentation using variations of the fully convolutional neural network. With these networks, you can assign class labels to each pixel, and perform much more detailed identification of objects compared to bounding boxes. You’ll build the fully convolutional neural network, U-Net, and Mask R-CNN this week to identify and detect numbers, pets, and even zombies!
What's included
11 videos4 readings1 assignment1 programming assignment3 ungraded labs
11 videos•Total 45 minutes
- Image Segmentation Overview•5 minutes
- Popular Image Segmentation Architectures•4 minutes
- FCN Architecture Details•6 minutes
- Upsampling Methods•3 minutes
- Encoder in Code•3 minutes
- Decoder in Code•4 minutes
- Evaluation with IoU and Dice Score•4 minutes
- U-Net Overview•5 minutes
- U-Net Code: Encoder•4 minutes
- U-Net Code: Decoder•3 minutes
- Instance Segmentation•3 minutes
4 readings•Total 31 minutes
- References: FCN•10 minutes
- Reference: CamVid •10 minutes
- Reference: U-Net•10 minutes
- Lecture Notes Week 3•1 minute
1 assignment•Total 30 minutes
- Image Segmentation•30 minutes
1 programming assignment•Total 60 minutes
- Image Segmentation of Handwritten Digits•60 minutes
3 ungraded labs•Total 120 minutes
- Implement a Fully Convolutional Neural Network•45 minutes
- Implement a UNet•45 minutes
- Instance Segmentation Demo•30 minutes
This week, you’ll learn about the importance of model interpretability, which is the understanding of how your model arrives at its decisions. You’ll also implement class activation maps, saliency maps, and gradient-weighted class activation maps to identify which parts of an image are being used by your model to make its predictions. You’ll also see an example of how visualizing a model’s intermediate layer activations can help to improve the design of a famous network, AlexNet.
What's included
6 videos6 readings1 assignment1 programming assignment4 ungraded labs
6 videos•Total 30 minutes
- Why Interpretation Matters?•6 minutes
- Class Activation Maps•4 minutes
- Fashion MNIST Class Activation Map code walkthrough•5 minutes
- Saliency•5 minutes
- GradCAM•5 minutes
- ZFNet•5 minutes
6 readings•Total 43 minutes
- Reference: GradCam•10 minutes
- Reference: ZFNet•10 minutes
- Lecture Notes Week 4•1 minute
- [IMPORTANT] Reminder about end of access to Lab Notebooks•2 minutes
- References •10 minutes
- Acknowledgments•10 minutes
1 assignment•Total 30 minutes
- Visualization and Interpretation•30 minutes
1 programming assignment•Total 60 minutes
- Cats vs Dogs Saliency Maps•60 minutes
4 ungraded labs•Total 135 minutes
- Class Activation Maps with Fashion MNIST (Lab #1)•45 minutes
- Class Activation Maps "Cats vs Dogs" (Lab #2)•30 minutes
- Saliency Maps (Lab #3)•30 minutes
- GradCAM (Lab #4)•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Offered by
Explore more from Machine Learning
- Status: Free TrialD
DeepLearning.AI
Course
- Status: Free TrialD
DeepLearning.AI
Course
- Status: Free TrialD
DeepLearning.AI
Course
- G
Google Cloud
Project
Why people choose Coursera for their career
Learner reviews
- 5 stars
81.42%
- 4 stars
13.13%
- 3 stars
4.12%
- 2 stars
0.18%
- 1 star
1.12%
Showing 3 of 533
Reviewed on Oct 26, 2022
course content was very informative.Learned the concepts with practical experience.Great Learning!!!!
Reviewed on Jul 14, 2021
This class was probably the most challenging so far, but I learned some valuable deep learning techniques.
Reviewed on Apr 17, 2021
One of the finest in depth course on computer vision. So much helpful if anyone wishes to dive into application oriented tasks of computer vision. Very much helpful for research also.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
More questions
Financial aid available,
