Visual Perception for Self-Driving Cars

Ends soon! Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

👁 University of Toronto

Visual Perception for Self-Driving Cars

This course is part of Self-Driving Cars Specialization

👁 Steven Waslander

👁 Jonathan Kelly

Instructors: Steven Waslander

45,811 already enrolled

Included with

•

Learn more

7 modules

Gain insight into a topic and learn the fundamentals.

4.7

587 reviews

Advanced level

Recommended experience

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

7 modules

Gain insight into a topic and learn the fundamentals.

4.7

587 reviews

Advanced level

Recommended experience

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

What you'll learn

Work with the pinhole camera model, and perform intrinsic and extrinsic camera calibration
Detect, describe and match image features and design your own convolutional neural networks
Apply these methods to visual odometry, object detection and tracking
Apply semantic segmentation for drivable surface estimation

Skills you'll gain

Details to know

👁 Image

Shareable certificate

Add to your LinkedIn profile

Assessments

4 assignments

Taught in English

95%

Most learners liked this course

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

👁 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Self-Driving Cars Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

👁 Image

There are 7 modules in this course

Welcome to Visual Perception for Self-Driving Cars, the third course in University of Toronto’s Self-Driving Cars Specialization.

This course will introduce you to the main perception tasks in autonomous driving, static and dynamic object detection, and will survey common computer vision methods for robotic perception. By the end of this course, you will be able to work with the pinhole camera model, perform intrinsic and extrinsic camera calibration, detect, describe and match image features and design your own convolutional neural networks. You'll apply these methods to visual odometry, object detection and tracking, and semantic segmentation for drivable surface estimation. These techniques represent the main building blocks of the perception system for self-driving cars. For the final project in this course, you will develop algorithms that identify bounding boxes for objects in the scene, and define the boundaries of the drivable surface. You'll work with synthetic and real image data, and evaluate your performance on a realistic dataset. This is an advanced course, intended for learners with a background in computer vision and deep learning. To succeed in this course, you should have programming experience in Python 3.0, and familiarity with Linear Algebra (matrices, vectors, matrix multiplication, rank, Eigenvalues and vectors and inverses).

This module introduces the main concepts from the broad and exciting field of computer vision needed to progress through perception methods for self-driving vehicles. The main components include camera models and their calibration, monocular and stereo vision, projective geometry, and convolution operations.

What's included

4 videos4 readings1 discussion prompt

4 videos•Total 18 minutes

Welcome to the Self-Driving Cars Specialization!•6 minutes
Welcome to the course•5 minutes
Meet the Instructor, Steven Waslander•6 minutes
Meet the Instructor, Jonathan Kelly•2 minutes

4 readings•Total 60 minutes

Course Prerequisites•15 minutes
How to Use Discussion Forums•15 minutes
How to Use Supplementary Readings in This Course•15 minutes
Recommended Textbooks•15 minutes

1 discussion prompt•Total 30 minutes

Get to Know Your Classmates•30 minutes

This module introduces the main concepts from the broad field of computer vision needed to progress through perception methods for self-driving vehicles. The main components include camera models and their calibration, monocular and stereo vision, projective geometry, and convolution operations.

What's included

6 videos4 readings1 assignment1 programming assignment2 ungraded labs

6 videos•Total 43 minutes

Lesson 1 Part 1: The Camera Sensor•7 minutes
Lesson 1 Part 2: Camera Projective Geometry•8 minutes
Lesson 2: Camera Calibration•7 minutes
Lesson 3 Part 1: Visual Depth Perception - Stereopsis•8 minutes
Lesson 3 Part 2: Visual Depth Perception - Computing the Disparity•6 minutes
Lesson 4: Image Filtering•7 minutes

4 readings•Total 90 minutes

Supplementary Reading: The Camera Sensor•30 minutes
Supplementary Reading: Camera Calibration•15 minutes
Supplementary Reading: Visual Depth Perception•30 minutes
Supplementary Reading: Image Filtering•15 minutes

1 assignment•Total 30 minutes

Module 1 Graded Quiz•30 minutes

1 programming assignment•Total 90 minutes

(Submission) Applying Stereo Depth to a Driving Scenario•90 minutes

2 ungraded labs•Total 180 minutes

Practice Assignment: Applying Stereo Depth to a Driving Scenario•120 minutes
(Solution) Applying Stereo Depth to a Driving Scenario•60 minutes

Visual features are used to track motion through an environment and to recognize places in a map. This module describes how features can be detected and tracked through a sequence of images and fused with other sources for localization as described in Course 2. Feature extraction is also fundamental to object detection and semantic segmentation in deep networks, and this module introduces some of the feature detection methods employed in that context as well.

What's included

6 videos5 readings1 programming assignment1 ungraded lab

6 videos•Total 44 minutes

Lesson 1: Introduction to Image features and Feature Detectors•7 minutes
Lesson 2: Feature Descriptors•7 minutes
Lesson 3 Part 1: Feature Matching•7 minutes
Lesson 3 Part 2: Feature Matching: Handling Ambiguity in Matching•5 minutes
Lesson 4: Outlier Rejection•8 minutes
Lesson 5: Visual Odometry•10 minutes

5 readings•Total 85 minutes

Supplementary Reading: Feature Detectors and Descriptors•30 minutes
Supplementary Reading: Feature Matching•15 minutes
Supplementary Reading: Feature Matching•15 minutes
Supplementary Reading: Outlier Rejection•15 minutes
Supplementary Reading: Visual Odometry•10 minutes

1 programming assignment•Total 150 minutes

Visual Odometry for Localization in Autonomous Driving•150 minutes

1 ungraded lab•Total 150 minutes

Visual Odometry for Localization in Autonomous Driving•150 minutes

Deep learning is a core enabling technology for self-driving perception. This module briefly introduces the core concepts employed in modern convolutional neural networks, with an emphasis on methods that have been proven to be effective for tasks such as object detection and semantic segmentation. Basic network architectures, common components and helpful tools for constructing and training networks are described.

What's included

6 videos6 readings1 assignment

6 videos•Total 58 minutes

Lesson 1: Feed Forward Neural Networks•10 minutes
Lesson 2: Output Layers and Loss Functions•11 minutes
Lesson 3: Neural Network Training with Gradient Descent•11 minutes
Lesson 4: Data Splits and Neural Network Performance Evaluation•8 minutes
Lesson 5: Neural Network Regularization•9 minutes
Lesson 6: Convolutional Neural Networks•9 minutes

6 readings•Total 80 minutes

Supplementary Reading: Feed-Forward Neural Networks•15 minutes
Supplementary Reading: Output Layers and Loss Functions•15 minutes
Supplementary Reading: Neural Network Training with Gradient Descent•15 minutes
Supplementary Reading: Data Splits and Neural Network Performance Evaluation•10 minutes
Supplementary Reading: Neural Network Regularization•15 minutes
Supplementary Reading: Convolutional Neural Networks•10 minutes

1 assignment•Total 30 minutes

Feed-Forward Neural Networks•30 minutes

The two most prevalent applications of deep neural networks to self-driving are object detection, including pedestrian, cyclists and vehicles, and semantic segmentation, which associates image pixels with useful labels such as sign, light, curb, road, vehicle etc. This module presents baseline techniques for object detection and the following module introduce semantic segmentation, both of which can be used to create a complete self-driving car perception pipeline.

What's included

4 videos4 readings1 assignment

4 videos•Total 52 minutes

Lesson 1: The Object Detection Problem•15 minutes
Lesson 2: 2D Object detection with Convolutional Neural Networks•11 minutes
Lesson 3: Training vs. Inference•11 minutes
Lesson 4: Using 2D Object Detectors for Self-Driving Cars•14 minutes

4 readings•Total 120 minutes

Supplementary Reading: The Object Detection Problem•15 minutes
Supplementary Reading: 2D Object detection with Convolutional Neural Networks•30 minutes
Supplementary Reading: Training vs. Inference•45 minutes
Supplementary Reading: Using 2D Object Detectors for Self-Driving Cars•30 minutes

1 assignment•Total 30 minutes

Object Detection For Self-Driving Cars•30 minutes

The second most prevalent application of deep neural networks to self-driving is semantic segmentation, which associates image pixels with useful labels such as sign, light, curb, road, vehicle etc. The main use for segmentation is to identify the drivable surface, which aids in ground plane estimation, object detection and lane boundary assessment. Segmentation labels are also being directly integrated into object detection as pixel masks, for static objects such as signs, lights and lanes, and moving objects such cars, trucks, bicycles and pedestrians.

What's included

3 videos3 readings1 assignment

3 videos•Total 31 minutes

Lesson 1: The Semantic Segmentation Problem•8 minutes
Lesson 2: ConvNets for Semantic Segmentation•11 minutes
Lesson 3: Semantic Segmentation for Road Scene Understanding•11 minutes

3 readings•Total 90 minutes

Supplementary Reading: The Semantic Segmentation Problem•30 minutes
Supplementary Reading: ConvNets for Semantic Segmentation•30 minutes
Supplementary Reading: Semantic Segmentation for Road Scene Understanding•30 minutes

1 assignment•Total 20 minutes

Semantic Segmentation For Self-Driving Cars•20 minutes

The final module of this course focuses on the implementation of a collision warning system that alerts a self-driving car about the position and category of obstacles present in their lane. The project is comprised of three major segments: 1) Estimating the drivable space in 3D, 2) Semantic Lane Estimation and 3) Filter wrong output from object detection using semantic segmentation.

What's included

4 videos1 programming assignment1 discussion prompt1 ungraded lab

4 videos•Total 24 minutes

Project Overview: Using CARLA for object detection and segmentation•6 minutes
Final Project Hints•6 minutes
Final Project Solution [LOCKED]•9 minutes
Congratulations for completing the course!•3 minutes

1 programming assignment•Total 180 minutes

Environment Perception For Self-Driving Cars•180 minutes

1 discussion prompt•Total 15 minutes

Your Learning Journey•15 minutes

1 ungraded lab•Total 180 minutes

Environment Perception For Self-Driving Cars•180 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings

4.7 (76 ratings)

👁 Steven Waslander

Steven Waslander

University of Toronto

4 Courses•181,898 learners

👁 Jonathan Kelly

Jonathan Kelly

University of Toronto

4 Courses•181,898 learners

Offered by

👁 Image

University of Toronto

Explore more from Software Development

👁 Image
Status: Free Trial
U
University of Toronto
Introduction to Self-Driving Cars
Course
👁 Image
Status: Free Trial
U
University of Toronto
Motion Planning for Self-Driving Cars
Course
👁 Image
Status: Free Trial
U
University of Toronto
State Estimation and Localization for Self-Driving Cars
Course
👁 Image
Status: Free Trial
C
Columbia University
Visual Perception
Course

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

👁 Image

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

👁 Image

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
77.55%
4 stars
16.32%
3 stars
3.91%
2 stars
0.68%
1 star
1.53%

Showing 3 of 587

Reviewed on Oct 17, 2021

This is EPIC. Love the profs for splitting it down to such easy to understand sections

Reviewed on Mar 18, 2025

it was good, but it could be more in depth. what provided in the course was just the tip of the iceberg.

Reviewed on Mar 24, 2019

Good intro for those with not much experience w/ image processing/computer vision w.r.t. autonomous driving.

View more reviews

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

URL: https://www.coursera.org/learn/visual-perception-self-driving-cars