Building and Training Neural Networks with PyTorch
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Building and Training Neural Networks with PyTorch
This course is part of PyTorch Ultimate 2024 - From Basics to Cutting-Edge Specialization
3,045 already enrolled
Included with
Ask Coursera
13 reviews
Recommended experience
13 reviews
Recommended experience
What you'll learn
Build and train neural networks using PyTorch for various tasks.
Implement classification models with multi-class, multi-label datasets, and CNNs for image and audio classification.
Apply object detection techniques using the YOLO algorithm.
Explore neural style transfer, transfer learning, and implement RNNs and LSTM networks.
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 7 modules in this course
Updated in May 2025.
This course now features Coursera Coach — your interactive learning companion that helps you test your knowledge, challenge assumptions, and deepen your understanding as you progress. Master the power of neural networks with this hands-on deep learning course built entirely in PyTorch. Designed for data scientists, AI practitioners, and developers, this course guides you step by step through building, training, and evaluating models for image, audio, and sequence-based tasks using one of the industry’s most popular frameworks. You’ll begin by exploring classification models, learning how to handle binary and multi-class problems, interpret confusion matrices, and analyze ROC curves. Through practical exercises, you’ll prepare data, design dataset classes, and build your own neural network architectures to solve real classification challenges. Next, you’ll move into Convolutional Neural Networks (CNNs), where you’ll develop both image and audio classification systems. You’ll learn how CNN layers work, implement preprocessing pipelines, and construct models for binary and multi-class image tasks. You’ll also extend these skills to audio classification, giving you a broader understanding of how CNNs apply across domains. From there, you’ll dive into object detection, mastering accuracy metrics, labeling formats, and the YOLO (You Only Look Once) algorithm. Hands-on coding sessions walk you through data preparation, training, and inference so you can build complete, end-to-end detection workflows. In the final modules, you’ll explore neural style transfer, transfer learning with pre-trained networks, and sequence modeling using RNNs and LSTMs — gaining the skills to tackle advanced deep learning applications. By the end of this course, you will have: - Built and evaluated neural network models for binary and multi-class classification. - Designed and trained CNNs for image and audio data. - Implemented object detection workflows using YOLO. - Applied neural style transfer and leveraged pre-trained models for transfer learning. - Developed RNN and LSTM models for sequence-based tasks. - Gained the confidence to use PyTorch for real-world deep learning projects. This course is ideal for learners with experience in Python and a foundational understanding of machine learning and deep learning concepts who want to advance their skills in building neural networks with PyTorch.
In this module, we will delve into the realm of classification models, focusing on their types, evaluation metrics, and implementation. You will learn about key concepts such as the confusion matrix and ROC curve, and engage in practical exercises to build and evaluate multi-class classification models.
What's included
16 videos2 readings
16 videos•Total 65 minutes
- Section Overview•2 minutes
- Classification Types (101)•5 minutes
- Confusion Matrix (101)•6 minutes
- ROC Curve (101)•7 minutes
- Multi-Class 1: Data Prep•3 minutes
- Multi-Class 2: Dataset Class (Exercise)•0 minutes
- Multi-Class 3: Dataset Class (Solution)•2 minutes
- Multi-Class 4: Network Class (Exercise)•1 minute
- Multi-Class 5: Network Class (Solution)•2 minutes
- Multi-Class 6: Loss, Optimizer, and Hyperparameters•3 minutes
- Multi-Class 7: Training Loop•3 minutes
- Multi-Class 8: Model Evaluation•3 minutes
- Multi-Class 9: Naive Classifier•3 minutes
- Multi-Class 10: Summary•1 minute
- Multi-Label (Exercise)•7 minutes
- Multi-Label (Solution)•16 minutes
2 readings•Total 20 minutes
- Introduction to the Course 'Building and Training Neural Networks with PyTorch'•10 minutes
- Full Specialization Resources•10 minutes
In this module, we will explore the power of convolutional neural networks (CNNs) in image classification tasks. You will learn about the CNN architecture, preprocess images for optimal results, and gain hands-on experience in implementing binary and multi-class image classification models.
What's included
11 videos
11 videos•Total 85 minutes
- Section Overview•2 minutes
- CNNs (101)•10 minutes
- CNN (Interactive)•4 minutes
- Image Preprocessing (101)•9 minutes
- Image Preprocessing (Coding)•10 minutes
- Binary Image Classification (101)•1 minute
- Binary Image Classification (Coding)•19 minutes
- Multi-Class Image Classification (Exercise)•4 minutes
- Multi-Class Image Classification (Solution)•9 minutes
- Layer Calculations (101)•7 minutes
- Layer Calculations (Coding)•11 minutes
In this module, we will focus on using convolutional neural networks for audio classification. You will get a comprehensive introduction to the topic, learn how to conduct exploratory data analysis on audio data, and engage in practical exercises to build and evaluate your own audio classification models.
What's included
5 videos1 assignment
5 videos•Total 33 minutes
- Audio Classification (101)•3 minutes
- Audio Classification (Exercise)•7 minutes
- Audio Classification (Exploratory Data Analysis)•5 minutes
- Audio Classification (Data Prep-Solution)•6 minutes
- Audio Classification (Model-Solution)•12 minutes
1 assignment•Total 15 minutes
- Assessment 1•15 minutes
In this module, we will dive into object detection using convolutional neural networks. You will learn about essential accuracy metrics, implement popular object detection algorithms like YOLO, and utilize GPU resources for training and inference to build robust object detection models.
What's included
13 videos
13 videos•Total 73 minutes
- Section Overview•1 minute
- Accuracy Metrics (101)•7 minutes
- Object Detection (101)•3 minutes
- Object Detection with detecto (Coding)•8 minutes
- Training a Model on GPU for Free (Coding)•3 minutes
- YOLO (101)•6 minutes
- Labeling Formats•4 minutes
- YOLOv7 Project (101)•10 minutes
- YOLOv7 Coding: Setup•7 minutes
- YOLOv7 Coding: Data Prep•6 minutes
- YOLOv7 Coding: Model Training•4 minutes
- YOLOv7 Coding: Model Inference•4 minutes
- YOLOv8 Coding: Model Training and Inference•9 minutes
In this module, we will cover the fascinating topic of neural style transfer. You will understand the underlying principles, implement style transfer algorithms through coding, and explore various creative applications to transform images in unique ways.
What's included
3 videos1 assignment
3 videos•Total 24 minutes
- Section Overview•1 minute
- Style Transfer (101)•8 minutes
- Style Transfer (Coding)•15 minutes
1 assignment•Total 15 minutes
- Assessment 2•15 minutes
In this module, we will delve into pre-trained networks and transfer learning. You will learn how to leverage pre-trained models, implement transfer learning techniques through coding exercises, and understand the advantages of applying these concepts to various machine learning tasks.
What's included
3 videos
3 videos•Total 16 minutes
- Section Overview•1 minute
- Transfer Learning and Pre-Trained Networks (101)•5 minutes
- Transfer Learning (Coding)•10 minutes
In this module, we will introduce recurrent neural networks (RNNs) and their applications. You will explore the basics of RNNs, implement Long Short-Term Memory (LSTM) networks through practical coding exercises, and engage in tasks designed to deepen your understanding of these powerful models.
What's included
4 videos1 reading3 assignments
4 videos•Total 27 minutes
- Section Overview•1 minute
- RNN (101)•6 minutes
- LSTM (Coding)•16 minutes
- LSTM (Exercise)•3 minutes
1 reading•Total 10 minutes
- Conclusion to the Course 'Building and Training Neural Networks with PyTorch'•10 minutes
3 assignments•Total 90 minutes
- Full Course Practice Assessment•15 minutes
- Assessment 3•15 minutes
- Full Course Assessment•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Explore more from Software Development
Why people choose Coursera for their career
Learner reviews
- 5 stars
84.61%
- 4 stars
7.69%
- 3 stars
7.69%
- 2 stars
0%
- 1 star
0%
Showing 3 of 13
Reviewed on Dec 18, 2024
Very practical! I learn a lot on coding in pytorch!
Reviewed on Aug 24, 2025
Another excellent course by the instructor. A bit more difficult that the first course. The sections on transfer learning are valuable.
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
