Sequence Modeling, Transformers, and Transfer Learning

This course is part of the AI Engineer Professional Specialization

Instructor: Packt - Course Instructors

3 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Understand the fundamentals of sequence modeling using RNNs, LSTMs, and GRUs.
Master the transformer architecture and attention mechanisms for NLP tasks.
Apply transfer learning to fine-tune pre-trained models for custom tasks.
Work on hands-on projects using RNNs, transformers, and transfer learning for text generation, translation, and summarization.

Skills you'll gain

Tools you'll learn

Generative AI

Details to know

Shareable certificate

Add to your LinkedIn profile

Build your subject-matter expertise

This course is part of the AI Engineer Professional Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 3 modules in this course

This course features Coursera Coach!

A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course.

This course provides a comprehensive journey into sequence modeling, transformers, and transfer learning, equipping you with the skills to build powerful models for natural language processing (NLP) and other sequential data tasks. You'll begin by mastering Recurrent Neural Networks (RNNs), including their architecture, training techniques like backpropagation through time (BPTT), and specialized models such as Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRUs). The course then moves into sequence-to-sequence models, which are critical for tasks like translation, summarization, and text generation.

The next phase of the course explores the groundbreaking transformer architecture, the backbone of modern NLP models like BERT and GPT. You will dive into attention mechanisms, self-attention, and multi-head attention, understanding how these components capture contextual relationships in text. You'll also gain hands-on experience with pre-trained transformer models and learn how to apply them to real-world NLP tasks such as text summarization and translation.

In the final section, you'll focus on transfer learning, a technique that enables the reuse of pre-trained models to solve new tasks with fewer resources. This course teaches you how to fine-tune models for both computer vision and NLP applications, including domain adaptation strategies and challenges. With a hands-on project at the end of the course, you’ll apply transfer learning to fine-tune a model for a custom task, demonstrating your ability to adapt state-of-the-art models to real-world problems.

This course is ideal for learners with a foundational understanding of machine learning who want to advance their knowledge in deep learning, sequence modeling, and transfer learning. Prior knowledge of Python and basic machine learning concepts is recommended. The course is suitable for intermediate learners looking to deepen their understanding and practical skills in AI and deep learning. By the end of the course, you will be able to implement sequence models like RNNs, build transformers using attention mechanisms, apply transfer learning to fine-tune pre-trained models, and solve complex NLP tasks such as translation, summarization, and text generation.

In this module, we will explore the world of sequence modeling with Recurrent Neural Networks (RNNs). You'll learn about the architecture of RNNs, including how backpropagation through time works. We also cover advanced models like LSTMs and GRUs, and teach you how to preprocess text data and apply RNNs to sequence-to-sequence tasks. The module concludes with a hands-on project to implement RNNs for text generation or sentiment analysis.
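As a rough illustration of the recurrence this module describes, the sketch below runs a vanilla RNN forward pass in plain Python. It is not taken from the course materials: the function name `rnn_forward`, the weight values, and the toy sequence are all assumptions chosen for illustration. Each hidden state is computed from the current input and the previous hidden state, which is the chain that backpropagation through time later unrolls.

```python
import math

def rnn_forward(inputs, w_xh, w_hh, b_h):
    """Run a vanilla RNN over a sequence and return every hidden state.

    Each step computes h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h).
    """
    hidden_size = len(b_h)
    h = [0.0] * hidden_size  # initial hidden state h_0 is all zeros
    states = []
    for x in inputs:
        new_h = []
        for i in range(hidden_size):
            total = b_h[i]
            total += sum(w_xh[i][j] * x[j] for j in range(len(x)))      # input term
            total += sum(w_hh[i][k] * h[k] for k in range(hidden_size))  # recurrent term
            new_h.append(math.tanh(total))
        h = new_h
        states.append(h)
    return states

# Hypothetical toy setup: 2-dimensional inputs, 3 hidden units, hand-picked weights.
W_XH = [[0.5, -0.2], [0.1, 0.4], [-0.3, 0.2]]
W_HH = [[0.1, 0.0, 0.2], [0.0, 0.1, -0.1], [0.2, -0.2, 0.1]]
B_H = [0.0, 0.1, -0.1]
SEQUENCE = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

STATES = rnn_forward(SEQUENCE, W_XH, W_HH, B_H)
for step, state in enumerate(STATES, start=1):
    print(f"h_{step} =", [round(v, 4) for v in state])
```

Because the same weights are reused at every step, the final hidden state depends on the whole sequence seen so far. LSTMs and GRUs keep this recurrent loop but add gates that control what is kept or forgotten.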

What's included

7 videos, 2 readings, 1 assignment

7 videos • Total 165 minutes

Day 1: Introduction to Sequence Modeling and RNNs • 34 minutes
Day 2: Understanding RNN Architecture and Backpropagation Through Time (BPTT) • 25 minutes
Day 3: Long Short-Term Memory (LSTM) Networks • 15 minutes
Day 4: Gated Recurrent Units (GRUs) • 7 minutes
Day 5: Text Preprocessing and Word Embeddings for RNNs • 24 minutes
Day 6: Sequence-to-Sequence Models and Applications • 43 minutes
Day 7: RNN Project – Text Generation or Sentiment Analysis • 18 minutes

2 readings • Total 20 minutes

Introduction to the Course 'Sequence Modeling, Transformers, and Transfer Learning' • 10 minutes
Full Specialization Resources • 10 minutes

1 assignment • Total 15 minutes

Recurrent Neural Networks (RNNs) and Sequence Modeling - Assessment • 15 minutes

In this module, we introduce you to the transformative power of attention mechanisms in deep learning models. You’ll explore the architecture of transformers, learning about self-attention, multi-head attention, and positional encoding. With hands-on demonstrations of pre-trained transformer models like BERT and GPT, this section equips you to apply advanced NLP techniques to real-world projects like text summarization and translation.
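To make the idea of self-attention a little more concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an assumption-laden simplification rather than the course's own code: the names `softmax`, `self_attention`, and `TOKENS` are hypothetical, and the token embeddings are used directly as queries, keys, and values, whereas a real transformer first applies learned projection matrices and runs several such heads in parallel.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(embeddings):
    """Scaled dot-product self-attention with identity projections.

    Simplification: queries, keys, and values are the embeddings themselves.
    """
    d_k = len(embeddings[0])
    outputs, all_weights = [], []
    for query in embeddings:
        # Similarity of this token to every token, scaled by sqrt(d_k).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d_k)
            for key in embeddings
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        mixed = [
            sum(w * value[j] for w, value in zip(weights, embeddings))
            for j in range(d_k)
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional embeddings for a three-token sequence.
TOKENS = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
OUTPUTS, WEIGHTS = self_attention(TOKENS)
for i, row in enumerate(WEIGHTS):
    print(f"token {i} attends with weights", [round(w, 3) for w in row])
```

Every token attends to every other token in a single step, regardless of distance. Note that this sketch has no notion of token order, which is the gap positional encoding is meant to fill.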

What's included

7 videos, 1 assignment

7 videos • Total 134 minutes

Day 1: Introduction to Attention Mechanisms • 15 minutes
Day 2: Introduction to Transformers Architecture • 18 minutes
Day 3: Self-Attention and Multi-Head Attention in Transformers • 21 minutes
Day 4: Positional Encoding and Feed-Forward Networks • 20 minutes
Day 5: Hands-On with Pre-Trained Transformers – BERT and GPT • 20 minutes
Day 6: Advanced Transformers – BERT Variants and GPT-3 • 21 minutes
Day 7: Transformer Project – Text Summarization or Translation • 19 minutes

1 assignment • Total 15 minutes

Transformers and Attention Mechanisms - Assessment • 15 minutes

In this module, we dive into the concept of transfer learning, a powerful technique that leverages pre-trained models for a wide range of applications. You will learn how to use transfer learning for both computer vision and natural language processing (NLP), including fine-tuning strategies and domain adaptation. The section concludes with a project where you will fine-tune a model for a custom task, helping you apply these techniques to solve real-world problems.
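One common fine-tuning strategy is to freeze a pre-trained feature extractor and train only a small task-specific head on top of it. The sketch below imitates that pattern in plain Python under heavy assumptions: `FROZEN_W` is a made-up stand-in for pre-trained weights, the four-example dataset is invented, and the head is a simple logistic-regression layer. It is meant only to show which parameters change during training, not how the course projects are implemented.

```python
import math

# Stand-in for a pre-trained backbone (hypothetical values, never updated).
FROZEN_W = [[1.0, -1.0], [0.5, 0.5]]

def extract_features(x):
    """Frozen feature extractor: ReLU(W x) with fixed weights."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in FROZEN_W]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_head(data, labels, epochs=200, lr=0.5):
    """Train only the new head (logistic regression) with per-example gradient steps."""
    feats = [extract_features(x) for x in data]  # backbone is frozen, so compute once
    w = [0.0] * len(feats[0])
    b = 0.0
    for _ in range(epochs):
        for f, y in zip(feats, labels):
            p = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b)
            err = p - y  # gradient of the log loss with respect to the logit
            w = [wi - lr * err * fi for wi, fi in zip(w, f)]
            b -= lr * err
    return w, b

def predict(x, w, b):
    f = extract_features(x)
    return sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b)

# Invented toy task: label 1 when the first coordinate exceeds the second.
DATA = [[2.0, 0.0], [3.0, 1.0], [0.0, 2.0], [1.0, 3.0]]
LABELS = [1, 1, 0, 0]

HEAD_W, HEAD_B = train_head(DATA, LABELS)
for x, y in zip(DATA, LABELS):
    print(x, "label", y, "predicted", round(predict(x, HEAD_W, HEAD_B), 3))
```

Only `HEAD_W` and `HEAD_B` are learned here. Fuller fine-tuning would also update some or all backbone weights, typically with a smaller learning rate.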

What's included

7 videos, 1 reading, 3 assignments

7 videos • Total 139 minutes

Day 1: Introduction to Transfer Learning • 15 minutes
Day 2: Transfer Learning in Computer Vision • 26 minutes
Day 3: Fine-Tuning Techniques in Computer Vision • 22 minutes
Day 4: Transfer Learning in NLP • 17 minutes
Day 5: Fine-Tuning Techniques in NLP • 26 minutes
Day 6: Domain Adaptation and Transfer Learning Challenges • 15 minutes
Day 7: Transfer Learning Project – Fine-Tuning for a Custom Task • 18 minutes

1 reading • Total 10 minutes

Conclusion to the Course 'Sequence Modeling, Transformers, and Transfer Learning' • 10 minutes

3 assignments • Total 90 minutes

Full Course Practice Assessment • 15 minutes
Transfer Learning and Fine-Tuning - Assessment • 15 minutes
Full Course Assessment • 60 minutes

Instructor

Packt - Course Instructors

Packt

1,926 Courses • 558,431 learners

Offered by

Packt

Frequently asked questions

Sequence modeling involves training machine learning models to work with sequential data, like time series or text, where the order of data matters. Transformers, a modern deep learning architecture, have revolutionized NLP tasks by using attention mechanisms to better capture long-range dependencies in sequences. Transfer learning allows models to leverage pre-trained knowledge from one task and apply it to another, significantly improving performance, especially when data is limited. These techniques are highly relevant as they are foundational to state-of-the-art AI models, particularly in natural language processing and computer vision.

The "Sequence Modeling, Transformers, and Transfer Learning" course explores advanced machine learning techniques for working with sequential data. It covers Recurrent Neural Networks (RNNs), including Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRUs), as well as transformer models and attention mechanisms. The course also delves into transfer learning, including its applications in NLP and computer vision, with hands-on projects to reinforce learning. You will work with popular pre-trained models like BERT and GPT, and apply transfer learning to custom tasks.

After completing the course, you will have a deep understanding of sequence modeling, transformers, and transfer learning techniques. You will be capable of building and training RNNs and transformer models for tasks like text generation, sentiment analysis, text summarization, and translation. You will also be able to use transfer learning to fine-tune pre-trained models for specific applications in both NLP and computer vision, greatly enhancing your ability to solve real-world problems using AI.

This course assumes a basic understanding of machine learning, including knowledge of neural networks and deep learning concepts. Familiarity with Python programming and libraries such as NumPy, TensorFlow, and PyTorch will be helpful, as the course includes hands-on coding and project work. Prior experience with NLP or computer vision is not required, but it will be beneficial for those who want to fully grasp the applications of transfer learning in these fields.

This course is designed for individuals who already have a foundation in machine learning and want to deepen their knowledge of sequence modeling and modern deep learning techniques. It is particularly suited for AI enthusiasts, data scientists, machine learning engineers, or professionals looking to specialize in NLP and computer vision tasks, especially those interested in working with state-of-the-art models like transformers and pre-trained models.

The course consists of 9 hours of video content. The time required to complete the course will depend on your pace and how much time you dedicate to hands-on projects. On average, it may take around 12 to 15 hours to go through the material, complete the exercises, and work on the projects.

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

This course is currently available only to learners who have paid or received financial aid, when available.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

URL: https://www.coursera.org/learn/packt-sequence-modeling-transformers-and-transfer-learning-s8p1m