Generative AI: Fine-Tuning LLMs and Diffusion Models

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

👁 Board Infinity

Generative AI: Fine-Tuning LLMs and Diffusion Models

This course is part of Advanced Deep Learning Architectures Specialization

👁 Board Infinity

Instructor: Board Infinity

Included with

•

Learn more

Ask Coursera

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

2 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

2 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Build decoder-only transformer pipelines with KV caching optimizations
Fine-tune 7B+ LLMs using LoRA and QLoRA on consumer GPUs
Configure diffusers pipelines with ControlNet for controllable images
Train, export, and evaluate a domain-specialized LLM adapter

Skills you'll gain

Tools you'll learn

Details to know

👁 Image

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

👁 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Advanced Deep Learning Architectures Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

👁 Image

There are 4 modules in this course

"Master Generative AI with hands-on training in Large Language Models (LLMs), PEFT techniques (LoRA, QLoRA), and Diffusion Models using Hugging Face, diffusers, peft, trl, and bitsandbytes. This course takes you from the internals of decoder-only transformers to building a specialist fine-tuned LLM and generating high-quality, controllable images with ControlNet.

In Module 1, explore decoder-only transformer architectures, self-attention, causal masking, KV caching, and token flow mechanics. Module 2 focuses on Parameter-Efficient Fine-Tuning (PEFT), where you'll implement LoRA, QLoRA, and 4-bit quantization to fine-tune large models on consumer GPUs using SFT pipelines. Module 3 dives into diffusion models, covering forward/reverse processes, UNet, schedulers (DDIM, Euler, DPM++), and ControlNet conditioning. Module 4 is a capstone where you'll build a Specialist LLM — from dataset creation to adapter export and evaluation. By the end of this course, you will: - Build and optimize decoder-only transformer pipelines with KV caching - Fine-tune 7B+ LLMs using LoRA, QLoRA, and SFT pipelines on limited hardware - Configure diffusers pipelines with ControlNet for controllable image generation - Train, export, and evaluate a domain-specialized LLM adapter end-to-end" Disclaimer: This is an independent educational resource created by Board Infinity for informational and educational purposes only. This course is not affiliated with, endorsed by, sponsored by, or officially associated with any company, organization, or certification body unless explicitly stated. The content provided is based on industry knowledge and best practices but does not constitute official training material for any specific employer or certification program. All company names, trademarks, service marks, and logos referenced are the property of their respective owners and are used solely for educational identification and comparison purposes.

Explore the inner workings of decoder-only transformer architectures, including token flow, self-attention, causal masking, and KV cache optimization.

What's included

11 videos3 readings4 assignments

11 videos•Total 107 minutes

Where LLM Engineers Work Today•10 minutes
Why Decoder-Only Models Dominate Industry Part 1•11 minutes
Why Decoder-Only Models Dominate Industry part -2•9 minutes
Skills Employers Want•11 minutes
Skills Employers Want Part 2•9 minutes
What Happens When You Type a Prompt•9 minutes
Decoder Block Components (Mental Model)•9 minutes
How Tokens Flow Through Layers•9 minutes
Causal Masking & "No Peeking"•10 minutes
Why Attention Is Expensive•10 minutes
KV Cache: Why Inference Gets Faster•10 minutes

3 readings•Total 90 minutes

The LLM Engineering Landscape (2026)•30 minutes
Inside the Decoder Block: Architecture & Data Flow•30 minutes
KV Cache Optimization & Attention Behavior Analysis•30 minutes

4 assignments•Total 105 minutes

Transformer Internals & Decoder-Only Architectures•60 minutes
Career Scope in LLM Engineering•15 minutes
Anatomy of Decoder-Only Transformers•15 minutes
Self-Attention, Causal Masking & KV-Cache Behavior•15 minutes

Master parameter-efficient fine-tuning techniques including LoRA, QLoRA with 4-bit quantization, and building supervised fine-tuning pipelines using peft and trl.

What's included

12 videos3 readings4 assignments

12 videos•Total 106 minutes

The Core Idea Behind PEFT•9 minutes
The Core Idea Behind PEFT part 2•11 minutes
The Core Idea Behind PEFT Part3•6 minutes
Implementing LoRA Layers in Attention Blocks•10 minutes
Where PEFT Works (and Where It Doesn't)•11 minutes
LoRA Explained•9 minutes
QLoRA: Training Big Models on Small GPUs•9 minutes
Setting QLoRA Hyperparameters for Stability•9 minutes
What Good SFT Data Looks Like Part 1•8 minutes
What Good SFT Data Looks Like Part2•6 minutes
Building Full SFT Pipelines Using TRL•8 minutes
Early Evaluation & Failure Signals•10 minutes

3 readings•Total 90 minutes

LoRA Fundamentals & Design Decisions•30 minutes
QLoRA Implementation Guide for Large Models•30 minutes
Constructing Reliable SFT Datasets for Behavior Modeling•30 minutes

4 assignments•Total 105 minutes

PEFT - LoRA, QLoRA, & SFT Pipelines•60 minutes
Introduction to PEFT & Low-Rank Adaptation•15 minutes
QLoRA & 4-Bit Quantization Pathway•15 minutes
Building SFT Pipelines with peft + trl•15 minutes

Understand the forward and reverse diffusion processes, configure diffusers pipelines with various schedulers, and apply ControlNets for conditioned image generation.

What's included

10 videos3 readings4 assignments

10 videos•Total 77 minutes

Adding Noise: The Forward Process•7 minutes
Removing Noise: The Reverse Process•8 minutes
Why Timesteps Matter•8 minutes
Diffusers Pipeline Components•7 minutes
Diffusers Pipeline Components Part2•10 minutes
Choosing the Right Scheduler•5 minutes
Style, Guidance & Sampling Tricks•7 minutes
Why Prompting Alone Is Not Enough•8 minutes
ControlNet Concepts (Visual)•8 minutes
Practical: Conditioning an Image with ControlNet•8 minutes

3 readings•Total 90 minutes

Mathematical Intuition Behind Diffusion Timesteps•30 minutes
Scheduler Comparison & Practical Recommendations•30 minutes
ControlNet for Structured Image Generation•30 minutes

4 assignments•Total 105 minutes

Diffusion Models & Image Generation•60 minutes
The Forward & Reverse Diffusion Process•15 minutes
Configuring diffusers Pipelines•15 minutes
ControlNets & Conditioning Techniques•15 minutes

Apply all course concepts in a capstone project building a specialist LLM through dataset creation, QLoRA training, and adapter exporting with rigorous evaluation.

What's included

12 videos3 readings4 assignments

12 videos•Total 81 minutes

Converting Logs Into SFT-Ready Training Data•9 minutes
Designing Prompt Templates & Chat Formats•6 minutes
Cleaning, Normalizing & Validating Training Data•8 minutes
Cleaning, Normalizing & Validating Training Data Part 2•5 minutes
Cleaning, Normalizing & Validating Training Data Part 3•7 minutes
Training Workflow Using LoRA/QLoRA•5 minutes
Monitoring Training Loss & Detecting Overfitting•6 minutes
Memory-Saving Techniques•7 minutes
Exporting LoRA/QLoRA Adapters for Deployment•8 minutes
Exporting LoRAQLoRA Adapters for Deployment part 2•5 minutes
Loading the Adapter Into Base Models for Inference•6 minutes
Evaluating Alignment: Perplexity, Behavior, Scenarios•7 minutes

3 readings•Total 90 minutes

Dataset Quality Standards for Persona-Level LLMs•30 minutes
Troubleshooting PEFT Training Failures•30 minutes
Evaluation Framework for Domain-Specialized LLMs•30 minutes

4 assignments•Total 105 minutes

The Hands-On Project - The Specialist LLM•60 minutes
Dataset Creation & Prompt-Style Engineering•15 minutes
Training the Specialist LoRA/QLoRA Adapter•15 minutes
Exporting, Loading & Evaluating the Adapter•15 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

👁 Board Infinity

Board Infinity

261 Courses•428,749 learners

Offered by

👁 Image

Board Infinity

Explore more from Machine Learning

👁 Image
B
Board Infinity
Deploying Deep Learning: Quantization, Serving, and Edge AI
Course
👁 Image
B
Board Infinity
Deep Learning: Advanced Backbones and Efficient GPU Training
Course

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

👁 Image

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

👁 Image

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

A basic understanding of Python, PyTorch, and neural networks is helpful. Familiarity with transformers is a plus but not mandatory — we cover decoder internals from scratch.

You'll work with Hugging Face transformers, peft, trl, bitsandbytes, and diffusers for LLM fine-tuning and image generation.

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

URL: https://www.coursera.org/learn/generative-ai-fine-tuning-llms-and-diffusion-models