Generative AI: Foundations and Concepts
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Ask Coursera
Skills you'll gain
- Generative Model Architectures
- Probability & Statistics
- Model Evaluation
- Probability Distribution
- Applied Mathematics
- Mathematical Modeling
- Convolutional Neural Networks
- Network Architecture
- Machine Learning Methods
- Model Optimization
- Bayesian Network
- Estimation
- Deep Learning
- Model Training
- Artificial Neural Networks
Tools you'll learn
Details to know
9 assignments
See how employees at top companies are mastering in-demand skills
There are 4 modules in this course
This course provides an overview of some different concepts underpinning Generative AI, their mathematical principles, and their applications in engineering. The focus will be on the practical implementation of generative AI including, neural networks, attention mechanism, and advanced deep learning models.
In this module, you will explore the foundations of neural networks, including perceptrons, architectures, and learning algorithms. You will dive deeply into optimization methods critical for efficient training, focusing on advanced techniques like Newtonβs and quasi-Newton methods, momentum, RMSProp, and Adam optimization algorithms.
What's included
6 videos15 readings2 assignments2 discussion prompts
6 videosβ’Total 29 minutes
- Neural Networks Part 1: Perceptronβ’6 minutes
- Neural Networks Part 2: How Neural Networks Learnβ’6 minutes
- Neural Networks Part 3: Back Propagationβ’7 minutes
- Optimization Technique Overview Part 1β’3 minutes
- Optimization Technique Overview Part 2β’4 minutes
- Optimization Technique Overview Part 3β’3 minutes
15 readingsβ’Total 157 minutes
- Course Overviewβ’2 minutes
- Syllabus - Generative AI: Foundations and Conceptsβ’10 minutes
- Academic Integrityβ’1 minute
- Module Overviewβ’3 minutes
- Perceptron In-Depthβ’10 minutes
- Neural Network Breakdownβ’15 minutes
- Neural Network Structureβ’5 minutes
- How Neural Networks Learn: Deep Diveβ’10 minutes
- Backpropagation & SGDβ’20 minutes
- Matricesβ’15 minutes
- Newton's Methodsβ’15 minutes
- Quasi-Newton Methodsβ’15 minutes
- Root-Mean-Square Propagationβ’15 minutes
- Adaptive Moment Estimationβ’20 minutes
- Module Wrap-Upβ’1 minute
2 assignmentsβ’Total 20 minutes
- Check Your Knowledgeβ’10 minutes
- Check Your Knowledgeβ’10 minutes
2 discussion promptsβ’Total 70 minutes
- Meet Your Fellow Learnersβ’10 minutes
- Neural Networksβ’60 minutes
This module guides you through the mathematical approaches to regularization techniques that enhance neural network generalization and prevent overfitting. You will analyze concepts including Steinβs unbiased risk estimator, eigen decomposition, ensemble methods, dropout mechanisms, and advanced normalization techniques such as batch normalization.
What's included
4 videos17 readings2 assignments1 discussion prompt
4 videosβ’Total 23 minutes
- Regularization: Model Selection and Complexityβ’5 minutes
- Regularization Techniquesβ’8 minutes
- Introduction to Dropoutβ’4 minutes
- Introduction to Batch Normalizationβ’6 minutes
17 readingsβ’Total 160 minutes
- Module Overviewβ’1 minute
- Steinβs Unbiased Risk Estimatorβ’15 minutes
- Stein's Lemmaβ’15 minutes
- Regularizationβ’10 minutes
- Why Does Regularization Work?β’15 minutes
- Eigen Decomposition and Singular Value Decompositionβ’15 minutes
- Understanding the Search Spaceβ’5 minutes
- Regularization Techniquesβ’15 minutes
- Bagging and Other Ensemble Methodsβ’5 minutes
- Deep Dive Into Dropoutβ’15 minutes
- Applying Dropout to Linear Regressionβ’15 minutes
- Deep Dive Into Batch Normalizationβ’2 minutes
- Internal Covariate Shift and Domain Adaptationβ’10 minutes
- New Batch Normalization Techniquesβ’15 minutes
- Batch Normalization Effectsβ’5 minutes
- Alternatives to Batch Normalizationβ’1 minute
- Module Wrap-Upβ’1 minute
2 assignmentsβ’Total 20 minutes
- Check Your Knowledgeβ’10 minutes
- Check Your Knowledgeβ’10 minutes
1 discussion promptβ’Total 60 minutes
- Regularization & Stabilization Techniquesβ’60 minutes
In this module, you will examine convolutional neural networks (CNNs), including convolution operations, parameter sharing, kernel methods, and multi-dimensional data structures. You'll explore advanced CNN architectures, regularization, normalization techniques, and the implications of random kernels on network learning behavior.
What's included
5 videos31 readings2 assignments1 discussion prompt
5 videosβ’Total 46 minutes
- Convolutional Neural Networks Part 1: The First Principlesβ’10 minutes
- Convolutional Neural Networks Part 2: 1D Inputβ’8 minutes
- Convolutional Neural Networks Part 3: Multiple Dimensionsβ’9 minutes
- Convolutional Neural Networks Part 4: Backpropagationβ’12 minutes
- Convolutional Neural Networks Part 5: PixelCNNβ’7 minutes
31 readingsβ’Total 270 minutes
- Module Overviewβ’1 minute
- Introduction to Convolutional Neural Networksβ’2 minutes
- Invariance and Equivarianceβ’5 minutes
- Convolutionβ’5 minutes
- Translationβ’5 minutes
- Kernel Flippingβ’5 minutes
- Convolution vs. Cross-Correlationβ’5 minutes
- Edge Detectionβ’15 minutes
- Types of Kernelsβ’5 minutes
- Parameter Sharing and Filtersβ’2 minutes
- CNNs for 1D Inputsβ’10 minutes
- Paddingβ’5 minutes
- Stride, Kernel Size, and Dilationβ’2 minutes
- Convolutional Layers as Fully Connected Layersβ’10 minutes
- Convolution in Multidimensional Arraysβ’5 minutes
- Architecture of Convolutional NNsβ’10 minutes
- Downsamplingβ’15 minutes
- Upsampling and Layersβ’5 minutes
- End-to-End Visualization of CNNsβ’30 minutes
- Backpropagationβ’15 minutes
- Convolutional Layersβ’25 minutes
- Kernel Weightsβ’15 minutes
- Applications of CNNsβ’20 minutes
- Residual Neural Networksβ’20 minutes
- Recap on Regularizationβ’2 minutes
- Ideas to Get Around the Optimization Problemβ’5 minutes
- Layer Normalization Formulasβ’5 minutes
- Filter Response Normalization (FRN)β’10 minutes
- Normalizer-Free Networksβ’5 minutes
- Why Random Kernels Learn Different Thingsβ’5 minutes
- Module Wrap-Upβ’1 minute
2 assignmentsβ’Total 13 minutes
- Check Your Knowledgeβ’10 minutes
- Check Your Knowledgeβ’3 minutes
1 discussion promptβ’Total 60 minutes
- CNN Quirksβ’60 minutes
In this module, you will analyze the maths underpinning generative models and maximum likelihood estimation (MLE). You will explore divergence metrics such as Kullback-Leibler divergence, Bayesian network structures, and autoregressive modeling methods, focusing on their theoretical foundations and practical implications.
What's included
6 videos33 readings3 assignments1 discussion prompt
6 videosβ’Total 53 minutes
- Intro to Maximum Likelihood Learningβ’9 minutes
- Divergence Methods & Gradient Descentβ’11 minutes
- Representation Part 1: Distributionsβ’10 minutes
- Representation Part 2: Discriminative vs General Modelsβ’9 minutes
- Autoregressive Models General Principlesβ’9 minutes
- Autoregressive Models Continuedβ’7 minutes
33 readingsβ’Total 226 minutes
- Module Overviewβ’1 minute
- Learning a Generative Modelβ’8 minutes
- Goal of Learningβ’3 minutes
- What is βBest?"β’2 minutes
- Learning as Density Estimationβ’1 minute
- Kullback-Leibler (KL-Divergence)β’3 minutes
- Detour on KL-Divergenceβ’3 minutes
- Expected Log-Likelihoodβ’5 minutes
- Monte Carlo Estimationβ’8 minutes
- Extending the MLE Principle to Autoregressive Modelsβ’5 minutes
- MLE Learning: Gradient Descentβ’3 minutes
- MLE Learning: Stochastic Gradient Descentβ’4 minutes
- Empirical Risk and Overfittingβ’10 minutes
- Learning a Generative Model Part 2β’5 minutes
- Basic Discrete Distributionsβ’10 minutes
- Structure Through Independenceβ’3 minutes
- Key Notion: Conditional Independenceβ’15 minutes
- Bayesian Networksβ’5 minutes
- Examplesβ’10 minutes
- Naive Bayesβ’8 minutes
- Discriminative vs. Generative Modelsβ’10 minutes
- Generative Models Are Still Usefulβ’8 minutes
- Bayesian Networks vs. Neural Modelsβ’20 minutes
- Motivating Example: MNISTβ’2 minutes
- Introduction to Autoregressive Modelsβ’10 minutes
- Fully Visible Sigmoid Belief Networks (FVSBN)β’10 minutes
- NADE: Neural Autoregressive Density Estimationβ’25 minutes
- General Discrete Distributionsβ’5 minutes
- Real-Valued Neural Autoregressive Density-Estimator (RNADE)β’5 minutes
- Autoregressive Models vs. Autoencoderβ’15 minutes
- Summary of Autoregressive Modelsβ’2 minutes
- Module Wrap-Upβ’1 minute
- Congratulationsβ’1 minute
3 assignmentsβ’Total 30 minutes
- Check Your Knowledgeβ’10 minutes
- Check Your Knowledgeβ’10 minutes
- Check Your Knowledgeβ’10 minutes
1 discussion promptβ’Total 60 minutes
- Generative Modelingβ’60 minutes
Instructor
Offered by
Explore more from Machine Learning
- Status: Free TrialA
Alberta Machine Intelligence Institute
Course
- U
University of Colorado Boulder
Course
- Status: Free TrialU
University of Michigan
Course
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
