VOOZH about

URL: https://www.coursera.org/learn/nvidia-large-language-models-and-generative-ai-deployment

⇱ NVIDIA: Large Language Models and Generative AI Deployment | Coursera


NVIDIA: Large Language Models and Generative AI Deployment

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

NVIDIA: Large Language Models and Generative AI Deployment

1,950 already enrolled

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

4 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

4 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Understand the foundational concepts of LLMs, including NLP and training data.

  • Explore model optimization techniques like loss functions, alignment, and PEFT.

  • Implement deployment strategies for LLMs and monitor performance using ONNX.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

6 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Exam Prep (NCA-GENL): NVIDIA-Certified Generative AI LLMs Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 3 modules in this course

NVIDIA: Large Language Models and Generative AI Deployment is the fourth course of the Exam Prep (NCA-GENL): NVIDIA-Certified Generative AI LLMs - Associate Specialization. This course offers a comprehensive understanding of Large Language Models (LLMs) and Generative AI deployment, combining theoretical insights with practical skills.

Learners will explore key components of Generative AI, data requirements, and cleaning techniques for LLMs. The course covers model training, optimization, and evaluation methods, including Few-shot, Zero-shot, and Instruction Tuning. Additionally, the course dives into loss functions, alignment techniques, and evaluation metrics such as Perplexity. It also emphasizes the use of GPUs for training, fine-tuning methods like prompt tuning, and Parameter Efficient Fine Tuning (PEFT). Learners will gain expertise in LLM deployment strategies and monitoring with ONNX. This course is divided into three modules, each containing lessons and video lectures. Learners will engage with 4:30-5:00 hours of video content, covering both theoretical concepts and hands-on practices. Each module is equipped with quizzes to reinforce learning and assess understanding. Module 1: Fundamentals of Large Language Models Module 2: Training, Optimization, and Evaluation of LLMs Module 3: LLM Deployment Strategies and Monitoring By the end of this course, a learner will be able to: - Understand the foundational concepts of LLMs, including NLP and training data. - Explore model optimization techniques like loss functions, alignment, and PEFT. - Implement deployment strategies for LLMs and monitor performance using ONNX. This course is intended for professionals looking to deepen their expertise in deploying and optimizing LLMs for Generative AI applications.

Welcome to Week 1 of the NVIDIA: Large Language Models and Generative AI Deployment course. This week, we will begin by introducing you to Large Language Models (LLMs) and explore their significance in Natural Language Processing (NLP). We will also demonstrate how LLMs are applied to various NLP tasks using HuggingFace. Next, we will dive into the concept of Generative AI models and their components. We’ll cover the importance of training data for LLMs and best practices for data cleaning. By the end of this week, you will have a solid understanding of LLMs, their applications, and the essential processes involved in training them.

What's included

6 videos2 readings2 assignments1 discussion prompt

6 videosβ€’Total 30 minutes
  • Introduction to Large Language Modelsβ€’5 minutes
  • Usage of LLM on NLP Tasks - HuggingFace - Demoβ€’8 minutes
  • What is Generative AI Model ?β€’4 minutes
  • Components of Generative AIβ€’4 minutes
  • Training data for LLMsβ€’4 minutes
  • Data Cleaning for LLMsβ€’4 minutes
2 readingsβ€’Total 20 minutes
  • Welcome to the Courseβ€’10 minutes
  • Overview of Fundamentals of Large Language Modelsβ€’10 minutes
2 assignmentsβ€’Total 30 minutes
  • Fundamentals of Large Language Models - Assessmentβ€’15 minutes
  • LLM Foundations & Generative AI - Knowledge checkβ€’15 minutes
1 discussion promptβ€’Total 10 minutes
  • Meet and Greetβ€’10 minutes

Welcome to Week 2 of the NVIDIA: Large Language Models and Generative AI Deployment course. This week, we will cover the essentials of training and optimizing Large Language Models (LLMs). We will begin by exploring the various learning methods, including Few-shot, Zero-shot, Instruction Tuning, and Reinforcement Learning with Human Feedback (RLHF). Next, we will delve into loss functions used in LLMs and techniques for aligning models effectively. We will also cover evaluation metrics such as Perplexity and discuss the critical role of humans in evaluating LLMs. Additionally, we will examine the role of GPUs in training models and explore LLM fine-tuning techniques like Prompt Tuning and Parameter Efficient Fine-Tuning (PEFT). By the end of the week, you will have a solid understanding of how to train, optimize, and evaluate LLMs for real-world applications.

What's included

9 videos1 reading2 assignments

9 videosβ€’Total 52 minutes
  • LLM Training and Optimizationβ€’9 minutes
  • Techniques of Learning methods (Few-shot, Zero-shot, Instruction tuning, RLHF)β€’7 minutes
  • Loss Functions of LLMsβ€’6 minutes
  • LLM Alignment Techniquesβ€’5 minutes
  • Evaluation Metrics of LLMβ€’4 minutes
  • Perplexityβ€’4 minutes
  • Role of Humans in Evaluation of LLMsβ€’5 minutes
  • Role of GPUs in Model Trainingβ€’5 minutes
  • LLM Finetuning - Prompt Tuning & PEFTβ€’6 minutes
1 readingβ€’Total 10 minutes
  • Overview of Training, Optimization, and Evaluation of LLMsβ€’10 minutes
2 assignmentsβ€’Total 30 minutes
  • Training, Optimization, and Evaluation of LLMs - Assessmentβ€’15 minutes
  • LLM Training & Optimization - Knowledge checkβ€’15 minutes

Welcome to Week 3 of the NVIDIA: Large Language Models and Generative AI Deployment course. This week, we will cover essential strategies for deploying Large Language Models (LLMs) in real-world applications. We will start by exploring various deployment strategies and how to choose the right one for different scenarios. Next, we will introduce ONNX as a tool for unifying the deep learning landscape, and demonstrate how to convert deep learning models using ONNX. We will also focus on monitoring LLMs in production, covering best practices for ensuring their performance and reliability. Finally, we will dive into the NVIDIA ecosystem and how it supports LLM deployment, enhancing model efficiency and scalability. By the end of the week, you will have a clear understanding of LLM deployment and monitoring techniques.

What's included

5 videos3 readings2 assignments

5 videosβ€’Total 23 minutes
  • LLM Deployment Strategiesβ€’5 minutes
  • ONNX: Unifying the Deep Learning Landscapeβ€’4 minutes
  • Convert the Deep Learning Model with ONNX - Demoβ€’4 minutes
  • Monitoring the LLM Models in Productionβ€’6 minutes
  • NVIDIA Eco System in LLM Deploymentβ€’4 minutes
3 readingsβ€’Total 30 minutes
  • Overview of LLM Deployment Strategies and Monitoringβ€’10 minutes
  • Key Takeaways of the courseβ€’10 minutes
  • Course Conclusionβ€’10 minutes
2 assignmentsβ€’Total 30 minutes
  • LLM Deployment Strategies and Monitoring - Assessmentβ€’15 minutes
  • LLM Deployment and Optimization Strategies - Knowledge checkβ€’15 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Whizlabs
166 Coursesβ€’125,579 learners

Explore more from Software Development

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,