Working with large language models using Azure
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Working with large language models using Azure
This course is part of multiple programs.
Instructor: Microsoft
Included with
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Apply prompt engineering techniques to improve Large Language Model responses
Build Retrieval-Augmented Generation (RAG) pipelines using Azure services
Fine-tune and customize LLMs for domain-specific AI applications
Develop and deploy generative AI applications using Azure AI Foundry
Skills you'll gain
Tools you'll learn
Details to know
January 2026
23 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
Learn how to build, customize, and deploy generative AI applications using Large Language Models (LLMs) and Microsoft Azure. This hands-on course introduces the practical techniques developers use to improve AI application performance, reliability, and business relevance.
You’ll begin by exploring how LLMs work, including their architecture, capabilities, and limitations. From there, you’ll apply prompt engineering strategies to improve model outputs and build more effective AI interactions. The course then introduces Retrieval-Augmented Generation (RAG) pipelines, teaching you how to connect LLMs with external data sources to deliver grounded, accurate responses. You’ll also learn how to customize models using fine-tuning techniques and evaluate when to use fine-tuning, RAG, or hybrid approaches for different business scenarios. In the final modules, you’ll build and deploy generative AI applications using Azure AI Foundry and Azure OpenAI services while learning deployment, monitoring, and cost management strategies. By the end of this course, you’ll have practical experience building AI-powered applications using modern Azure AI tools and workflows.
This foundational module introduces the core concepts behind Large Language Models (LLMs). You will start by exploring the fundamental architecture that powers models like GPT (Generative Pre-trained Transformer) and learn how they process information and generate human-like text. The second half of the module is dedicated to prompt engineering, where you will learn and apply essential techniques—from basic commands to advanced strategies like few-shot learning and chain-of-thought—to effectively communicate with and control AI models to achieve desired outcomes. Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025. Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
What's included
7 videos8 readings5 assignments
7 videos•Total 30 minutes
- Building Solutions with Large Language Models on Azure•3 minutes
- Introduction to LLMs and prompt engineering•3 minutes
- The impact of LLMs•6 minutes
- A look inside an LLM: From prompt to response•5 minutes
- Why Prompt Engineering Matters•4 minutes
- Crafting effective prompts•6 minutes
- Applying LLM Fundamentals and Prompt Engineering in Practice•2 minutes
8 readings•Total 95 minutes
- Learning Paths and Prerequisites for Working with LLMs•5 minutes
- Overview of LLM interaction•10 minutes
- Exploring LLM architecture•15 minutes
- LLM fundamentals: From tokens to sequential models•15 minutes
- The blueprint of modern LLMs: The transformer architecture•15 minutes
- Insights from LLM interactions•10 minutes
- Techniques in prompt engineering•15 minutes
- Prompt engineering success strategies•10 minutes
5 assignments•Total 180 minutes
- Interacting with LLMs: Basics•30 minutes
- LLM architecture: Practice Quiz•30 minutes
- Creating successful prompts•60 minutes
- Prompt engineering skills: Practice Quiz•30 minutes
- Graded Quiz: LLM Fundamentals and Prompt Engineering•30 minutes
This module focuses on one of the most powerful techniques for enhancing LLMs: Retrieval-Augmented Generation (RAG). You will learn how to ground models in external, private, or real-time data sources to provide more accurate and contextually relevant responses. You will start by building a basic RAG pipeline using Azure services and then progress to constructing and optimizing advanced systems with techniques like semantic ranking and sophisticated data chunking strategies. Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025. Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
What's included
5 videos6 readings6 assignments
5 videos•Total 24 minutes
- Introduction to RAG: Grounding AI with data•5 minutes
- RAG pipelines explained•6 minutes
- Data sources for RAG: Azure AI Search and the Marketplace•6 minutes
- Advanced RAG configurations•5 minutes
- End-to-End RAG Pipelines: From Setup to Optimization•3 minutes
6 readings•Total 70 minutes
- Understanding RAG frameworks•15 minutes
- Introduction to RAG techniques•10 minutes
- Reviewing your first RAG pipeline•10 minutes
- Advanced RAG pipeline techniques•15 minutes
- Effective RAG optimization strategies•10 minutes
- Case study: Implementing advanced RAG in a corporate setting•10 minutes
6 assignments•Total 215 minutes
- Exploring RAG pipelines•30 minutes
- Basic RAG pipeline setup•35 minutes
- RAG fundamentals: Practice Quiz•30 minutes
- Optimizing RAG implementations•60 minutes
- Advanced RAG skills evaluation: Practice Quiz•30 minutes
- RAG Pipeline Design, Optimization, and Evaluation Assessment•30 minutes
This module explores fine-tuning as a powerful method for customizing an LLM's core behavior, style, or knowledge for specialized tasks. You will learn the entire fine-tuning workflow, from preparing a high-quality dataset to launching the training job and evaluating the customized model's performance in Azure. Critically, you will learn to strategically decide when to use fine-tuning versus RAG—or a hybrid of both—to create highly effective, domain-specific AI solutions. Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025. Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
What's included
4 videos7 readings6 assignments
4 videos•Total 19 minutes
- The art of fine-tuning•6 minutes
- A guided tour of Azure's fine-tuning interface•5 minutes
- Integrating domain expertise into your application•5 minutes
- Mastering Customization: From Fine-Tuning to Strategic Decision-Making•2 minutes
7 readings•Total 70 minutes
- Fine-tuning techniques•10 minutes
- Learnings from fine-tuning LLMs•10 minutes
- Evaluating your custom fine-tuned model•10 minutes
- Strategies for domain integration•10 minutes
- A framework for evaluating custom models•10 minutes
- Analyzing domain specific LLMs•10 minutes
- RAG vs. fine-tuning: A strategic decision framework•10 minutes
6 assignments•Total 215 minutes
- Fine-tuning practice•30 minutes
- Customized LLM implementation•35 minutes
- Fine-tuning comprehension: Practice Quiz•30 minutes
- From customization to application: A domain-specific LLM lab•60 minutes
- Real world use assessment: Practice Quiz•30 minutes
- Customization Strategies and Model Evaluation Assessment•30 minutes
This module transitions from theory to practice by guiding you through the end-to-end process of building and deploying a complete generative AI application. You will learn to design an application's architecture and user flow before using Azure AI Foundry and Prompt flow tools to build it. The module then covers the critical MLOps lifecycle, teaching you how to deploy your application as a secure endpoint, manage it in a production environment, and implement monitoring with Azure Monitor for performance and cost. Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025.Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
What's included
6 videos6 readings6 assignments
6 videos•Total 28 minutes
- Introduction to application development: From model to product•3 minutes
- Harnessing Generative AI: From models to products•5 minutes
- Visualizing an application with prompt flow•7 minutes
- Deploying on Azure AI Foundry•6 minutes
- Module 4 summary: Your journey as an AI application developer•2 minutes
- From Models to Production: Mastering Generative AI Applications•4 minutes
6 readings•Total 65 minutes
- Foundations for generative applications•10 minutes
- Building successful generative AI apps•10 minutes
- Key concepts in prompt flow development•10 minutes
- Deployment and management techniques•15 minutes
- Effective management of AI applications•10 minutes
- The MLOps lifecycle for generative AI•10 minutes
6 assignments•Total 220 minutes
- Application design basics•60 minutes
- Application development with Azure•40 minutes
- Evaluating generative application architectures: Practice Quiz•30 minutes
- Application deployment and monitoring•30 minutes
- Deployment and management skills: Practice Quiz•30 minutes
- Generative AI Application Development and Deployment Assessment•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
