RAG Systems in Practice
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
RAG Systems in Practice
This course is part of LLM Engineering: Prompting, Fine-Tuning, Optimization & RAG Specialization
Instructor: Edureka
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
How to build and optimize Retrieval-Augmented Generation (RAG) systems using LangChain and FAISS.
Techniques for enhancing retrieval accuracy through hybrid search, re-ranking, and grounding methods.
How to deploy RAG systems into production environments and integrate them with APIs and platforms like Streamlit.
Best practices for monitoring, evaluating, and scaling RAG systems for optimal performance.
Skills you'll gain
Details to know
January 2026
14 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
This course introduces the core concepts and techniques behind Retrieval-Augmented Generation (RAG) systems, guiding you through building, optimizing, and deploying powerful AI systems that combine language models with external knowledge sources. Whether you are new to RAG or looking to deepen your understanding, this course provides a hands-on approach to mastering RAG workflows and improving model accuracy.
Through detailed lessons, demonstrations, and real-world applications, youβll learn how to preprocess and index documents, generate embeddings, construct RAG pipelines, and deploy production-ready systems. Youβll also explore advanced optimization techniques to enhance retrieval quality, scalability, and context relevance. By the end of this course, you will be able to: β’ Understand the fundamentals of Retrieval-Augmented Generation and its applications in AI. β’ Apply text preprocessing and embedding techniques to improve document retrieval. β’ Build and optimize RAG pipelines using LangChain and FAISS. β’ Utilize hybrid retrieval, re-ranking, and grounding methods to enhance context accuracy. β’ Deploy and evaluate RAG systems in production environments for optimal performance. This course is ideal for AI enthusiasts, machine learning practitioners, and developers looking to specialize in building advanced AI systems that integrate external knowledge with language models. No prior experience with RAG systems is required, but a basic understanding of Python and machine learning concepts will be beneficial. Join us to begin your journey into the world of Retrieval-Augmented Generation and learn how to build efficient, scalable, and accurate AI systems!
In this module, learners will explore the fundamentals of Retrieval-Augmented Generation (RAG), including how it combines language models with external knowledge sources for improved accuracy. Key concepts such as text embeddings, vector stores, and document preprocessing will be introduced, with hands-on demonstrations to build simple RAG workflows and visualize context retrieval.
What's included
13 videos5 readings4 assignments1 discussion prompt
13 videosβ’Total 72 minutes
- Specialization Introductionβ’7 minutes
- Course Introductionβ’4 minutes
- Introduction to RAGβ’7 minutes
- Demonstration: Building Simple Rag Workflowβ’6 minutes
- Demonstration: Visualizing Context Retrieval Flow-Iβ’5 minutes
- Demonstration: Visualizing Context Retrieval Flow IIβ’5 minutes
- Importance of Embeddings in Retrieval System Designβ’5 minutes
- Understanding Text Embeddings and Similarity Searchβ’5 minutes
- Demonstration: Generating Embeddings Using OpenAI APIβ’6 minutes
- Demonstration: Building a FAISS Vector Storeβ’5 minutes
- Splitting and Cleaning Documents for Indexingβ’5 minutes
- Demonstration: Using LangChain Loaders for PDFs and Text Filesβ’6 minutes
- Demonstration: Chunking and Normalizing Text Dataβ’6 minutes
5 readingsβ’Total 85 minutes
- Welcome to RAG Systems in Practiceβ’10 minutes
- Overview of Retrieval-Augmented Generation Systemsβ’20 minutes
- Text Embeddings and Semantic Search Fundamentalsβ’20 minutes
- Document Preprocessing Techniques for RAG Systemsβ’20 minutes
- Module Summary: Introduction to Retrieval Systemsβ’15 minutes
4 assignmentsβ’Total 48 minutes
- Knowledge Check: Introduction to Retrieval Systemsβ’30 minutes
- Practice Knowledge Check: Understanding Retrieval-Augmented Generation (RAG)β’6 minutes
- Practice Knowledge Check: Embeddings and Vector Storesβ’6 minutes
- Practice Knowledge Check: Preprocessing for Effective Retrievalβ’6 minutes
1 discussion promptβ’Total 10 minutes
- Introduce Yourselfβ’10 minutes
Learners will focus on building and optimizing RAG pipelines using LangChain. They will explore techniques like hybrid retrieval, re-ranking, and grounding to improve context accuracy. The module includes practical applications for creating, testing, and evaluating high-performance RAG workflows.
What's included
16 videos5 readings5 assignments
16 videosβ’Total 96 minutes
- Retrieval Pipelines in RAG Systemsβ’6 minutes
- Connecting Vector Stores to LLMsβ’6 minutes
- Demonstration: Creating a Retriever Chain with LangChainβ’4 minutes
- Demonstration: Query Testing and Context Rankingβ’7 minutes
- Hybrid Retrieval and Re-Ranking in RAGβ’6 minutes
- Re-Ranking with Cross-Encoder and BM25β’7 minutes
- Demonstration: Combining Dense and Sparse Retrievalβ’6 minutes
- Demonstration: Evaluating Retrieval Precisionβ’6 minutes
- Hallucinations as a Retrieval Problemβ’6 minutes
- Context Window Managementβ’5 minutes
- Demonstration: Reducing Hallucinations via Grounded Contextβ’7 minutes
- Demonstration: Adding Citation References in RAG Outputβ’6 minutes
- Introduction to LangGraphβ’5 minutes
- Demonstration: Building a Stateful RAG Graph with LangGraphβ’7 minutes
- Demonstration: Decision-Driven RAG Orchestration with LangGraph - Iβ’6 minutes
- Demonstration: Decision-Driven RAG Orchestration with LangGraph - IIβ’6 minutes
5 readingsβ’Total 120 minutes
- Building Retrieval Pipelines with LangChain and FAISSβ’45 minutes
- Hybrid Search Techniques for Context Accuracyβ’20 minutes
- Improving Context Relevance and Grounding in RAGβ’20 minutes
- Designing Graph-Based LLM Workflows with LangGraphβ’20 minutes
- Module Summary : Building and Optimizing RAG Pipelinesβ’15 minutes
5 assignmentsβ’Total 54 minutes
- Knowledge Check: Building and Optimizing RAG Pipelinesβ’30 minutes
- Practice Knowledge Check: Retrieval Pipelines in LangChainβ’6 minutes
- Practice Knowledge Check: Hybrid and Re-Ranking Techniquesβ’6 minutes
- Practice Knowledge Check: Enhancing Context Qualityβ’6 minutes
- Practice Knowledge Check: Orchestrating RAG Workflows with LangGraphβ’6 minutes
This module covers the deployment and evaluation of RAG systems in real-world applications. Learners will explore deployment strategies, API integration, and performance monitoring. They will also learn how to optimize RAG systems for scalability and efficiency in production environments.
What's included
19 videos5 readings4 assignments
19 videosβ’Total 100 minutes
- RAG System Deployment in Productionβ’5 minutes
- Optimized End-to-End RAG Pipeline and System Designβ’6 minutes
- Demonstration: Deploying RAG App with Streamlit : RAG Core - Retrieval Setupβ’7 minutes
- Demonstration: Deploying RAG App with Streamlit : RAG Core - Question Answeringβ’3 minutes
- Demonstration: Deploying RAG App with Streamlit : Data and Authenticationβ’6 minutes
- Demonstration: Deploying RAG App with Streamlit : Ingestion and Prompt Designβ’5 minutes
- Demonstration: Deploying RAG App with Streamlit : Context Retrieval and Utilsβ’4 minutes
- Demonstration: Deploying RAG App with Streamlit : Deployment β’5 minutes
- Demonstration: Integrating API Endpoints for Retrieval-Iβ’7 minutes
- Demonstration: Integrating API Endpoints for Retrieval-IIβ’6 minutes
- Evaluating RAG System Performanceβ’5 minutes
- Benchmarking RAG Performanceβ’3 minutes
- Demonstration: Using LangSmith or LlamaIndex for Local Evaluationβ’6 minutes
- Demonstration: Analyzing Cost and Latency Metricsβ’6 minutes
- Accuracy vs Scalability in RAGβ’4 minutes
- Enhancing Query Understanding and Rankingβ’4 minutes
- Demonstration: Implementing Hybrid Retrieval at Scaleβ’6 minutes
- Demonstration: Optimizing Latency and Throughput for RAG Systemsβ’6 minutes
- Production-Ready RAG System Architectureβ’5 minutes
5 readingsβ’Total 105 minutes
- Deploying Retrieval-Augmented Generation Applicationsβ’20 minutes
- Evaluating RAG Pipelines: Metrics and Observability Toolsβ’20 minutes
- Scaling RAG Systems for High-Performance Applicationsβ’20 minutes
- Module Summary : Deploying and Evaluating RAG Systemsβ’15 minutes
- A Practical Guide to Building Scalable LLM Applicationsβ’30 minutes
4 assignmentsβ’Total 48 minutes
- Knowledge Check: Deploying and Evaluating RAG Systemsβ’30 minutes
- Practice Knowledge Check: RAG Deployment Fundamentalsβ’6 minutes
- Practice Knowledge Check: Monitoring and Evaluationβ’6 minutes
- Practice Knowledge Check: Retrieval Accuracy and Scalabilityβ’6 minutes
In the final module, learners will apply their knowledge by completing a practice project and final assessment. They will review key concepts and build a production-ready RAG system, preparing them to implement RAG in real-world projects.
What's included
1 video1 reading1 assignment1 discussion prompt
1 videoβ’Total 2 minutes
- Course Summary: RAG Systems in Practiceβ’2 minutes
1 readingβ’Total 45 minutes
- Practice Project: Building and Deploying a Scalable RAG Systemβ’45 minutes
1 assignmentβ’Total 30 minutes
- End Course Knowledge Check: RAG Systems in Practiceβ’30 minutes
1 discussion promptβ’Total 10 minutes
- Describe your Learning Journeyβ’10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Software Development
- Status: Free Trial
Course
- Status: Free Trial
Course
- Status: Free Trial
- D
Duke University
Guided Project
Why people choose Coursera for their career
Frequently asked questions
This course teaches how to build, optimize, and deploy Retrieval-Augmented Generation (RAG) systems, integrating language models with external knowledge sources for more accurate AI responses.
This course is for AI enthusiasts, machine learning practitioners, and developers interested in learning how to build advanced retrieval-based AI systems.
A basic understanding of Python and machine learning concepts is recommended for this course, though no prior RAG experience is required.
More questions
Financial aid available,
