Building RAG Systems with Open Models
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Building RAG Systems with Open Models
This course is part of Open Generative AI: Build with Open Models and Tools Professional Certificate
Included with
Ask Coursera
Recommended experience
Recommended experience
Skills you'll gain
Tools you'll learn
Details to know
February 2026
4 assignments
See how employees at top companies are mastering in-demand skills
Build your Machine Learning expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Coursera
There are 4 modules in this course
The Building RAG Systems with Open Models course is designed for developers, engineers, and technical product builders who are new to Generative AI but already have intermediate machine learning knowledge, basic Python proficiency, and familiarity with development environments such as VS Code, and who want to engineer, customize, and deploy open generative AI solutions while avoiding vendor lock-in.
The course provides learners with the skills to design and implement retrieval-augmented generation (RAG) applications for real-world use cases. Learners start by exploring the fundamentals of RAG architecture, breaking down key components such as retrievers, rankers, generators, and orchestration layers, while learning design patterns for tasks like question answering, summarization, and knowledge synthesis. They then dive into embeddings and vector databases, comparing FAISS, ChromaDB, Milvus, and Pinecone, and applying indexing and chunking strategies to improve retrieval efficiency and semantic relevance. The final module brings all elements together to build production-ready RAG pipelines using LangChain and open LLMs, incorporating advanced retrieval methods, hallucination mitigation, and evaluation frameworks for accuracy and reliability. By the end, learners will have built a functional RAG application with configurable components, optimized for performance and equipped with robust evaluation metrics.
Learn the fundamentals of Retrieval-Augmented Generation (RAG) and why it’s critical for reducing hallucinations and improving accuracy. You’ll break down RAG’s core components, retrievers, rankers, generators, and orchestration layers, and apply design patterns for use cases like Q&A, summarization, and knowledge synthesis. You’ll also explore advanced variations such as hierarchical retrieval and hybrid search, giving you practical strategies to match RAG designs to real-world needs.
What's included
1 video1 reading1 assignment2 ungraded labs
1 video•Total 11 minutes
- Inside RAG: Components That Make It Work•11 minutes
1 reading•Total 10 minutes
- Code Demonstration Transcripts•10 minutes
1 assignment•Total 30 minutes
- Matching RAG Architectures to Real Use Cases•30 minutes
2 ungraded labs•Total 120 minutes
- Explore a Working RAG Demo•60 minutes
- Make RAG Work for You•60 minutes
Evaluate embedding models and vector databases to understand how they impact retrieval quality and system performance. You’ll compare embedding options by dimensionality and domain fit, and explore database choices such as Facebook AI Similarity Search (FAISS), ChromaDB, Milvus, and Pinecone. You’ll also analyze indexing strategies, chunking methods, and update workflows—skills that help you make informed decisions when building retrieval systems for different environments.
What's included
2 videos1 reading1 assignment1 ungraded lab
2 videos•Total 14 minutes
- Podcast: Why Choosing the Right Embeddings Makes or Breaks Your System•4 minutes
- How Database & Embedding Choices Affect RAG•9 minutes
1 reading•Total 15 minutes
- The Building Blocks: Embeddings and Databases Explained•15 minutes
1 assignment•Total 30 minutes
- Which Setup Would You Choose?•30 minutes
1 ungraded lab•Total 60 minutes
- Compare Embeddings and Databases in Action•60 minutes
You’ll put theory into practice by integrating embeddings and vector databases into working RAG pipelines. You’ll test indexing strategies, experiment with chunking, and observe how different configurations affect retrieval accuracy and efficiency. You’ll also practice maintaining and updating vector indices, building the skills to manage RAG systems that remain reliable as datasets grow and change.
What's included
1 video1 reading1 assignment2 ungraded labs
1 video•Total 3 minutes
- Podcast: From Theory to Practice: Making RAG Actually Work•3 minutes
1 reading•Total 15 minutes
- Maintaining Vector Indices in the Real World•15 minutes
1 assignment•Total 30 minutes
- Applying What You Built•30 minutes
2 ungraded labs•Total 120 minutes
- Build and Query Your First Vector Database•60 minutes
- Tuning Your Retrieval Setup•60 minutes
Assemble full RAG pipelines using frameworks like LangChain and open Large Language Models (LLMs). You’ll implement advanced retrieval strategies such as hybrid search, re-ranking, and query expansion, and evaluate pipelines with metrics that track accuracy, latency, and reliability. You’ll also practice handling real-world challenges, such as hallucination mitigation and citation tracking, ensuring your systems are not just demos, but production-ready solutions.
What's included
4 videos1 reading1 assignment2 ungraded labs
4 videos•Total 26 minutes
- Building Your First RAG Workflow with LangChain•9 minutes
- Optimizing & Modularizing RAG with LangChain•6 minutes
- Evaluating and Optimizing Your RAG System•9 minutes
- Podcast: Bringing RAG Systems Together: From Concept to Production •3 minutes
1 reading•Total 8 minutes
- Advanced Retrieval Tactics That Improve Accuracy•8 minutes
1 assignment•Total 60 minutes
- End-to-End RAG Systems in Practice•60 minutes
2 ungraded labs•Total 120 minutes
- Assemble a RAG Pipeline•60 minutes
- Experiment with Retrieval Strategies•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Explore more from Machine Learning
- Status: Free TrialE
Edureka
Course
- P
Packt
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
