Project: Generative AI Applications with RAG and LangChain
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Project: Generative AI Applications with RAG and LangChain
This course is part of multiple programs.
Instructors: Kang Wang
28,885 already enrolled
Included with
Ask Coursera
249 reviews
Recommended experience
249 reviews
Recommended experience
What you'll learn
Gain practical experience building your own real-world generative AI application to showcase in interviews
Create and configure a vector database to store document embeddings and develop a retriever to fetch relevant segments based on user queries
Set up a simple Gradio interface for user interaction and build a question-answering bot using LangChain and a large language model (LLM)
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 3 modules in this course
Get ready to put your generative AI engineering skills into practice! In this hands-on guided project, you’ll apply the knowledge and techniques gained throughout the previous courses in the program to build your own real-world generative AI application.
You’ll begin by filling in key knowledge gaps, such as using LangChain’s document loaders to ingest documents from various sources. You’ll then explore and apply text-splitting strategies to improve model responsiveness and use IBM watsonx to embed documents. These embeddings will be stored in a vector database, which you’ll connect to LangChain to develop an effective document retriever. As your project progresses, you’ll implement retrieval-augmented generation (RAG) to enhance retrieval accuracy, construct a question-answering bot, and build a simple Gradio interface for interactive model responses. By the end of the course, you’ll have a complete, portfolio-ready AI application that showcases your skills and serves as compelling evidence of your ability to engineer real-world generative AI solutions. If you're ready to elevate your career with hands-on experience, enroll today and take the next step toward becoming a confident AI engineer.
In this module, you will explore essential techniques for loading, preparing, and structuring documents to build effective retrieval-augmented generation (RAG) applications using LangChain. You will learn how to use LangChain’s document loaders to import content from various sources, apply best practices for document ingestion, and implement text-splitting strategies to enhance model responsiveness. You will also examine when and how to incorporate entire documents into prompts for optimal output. Through hands-on labs, you’ll gain practical experience by loading documents and applying text-splitting techniques in real-world scenarios.
What's included
3 videos4 readings2 assignments3 app items1 plugin
3 videos•Total 17 minutes
- Course Introduction •4 minutes
- Load Your Document from Different Sources •7 minutes
- Strategies for Splitting Text for Optimal Processing •6 minutes
4 readings•Total 14 minutes
- Course Overview •5 minutes
- Specialization Overview •5 minutes
- Best Practices for Loading Documents in LangChain Applications •3 minutes
- Reading: Summary and Highlights •1 minute
2 assignments•Total 18 minutes
- Practice Quiz: Different Document Loaders from LangChain•9 minutes
- Practice Quiz: Text Splitter•9 minutes
3 app items•Total 110 minutes
- Lab: Load Documents Using LangChain for Different Sources•60 minutes
- Lab: Put Whole Document into Prompt and Ask the Model•20 minutes
- Lab: Apply Text Splitting Techniques to Enhance Model Responsiveness•30 minutes
1 plugin•Total 1 minute
- Helpful Tips for Course Completion•1 minute
In this module, you will learn how to embed documents using watsonx’s embedding model and store these embeddings using vector databases, such as Chroma DB and FAISS. You will explore the role of embeddings in RAG pipelines, configure vector stores to manage these embeddings, and use LangChain to preprocess documents for embedding. Additionally, you will gain hands-on experience with advanced retrievers in LangChain, such as Vector Store-Based, Multi-Query, Self-Query, and Parent Document retrievers, to extract relevant information from documents efficiently. Finally, you’ll compare RAG-based approaches with fine-tuning using InstructLab to evaluate their trade-offs and applicability.
What's included
3 videos1 reading2 assignments3 app items2 plugins
3 videos•Total 12 minutes
- Introduction to Vector Databases for Storing Embeddings •5 minutes
- Explore Advanced Retrievers in Langchain: Part 1•3 minutes
- Explore Advanced Retrievers in Langchain - Part 2•5 minutes
1 reading•Total 2 minutes
- Module Summary: RAG Using LangChain •2 minutes
2 assignments•Total 18 minutes
- Practice Quiz: Embedding the Document•9 minutes
- Practice Quiz: Retriever•9 minutes
3 app items•Total 100 minutes
- Lab: Embed Documents using watsonx’s Embedding Model•30 minutes
- Lab: Create and Configure a Vector Database to Store Document Embeddings•30 minutes
- Lab: Develop a Retriever to Fetch Document Segments Based on Queries•40 minutes
2 plugins•Total 25 minutes
- Embed Documents Using watsonx’s Embedding Model•10 minutes
- Reading: Compare Fine-Tuning Using InstructLab with RAG•15 minutes
In this module, you will combine all the components you’ve learned to build a complete generative AI application using LangChain and RAG. You’ll learn how to implement RAG to improve information retrieval, set up user interfaces using Gradio, and construct a question-answering bot that leverages LLMs and LangChain to respond to queries from loaded documents. Through hands-on labs, you’ll practice building a Gradio interface and developing your own QA bot. In the final project, you will build an AI application using RAG and LangChain. The supporting materials, like a cheat sheet and glossary, will reinforce your understanding, build confidence in your implementation skills, and assess your learning through a graded quiz. You'll leave this module with a deployable AI-powered assistant and clear the next steps for advancing your skills.
What's included
1 video4 readings3 assignments1 peer review3 app items5 plugins
1 video•Total 4 minutes
- Getting Started with Gradio •4 minutes
4 readings•Total 8 minutes
- Module Summary: Create a QA Bot to Read Your Document •1 minute
- Course Conclusion •3 minutes
- Congratulations and Next Steps•2 minutes
- Thanks from the Course Team•2 minutes
3 assignments•Total 48 minutes
- Practice Quiz: Introduction to Gradio•9 minutes
- Practice Quiz: Build a QA Bot Web App•9 minutes
- Course Graded Quiz: Project: Generative AI Applications with RAG and LangChain•30 minutes
1 peer review•Total 30 minutes
- Option 2: Peer Graded - Final Project Submission and Evaluation•30 minutes
3 app items•Total 90 minutes
- Option 1: AI Graded - Final Project: Submission and Evaluation•30 minutes
- Lab: Set Up a Simple Gradio Interface to Interact with Your Models•30 minutes
- Lab: Construct a QA Bot that Leverages the LangChain and LLM to Answer Questions from Loaded Document•30 minutes
5 plugins•Total 49 minutes
- Reading: Project Overview•2 minutes
- Reading: Construct a QA Bot that Leverages the LangChain and LLM to answer questions from loaded document•15 minutes
- Reading: Final Submission Guidelines and Deliverables•2 minutes
- Cheat Sheet: Project: Generative AI Applications with RAG and LangChain•15 minutes
- Course Glossary: Project: Generative AI Applications with RAG and LangChain•15 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Why people choose Coursera for their career
Learner reviews
- 5 stars
86.74%
- 4 stars
8.43%
- 3 stars
2%
- 2 stars
0.40%
- 1 star
2.40%
Showing 3 of 249
Reviewed on Aug 18, 2025
Simply great! Learnt a lot and also enjoyed the labs!
Reviewed on Mar 18, 2026
It covers the subjectt and includes fantastic lab notebooks.
Reviewed on May 21, 2026
It was really tough, but these 16 courses were well worth the money—because the material is truly awesome, informative, and in-depth, with a significant practical component.
Frequently asked questions
This course is suitable for those interested in AI engineering and includes training, developing, fine-tuning, and deploying large language models (LLMs). It is the ideal project course for learners who have completed the other courses in the Specialization title: Generative AI Engineering with LLMs.
Existing and aspiring data scientists, AI engineers, and machine learning engineers will benefit greatly from completing this project.
With 3–4 hours of study per week, you can complete this course and the guided project in 3 weeks. If you are able to put in more time per week, you can complete it a lot faster!
This course is intermediate level, so you must have basic knowledge of Python. Familiarity with LLMs, LangChain, and RAG would be an added advantage.However, to get the most out of this course, we recommend that you complete all the other courses in the IBM Generative AI Engineering with LLMs specialization.
More questions
Financial aid available,
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
