RAG-Driven Generative AI
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
RAG-Driven Generative AI
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Scale RAG pipelines to handle large datasets efficiently
Implement techniques that reduce hallucinations and improve response accuracy
Customize and scale RAG-driven AI systems across different domains
Skills you'll gain
Tools you'll learn
Details to know
March 2026
10 assignments
See how employees at top companies are mastering in-demand skills
There are 10 modules in this course
This course introduces the powerful concept of Retrieval-Augmented Generation (RAG), a technique used to optimize the performance, accuracy, and cost of generative AI systems. Focused on building AI pipelines with LlamaIndex, Deep Lake, and Pinecone, this course will equip you with the skills to create robust AI models capable of handling complex datasets and delivering traceable, context-aware outputs.
You will explore how to scale RAG pipelines, implement strategies to minimize hallucinations, and improve response accuracy across multimodal AI systems. By the end of the course, you will have hands-on experience optimizing these systems for real-world applications, empowering you to enhance decision-making and operational efficiency. What sets this course apart is its unique combination of theory and practical implementation. By working with cutting-edge tools like LlamaIndex and Pinecone, you'll understand how to balance cost, performance, and accuracy, while gaining insight into the broader context of AI pipelines and decision-making. This course is ideal for data scientists, AI engineers, and MLOps professionals who are looking to expand their expertise in RAG and generative AI. A basic understanding of machine learning concepts is recommended, as the course builds on these foundations to explore more advanced techniques.
In this section, we explore Retrieval Augmented Generation (RAG) frameworks, focusing on naive, advanced, and modular configurations. We implement Python-based RAG systems for improved AI accuracy and adaptability.
What's included
2 videos6 readings1 assignment
2 videosβ’Total 2 minutes
- Course Overviewβ’1 minute
- Why Retrieval Augmented Generation? - Overview Videoβ’1 minute
6 readingsβ’Total 65 minutes
- Introductionβ’15 minutes
- Human Feedback (E2)β’10 minutes
- Advanced techniques and evaluationβ’10 minutes
- Metricsβ’10 minutes
- Generationβ’10 minutes
- Generationβ’10 minutes
1 assignmentβ’Total 10 minutes
- Exploring Retrieval Augmented Generationβ’10 minutes
In this section, we cover building and managing RAG pipelines with Deep Lake and OpenAI for efficient AI data handling.
What's included
1 video4 readings1 assignment
1 videoβ’Total 1 minute
- RAG Embedding Vector Stores with Deep Lake and OpenAI - Overview Videoβ’1 minute
4 readingsβ’Total 70 minutes
- Introductionβ’20 minutes
- Authentication Processβ’20 minutes
- Retrieving a Batch of Prepared Documentsβ’10 minutes
- 3. Augmented input generationβ’20 minutes
1 assignmentβ’Total 10 minutes
- RAG Embedding Vector Stores with Deep Lake and OpenAIβ’10 minutes
In this section, we explore index-based RAG pipelines using LlamaIndex, Deep Lake, and OpenAI to enhance traceability, precision, and control in AI-driven data retrieval and generation.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Building Index-Based RAG with LlamaIndex, Deep Lake, and OpenAI - Overview Videoβ’1 minute
5 readingsβ’Total 90 minutes
- Introductionβ’10 minutes
- Building a Semantic Search Engine and Generative Agent for Drone Technologyβ’20 minutes
- Creating and Populating a Deep Lake Vector Storeβ’10 minutes
- Vector Store Index Query Engineβ’20 minutes
- Tree Index Query Engineβ’30 minutes
1 assignmentβ’Total 10 minutes
- Indexing and Retrieval in AI Systemsβ’10 minutes
In this section, we explore multimodal modular RAG for drone technology, integrating text and image data retrieval, generation, and performance evaluation using LLMs and MMLLMs.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Multimodal Modular RAG for Drone Technology - Overview Videoβ’1 minute
5 readingsβ’Total 70 minutes
- Introductionβ’10 minutes
- Building a Multimodal Modular RAG Program for Drone Technologyβ’10 minutes
- Loading and Visualizing the Multimodal Datasetβ’20 minutes
- Building a Multimodal Query Engineβ’10 minutes
- Multimodal Modular Summaryβ’20 minutes
1 assignmentβ’Total 10 minutes
- Multimodal RAG in Drone Applicationsβ’10 minutes
In this section, we explore adaptive RAG with human feedback loops, focusing on improving retrieval quality and integrating expert input.
What's included
1 video4 readings1 assignment
1 videoβ’Total 1 minute
- Boosting RAG Performance with Expert Human Feedback - Overview Videoβ’1 minute
4 readingsβ’Total 65 minutes
- Introductionβ’10 minutes
- Building Hybrid Adaptive RAG in Pythonβ’20 minutes
- No RAGβ’5 minutes
- RAG with No Human-Expert Feedback Documentsβ’30 minutes
1 assignmentβ’Total 10 minutes
- Enhancing RAG Systems with Human and Data-Driven Insightsβ’10 minutes
In this section, we explore scalable RAG techniques for bank customer data using Pinecone and OpenAI. Key concepts include EDA, vector scaling, and AI-driven recommendations to reduce churn.
What's included
1 video7 readings1 assignment
1 videoβ’Total 1 minute
- Scaling RAG Bank Customer Data with Pinecone - Overview Videoβ’1 minute
7 readingsβ’Total 120 minutes
- Introductionβ’10 minutes
- Collecting the Datasetβ’10 minutes
- Data Preparation and Clusteringβ’10 minutes
- Scaling a Pinecone Index (Vector Store)β’10 minutes
- Chunkingβ’30 minutes
- Upsertingβ’20 minutes
- RAG Generative AIβ’30 minutes
1 assignmentβ’Total 10 minutes
- Scaling RAG Bank Customer Data with Pineconeβ’10 minutes
In this section, we explore building scalable RAG systems using knowledge graphs, implementing the Wikipedia API, populating a Deep Lake vector store, and constructing a LlamaIndex knowledge graph for semantic search.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Building Scalable Knowledge-Graph-Based RAG with Wikipedia API and LlamaIndex - Overview Videoβ’1 minute
5 readingsβ’Total 80 minutes
- Introductionβ’10 minutes
- Building Graphs from Treesβ’20 minutes
- Preparing the Data for Upsertionβ’30 minutes
- Re-rankingβ’10 minutes
- Metric Calculation and Displayβ’10 minutes
1 assignmentβ’Total 10 minutes
- Exploring Knowledge-Graph-Based RAG Systemsβ’10 minutes
In this section, we explore dynamic RAG using Chroma and Llama, focusing on embedding and querying temporary data for real-time decision-making with open-source tools.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Dynamic RAG with Chroma and Hugging Face Llama - Overview Videoβ’1 minute
5 readingsβ’Total 100 minutes
- Introductionβ’10 minutes
- Installing the Environmentβ’10 minutes
- Activating Session Timeβ’30 minutes
- Querying the Collectionβ’20 minutes
- Prompt and Retrievalβ’30 minutes
1 assignmentβ’Total 10 minutes
- Dynamic RAG and AI System Integrationβ’10 minutes
In this section, we explore RAG data reduction through fine-tuning, focusing on preparing JSONL datasets and evaluating model performance with OpenAI metrics for improved accuracy and cost-effectiveness.
What's included
1 video3 readings1 assignment
1 videoβ’Total 1 minute
- Empowering AI Models Fine-Tuning RAG Data and Human Feedback - Overview Videoβ’1 minute
3 readingsβ’Total 60 minutes
- Introductionβ’40 minutes
- Using the fine-tuned OpenAI modelβ’10 minutes
- Metricsβ’10 minutes
1 assignmentβ’Total 10 minutes
- Enhancing AI Performance Through Data and Feedbackβ’10 minutes
In this section, we explore RAG pipeline implementation for video generation, embedding video comments in Pinecone, and enhancing labels with GPT-4o analysis for efficient video stock production.
What's included
1 video7 readings1 assignment
1 videoβ’Total 1 minute
- RAG for Video Stock Production with Pinecone and OpenAI - Overview Videoβ’1 minute
7 readingsβ’Total 130 minutes
- Introductionβ’10 minutes
- The Environment of the Video Production Ecosystemβ’10 minutes
- Pipeline Generator and Commentatorβ’10 minutes
- Video Download and Display Functionsβ’30 minutes
- The Generator and the Commentatorβ’10 minutes
- Pipeline Controllerβ’30 minutes
- The Video Expertβ’30 minutes
1 assignmentβ’Total 10 minutes
- RAG and AI in Video Production Fundamentalsβ’10 minutes
Instructor
Offered by
Explore more from Software Development
- Status: Free Trial
Specialization
- Status: Free Trial
- Status: Free Trial
Course
- Status: Free TrialE
Edureka
Course
Why people choose Coursera for their career
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. Youβll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. Youβll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
