Building Generative AI-Powered Applications with Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Building Generative AI-Powered Applications with Python
This course is part of multiple programs.
40,529 already enrolled
Included with
Ask Coursera
283 reviews
Recommended experience
283 reviews
Recommended experience
What you'll learn
Explain the core concepts of generative AI, including large language models, speech technologies, and platforms such as IBM watsonX, and Hugging Face
Build generative AI-powered applications and chatbots using LLMs, retrieval-augmented generation(RAG), and foundational Python frameworks
Integrate speech-to-text (STT) and text-to-speech (TTS) technologies to enable voice interfaces in generative AI applications
Develop web-based AI applications using Python libraries, such as Flask and Gradio, along with basic front-end tools like HTML, CSS, and JavaScript
Skills you'll gain
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM
There are 7 modules in this course
Ready for an interactive learning experience to build real-world generative AI applications and chatbots?
In this hands-on course, you’ll develop a series of guided projects using Python, Flask, Gradio, and LangChain to create AI-powered applications for practical scenarios, including a voice assistant, a meeting summarizer, a language translator, and a personalized career coach. You’ll work with popular large language models (LLMs) such as GPT-3, Llama 2, and Flan-UL2, hosted on platforms like IBM watsonx and Hugging Face. You’ll also explore advanced concepts, such as retrieval-augmented generation (RAG), to enhance LLM responses with external knowledge, and integrate speech-to-text (STT) and text-to-speech (TTS) using IBM Watson® Speech Libraries and OpenAI Whisper to enable voice interactions. While a basic understanding of Python is essential, knowledge of HTML, CSS, or JavaScript is helpful but not required. The course includes supporting readings and videos to build foundational knowledge of the models and frameworks used. In addition, a comprehensive course glossary will help reinforce your learning.
In this module, you will explore the fundamentals of generative AI and foundation models, understanding how they drive modern AI applications. You will gain hands-on experience with image captioning using the bootstrapping language image pretraining (BLIP) model and build interactive UIs with Gradio. The module guides you through using Hugging Face for accessing pretrained models and datasets. You’ll also learn to deploy your AI app using IBM Code Engine for scalable access.
What's included
5 videos3 readings1 assignment2 app items3 plugins
5 videos•Total 26 minutes
- Course Introduction•5 minutes
- Generative AI Models •7 minutes
- Foundation Models•6 minutes
- Project Overview: Image Captioning with Generative AI•3 minutes
- Hugging Face•5 minutes
3 readings•Total 7 minutes
- IBM Product Spotlight: watsonx Orchestrate•2 minutes
- Course Syllabus and Prerequisites•3 minutes
- Module Summary: Image Captioning with Generative AI•2 minutes
1 assignment•Total 30 minutes
- Graded Quiz: Image Captioning with Generative AI •30 minutes
2 app items•Total 75 minutes
- Lab: Give Meaningful Names to Your Photos with IMG Captioning AI•60 minutes
- [Optional] Lab: Deploy your App with Code Engine•15 minutes
3 plugins•Total 33 minutes
- Helpful Tips for Course Completion•3 minutes
- Reading: BLIP from Hugging Face Transformers•15 minutes
- Reading: Introduction to Gradio•15 minutes
In this module, you will learn how to build your own ChatGPT-like application using generative AI tools. As part of the project, you will work with Facebook’s BlenderBot model using Hugging Face’s Transformers library in Python. You’ll explore key components such as large language models (LLMs), prompt engineering, and user interface design. Practical readings and labs will guide you through integrating models through APIs and deploying your app. You’ll also gain hands-on experience with frameworks like Gradio and Hugging Face. By the end, you’ll be equipped to create and customize your own conversational AI web app.
What's included
1 video1 reading1 assignment2 app items1 plugin
1 video•Total 4 minutes
- Project Overview: Create Your Own ChatGPT-like Website•4 minutes
1 reading•Total 2 minutes
- Module Summary: Create Your Own ChatGPT-Like Website•2 minutes
1 assignment•Total 30 minutes
- Module 2 Graded Quiz: Create Your Own ChatGPT-like Website•30 minutes
2 app items•Total 90 minutes
- Lab: Create Simple Chatbot with Open Source LLMs using Python and Hugging Face•60 minutes
- Lab: Integrating Your Chatbot into a Web Application•30 minutes
1 plugin•Total 15 minutes
- Reading: Flask: A Gateway to Web Development in Python•15 minutes
In this module, you will explore how to build a generative AI-powered voice assistant by combining OpenAI’s GPT-3 with IBM Watson’s speech-to-text and text-to-speech services. You will learn how to structure the application, apply containerization using Docker for consistent deployment, and implement a basic voice assistant that can understand spoken input and respond naturally through synthesized speech. Finally, you will learn to deploy the chatbot to a public server.
What's included
2 videos1 reading1 assignment1 app item1 plugin
2 videos•Total 8 minutes
- Project Overview: Create a Voice Assistant•4 minutes
- Introduction to Docker•3 minutes
1 reading•Total 1 minute
- Module Summary: Create a Voice Assistant•1 minute
1 assignment•Total 30 minutes
- Module 3 Graded Quiz: Create a Voice Assistant•30 minutes
1 app item•Total 60 minutes
- Lab: Create a Voice Assistant with OpenAI's GPT-3 and IBM Watson•60 minutes
1 plugin•Total 15 minutes
- Reading: IBM Watson Speech-to-Text and Text-to-Speech•15 minutes
In this module, you will learn how to build a generative AI-powered meeting assistant that can transcribe, summarize, and answer questions based on meeting content. You will explore key technologies such as IBM watsonx.ai, Meta Llama 2, and OpenAI Whisper, and understand their roles in creating enterprise-ready AI solutions. Through hands-on labs, you will implement a functional meeting assistant that showcases real-world business applications of generative AI.
What's included
2 videos1 reading1 assignment1 app item2 plugins
2 videos•Total 9 minutes
- Project Overview: Generative AI-Powered Meeting Assistant•3 minutes
- IBM watsonx.ai•6 minutes
1 reading•Total 1 minute
- Module Summary: Generative AI-Powered Meeting Assistant•1 minute
1 assignment•Total 30 minutes
- Module 4 Graded Quiz: Generative AI-Powered Meeting Assistant•30 minutes
1 app item•Total 60 minutes
- Lab: Business AI Meeting Companion•60 minutes
2 plugins•Total 30 minutes
- Reading: Introduction to Meta Llama 2 •15 minutes
- Reading: Introduction to OpenAI Whisper •15 minutes
In this module, you will learn how to build generative AI applications that summarize and answer questions using your own data. You will explore the concept of retrieval-augmented generation (RAG), understand how tools like LangChain and Llama 2 support this process, and apply these technologies to create a functional chatbot that retrieves and summarizes private documents. This hands-on experience will prepare you to implement secure, domain-specific AI assistants in enterprise settings.
What's included
3 videos1 reading1 assignment1 app item1 plugin
3 videos•Total 15 minutes
- Project Overview: Summarize Your Private Data with Generative AI & RAG•4 minutes
- Introduction to LangChain•4 minutes
- Enhancing LLM Accuracy with RAG•7 minutes
1 reading•Total 2 minutes
- Module Summary: Summarize Your Private Data with Generative AI •2 minutes
1 assignment•Total 30 minutes
- Module 5 Graded Quiz: Summarize Your Private Data with Generative AI•30 minutes
1 app item•Total 60 minutes
- Lab: Build a Chatbot for Your Data•60 minutes
1 plugin•Total 15 minutes
- Reading: Introduction to Llama 2 and RAG •15 minutes
In this module, you will acquire the skills to build a real-time voice translator assistant using generative AI technologies. You will learn how to integrate large language models, such as Flan-UL2, with IBM Watson® Speech Libraries for Embed to convert spoken input into translated speech output. The application workflow includes converting speech-to-text (STT), translating the text using an LLM, and converting it back to speech (TTS) in a target language. You will also apply your knowledge of Python, Flask, HTML, CSS, and JavaScript to create a functional and user-friendly web-based voice assistant that supports multilingual communication in real time. To support your learning, this module also includes a course glossary to reinforce key generative AI terms and technologies. You will conclude with a course wrap-up that summarizes major concepts and prepares you to apply your new skills to real-world AI applications.
What's included
1 video4 readings1 assignment1 app item1 plugin
1 video•Total 4 minutes
- Introduction to Project: Babel Fish with LLM and STT TTS•4 minutes
4 readings•Total 7 minutes
- Module Summary: Babel Fish with LLM and STT TTS•2 minutes
- What's Next: Explore watsonX Orchestrate•1 minute
- Congratulations and Next Steps•3 minutes
- Thanks from the Course Team•1 minute
1 assignment•Total 30 minutes
- Module 6 Graded Quiz: Babel Fish with LLM and STT TTS•30 minutes
1 app item•Total 90 minutes
- Lab: Babel Fish (Language Translator) with LLM, STT, & TTS•90 minutes
1 plugin•Total 5 minutes
- Glossary: Building GenAI- Powered Apps with Python•5 minutes
In this module, you will learn to build a personalized AI-powered career coach using large language models (LLMs). You will explore how generative AI can assist job seekers by providing resume feedback, job matching insights, and interview preparation guidance. Through hands-on practice, you will implement a job application assistant that uses user inputs and prompt engineering to generate tailored career advice. This module will also help you understand how to apply LLMs in practical, user-centric scenarios that support professional development and career advancement.
What's included
1 video1 reading1 assignment1 app item
1 video•Total 4 minutes
- Introduction to Project: Build an AI Career Coach•4 minutes
1 reading•Total 2 minutes
- Module Summary: Build an AI Career Coach•2 minutes
1 assignment•Total 10 minutes
- Module 7 Practice Quiz: Build an AI Career Coach•10 minutes
1 app item•Total 60 minutes
- Lab: Your Personalized Job Application Coach Based on LLM•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Why people choose Coursera for their career
Learner reviews
- 5 stars
79.15%
- 4 stars
11.30%
- 3 stars
3.88%
- 2 stars
2.47%
- 1 star
3.18%
Showing 3 of 283
Reviewed on Dec 1, 2024
Amazing hands on learning and exposure to various tech
Reviewed on Sep 18, 2024
The course was well-structured with practical and insightful projects.
Reviewed on Jul 22, 2024
Quick overview of lots of things. Needs to be done in two iterations, first rapid read, and copy-paste, second slow, deliberated, and less copy-paste.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
