Harnessing LLMs & Text-Embeddings API with Google Vertex AI
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Harnessing LLMs & Text-Embeddings API with Google Vertex AI
Included with
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Integrate Google Cloud with Vertex AI for seamless AI model deployment and usage.
Explore embeddings and their role in enhancing Generative AI and LLMs
Learn hands-on techniques for text generation, classification, and extraction.
Build a scalable Retrieval-Augmented Generation (RAG) system using real-world datasets.
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
There are 6 modules in this course
Updated in May 2025.
This course now features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. Unlock the power of Google Cloud’s Vertex AI and take your machine learning projects to the next level with this practical and hands-on course. You’ll explore how to integrate and apply Large Language Models (LLMs) and the Text-Embeddings API to real-world data, enabling smarter search, classification, and summarization applications. By the end of this course, you’ll have built working knowledge of embeddings, vector similarity, and Retrieval-Augmented Generation (RAG) systems. The course begins with environment setup and a primer on API costs, then walks you through deploying and testing text embeddings with Vertex AI. You’ll perform hands-on tasks like generating sentence embeddings and integrating them into your projects using cosine similarity and visualization tools. A deep dive into the Vertex AI Text Embedding API reveals its potential through multimodal embedding concepts, semantic search, and practical use cases. In later modules, you'll transition from theory to powerful applications—building text generators with the Bison model, extracting structured information from unstructured text, and controlling output via temperature and sampling settings. You'll also develop end-to-end solutions like clustering StackOverflow data and implementing ANN search strategies using HNSW versus cosine similarity. This course is designed for data scientists, machine learning engineers, software developers, and cloud practitioners who are interested in building intelligent applications using GenAI. Ideal learners should have a foundational understanding of Python programming, basic knowledge of machine learning, and experience with REST APIs. Familiarity with Google Cloud Platform services and tools is recommended to fully benefit from this intermediate-level course.
In this module, we will introduce you to the course, outlining its structure and prerequisites. You will gain a clear understanding of what the course will cover and how the content is organized.
What's included
2 videos1 reading
2 videos•Total 3 minutes
- Introduction and About the Course - Prerequisites•2 minutes
- Course Structure•1 minute
1 reading•Total 10 minutes
- Full Course Resources•10 minutes
In this module, we will guide you through setting up the necessary development environment, configuring Google Cloud, and understanding API costs. You will also engage in a hands-on exercise to test sentence embeddings with Vertex AI.
What's included
3 videos1 assignment
3 videos•Total 9 minutes
- Development Environment Setup and API Costs - Overview•2 minutes
- Google Cloud Setup•4 minutes
- Hands-on: Testing the Vertex AI - Generated a Sentence Embedding•3 minutes
1 assignment•Total 15 minutes
- Development Environment Setup & Google Cloud Platform Setup - Assessment•15 minutes
In this module, we will dive deep into Vertex AI and its Text Embedding API, exploring both foundational and advanced concepts. The module includes hands-on exercises to help you better understand embeddings, their dimensions, and real-world applications in Generative AI.
What's included
10 videos1 assignment
10 videos•Total 42 minutes
- Introduction to Vertex AI and Capabilities - Overview•3 minutes
- OPTIONAL: Embeddings Crash Course•4 minutes
- How are Embeddings Used in GenAI and LLMs and Use Cases•6 minutes
- The Embeddings API - Text vs Multimodal Embeddings - Overview•3 minutes
- Task Types and Benefits•4 minutes
- Multimodal Embeddings Diagram•2 minutes
- Hands-on: Embeddings Length - Dimension•2 minutes
- Hands-on: Run Cosine Similarity Search on Different Sentences•6 minutes
- Hands-on: Visualize Embeddings•9 minutes
- Summary•2 minutes
1 assignment•Total 15 minutes
- Vertex AI Text Embedding API and Embeddings Crash Course - Deep Dive - Assessment•15 minutes
In this module, we will focus on text generation techniques within Vertex AI, including working with the Bison model. You'll apply hands-on methods for text classification, information extraction, and fine-tuning text output through various sampling techniques.
What's included
6 videos1 assignment
6 videos•Total 24 minutes
- TextGenerationModel - Generating Text Using Bison Model•3 minutes
- Hands-on: Text Generation - Classification Use Case•4 minutes
- Hands-on: Extract Information into Tables and JSON Formats•3 minutes
- Hands-on: Controlling Temperature for the Model•4 minutes
- Hands-on: TopK and TopP•5 minutes
- Hands-on: Transcript Summarization and Extraction•5 minutes
1 assignment•Total 15 minutes
- Text Generation with Vertex AI Text Embedding API - Assessment•15 minutes
In this module, we will apply what you've learned to real-world use cases by building a RAG system and visualizing clusters in StackOverflow data. You’ll also explore techniques to scale embeddings through approximate nearest neighbor search.
What's included
3 videos1 assignment
3 videos•Total 31 minutes
- Cluster Visualization of StackOverflow Question and Answers in 2D•13 minutes
- Build Your RAG System with the StackOverflow Data•14 minutes
- Scale with the Approximate Nearest Neighbor Search: HNSW vs Cosine Similarity•4 minutes
1 assignment•Total 15 minutes
- Hands-on: Application and Real-world Use Cases of Embeddings - Assessment•15 minutes
In this final module, we will summarize the course content and suggest potential next steps to continue your learning journey in AI and machine learning.
What's included
1 video2 assignments
1 video•Total 2 minutes
- Course Summary and Next Steps•2 minutes
2 assignments•Total 75 minutes
- Full Course Assessment•60 minutes
- Full Course Practice Assessment•15 minutes
Instructor
Explore more from Machine Learning
- Status: FreeD
DeepLearning.AI
Project
Project
- G
Google Cloud
Project
Why people choose Coursera for their career
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
