Generative AI for Cloud Solutions
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Generative AI for Cloud Solutions
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Understand the fundamentals of generative AI, large language models, and their integration with cloud platforms.
Explore advanced techniques such as fine-tuning, prompt engineering, and responsible AI practices.
Gain insights into the development and deployment of AI applications using frameworks like LLMOps and Assistants APIs.
Skills you'll gain
Tools you'll learn
Details to know
February 2026
10 assignments
See how employees at top companies are mastering in-demand skills
There are 10 modules in this course
This course explores how Generative AI is transforming modern cloud solutions by combining large language models with scalable cloud architectures. Learners gain a strategic understanding of how AI-driven systems are designed, deployed, and governed in real-world cloud environments.
Through a structured journey from NLP foundations to advanced LLM-based application development, the course helps learners build practical skills in fine-tuning models, retrieval-augmented generation, and prompt engineering. You will learn how to design, deploy, and scale AI-powered cloud applications while addressing operational and performance challenges. What sets this course apart is its balance of conceptual depth and hands-on architectural thinking. It connects core AI concepts with real-world cloud deployment patterns, Dev frameworks, and LLMOps practices. This course is ideal for cloud engineers, software developers, architects, and technology professionals looking to integrate Generative AI into cloud solutions. A basic understanding of cloud computing and software development concepts is recommended.
In this section, we explore conversational AI and generative AI, focusing on LLMs, open source vs closed source models, and cloud computing for scalable AI implementation.
What's included
2 videos9 readings1 assignment
2 videosβ’Total 2 minutes
- Course Overviewβ’1 minute
- Cloud Computing Meets Generative AI Bridging Infinite Impossibilities - Overview Videoβ’1 minute
9 readingsβ’Total 130 minutes
- Introductionβ’20 minutes
- Chatbots and Agentsβ’20 minutes
- Model Parametersβ’10 minutes
- Deep Dive Open Source vs Closed Source Proprietary Modelsβ’10 minutes
- Trending Models, Tasks, and Business Applicationsβ’10 minutes
- Imageβ’10 minutes
- Audioβ’20 minutes
- Cloud Computing for Scalability, Cost Optimization, and Securityβ’10 minutes
- From Vision to Value Navigating the Journey to Productionβ’20 minutes
1 assignmentβ’Total 10 minutes
- Bridging Cloud and AIβ’10 minutes
In this section, we explore NLP evolution and the role of transformers in AI communication and model development.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- NLP Evolution and Transformers Exploring NLPs and LLMs - Overview Videoβ’1 minute
5 readingsβ’Total 55 minutes
- Introductionβ’10 minutes
- NLP and the Strengths of Generative AI in LLMsβ’10 minutes
- How Do Transformers Work?β’10 minutes
- Conversation Prompts and Completions Under the Coversβ’20 minutes
- LLMs Landscape, Progression, and Expansionβ’5 minutes
1 assignmentβ’Total 10 minutes
- NLP and Transformer Fundamentalsβ’10 minutes
In this section, we cover domain-specific LLM fine-tuning, PEFT, and evaluation methods to improve accuracy and reliability.
What's included
1 video6 readings1 assignment
1 videoβ’Total 1 minute
- Fine-Tuning Building Domain-Specific LLM Applications - Overview Videoβ’1 minute
6 readingsβ’Total 120 minutes
- Introductionβ’10 minutes
- Fine-tuning Applicationsβ’30 minutes
- Prompt Tuning Processβ’20 minutes
- LoRAβ’20 minutes
- How to Evaluate Fine-Tuned Model Performanceβ’20 minutes
- GLUE and SuperGLUEβ’20 minutes
1 assignmentβ’Total 10 minutes
- Fine-Tuning and Evaluation in LLM Applicationsβ’10 minutes
In this section, we explore retrieval-augmented generation (RAG) to enhance LLM accuracy, focusing on vector databases, chunking strategies, and real-world applications like chatbots and recommendation systems.
What's included
1 video7 readings1 assignment
1 videoβ’Total 1 minute
- RAGs to Riches Elevating AI with External Data - Overview Videoβ’1 minute
7 readingsβ’Total 120 minutes
- Introductionβ’30 minutes
- Approximate Nearest Neighbors (ANNs)β’20 minutes
- Similarity Measuresβ’20 minutes
- How It Worksβ’10 minutes
- Vector DB Limitationsβ’10 minutes
- Common Vector DB Applicationsβ’10 minutes
- Business Applications of RAGβ’20 minutes
1 assignmentβ’Total 10 minutes
- Enhancing AI with External Dataβ’10 minutes
In this section, we explore prompt engineering techniques, emphasizing RAG integration, LLM interaction design, and ethical considerations for effective AI applications.
What's included
1 video4 readings1 assignment
1 videoβ’Total 1 minute
- Effective Prompt Engineering Techniques Unlocking Wisdom Through AI - Overview Videoβ’1 minute
4 readingsβ’Total 70 minutes
- Introductionβ’10 minutes
- Tokens and Cost Considerationsβ’20 minutes
- Assistantβ’30 minutes
- Ethical Guidelines for Prompt Engineeringβ’10 minutes
1 assignmentβ’Total 10 minutes
- Mastering Effective AI Interaction Strategiesβ’10 minutes
In this section, we explore generative AI app development frameworks like Semantic Kernel and LangChain, autonomous agents, and LLMOps for operationalizing LLM-based applications.
What's included
1 video8 readings1 assignment
1 videoβ’Total 1 minute
- Developing and Operationalizing LLM-based Apps Exploring Dev Frameworks and LLMOps - Overview Videoβ’1 minute
8 readingsβ’Total 140 minutes
- Introductionβ’20 minutes
- Fundamental Componentsβ’20 minutes
- LangChainβ’10 minutes
- Autonomous Agentsβ’20 minutes
- AutoGPTβ’20 minutes
- Discover and Tuneβ’10 minutes
- Comparing MLOps and LLMOpsβ’20 minutes
- Putting It All Togetherβ’20 minutes
1 assignmentβ’Total 10 minutes
- Operationalizing Large Language Models in the Cloudβ’10 minutes
In this section, we explore scaling ChatGPT in cloud environments, analyzing TPM, RPM, and PTU limits, and designing enterprise-ready architectures for efficient and reliable generative AI solutions.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Deploying ChatGPT in the Cloud Architecture Design and Scaling Strategies - Overview Videoβ’1 minute
5 readingsβ’Total 80 minutes
- Introductionβ’20 minutes
- PTUsβ’20 minutes
- Retries With Exponential Backoff The Scaling Special Sauceβ’10 minutes
- Monitoring, Logging, and HTTP Return Codesβ’10 minutes
- Costs Training and Supportβ’20 minutes
1 assignmentβ’Total 10 minutes
- Deploying AI Services in the Cloudβ’10 minutes
In this section, we examine security and privacy challenges in generative AI, focusing on risk mitigation, access controls, and secure deployment strategies for LLMs.
What's included
1 video8 readings1 assignment
1 videoβ’Total 1 minute
- Security and Privacy Considerations for Gen AI Building Safe and Secure LLMs - Overview Videoβ’1 minute
8 readingsβ’Total 75 minutes
- Introductionβ’10 minutes
- Emerging Security Threats A Look at Attack Vectors and Future Challengesβ’5 minutes
- Jailbreaks and Prompt Injectionsβ’10 minutes
- Training Data Poisoningβ’5 minutes
- Insecure Output Handlingβ’10 minutes
- Azure OpenAI Service API Keysβ’5 minutes
- What Is Privacyβ’20 minutes
- Auditingβ’10 minutes
1 assignmentβ’Total 10 minutes
- Security and Privacy in Generative AI Systemsβ’10 minutes
In this section, we explore responsible AI (RAI) principles, address LLM challenges, and evaluate Deepfake risks to ensure ethical, transparent, and safe AI development.
What's included
1 video8 readings1 assignment
1 videoβ’Total 1 minute
- Responsible Development of AI Solutions Building with Integrity and Care - Overview Videoβ’1 minute
8 readingsβ’Total 95 minutes
- Introductionβ’10 minutes
- Key Principles of RAIβ’10 minutes
- Addressing LLM Challenges with RAI Principlesβ’10 minutes
- Rising Deepfake Concernβ’10 minutes
- How to Spot a Deepfakeβ’5 minutes
- Building Applications Using a Responsible AI-First Approachβ’10 minutes
- Building Augmenting Loopβ’10 minutes
- AI, the Cloud, and the Law Understanding Compliance and Regulationsβ’30 minutes
1 assignmentβ’Total 10 minutes
- Ethical and Responsible AI Developmentβ’10 minutes
In this section, we explore future AI trends, including multimodal interactions and ChatGPT's evolving trajectory.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- The Future of Generative AI Trends and Emerging Use Cases - Overview Videoβ’1 minute
5 readingsβ’Total 95 minutes
- Introductionβ’30 minutes
- Video Prompts for Video Understandingβ’30 minutes
- Video Generation Models A Far-Fetched Dreamβ’5 minutes
- Industry-specific Generative AI Appsβ’10 minutes
- The Rise of Small Language Models (SLMs)β’20 minutes
1 assignmentβ’Total 10 minutes
- Exploring the Future of Generative AIβ’10 minutes
Instructor
Offered by
Explore more from Cloud Computing
- Status: Preview
Course
- Status: Preview
Course
- Status: Preview
Course
Why people choose Coursera for their career
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. Youβll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. Youβll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
