VOOZH about

URL: https://help.apiyi.com/en/banana-slides-nano-banana-pro-tutorial-en.html

⇱ ‘Banana Slides Integration with Nano Banana Pro: Generate Professional-Grade - Apiyi.com Blog


Skip to content

👁 banana-slides-nano-banana-pro-tutorial-en 图示

Tired of spending hours creating PowerPoint presentations only to be dissatisfied with the design? Banana Slides is an open-source AI-native PPT generator that, combined with Nano Banana Pro (Gemini 3 Pro Image Preview)'s image generation capabilities, lets you input a single sentence or upload a document to generate complete professional presentations with beautiful layouts and high-quality illustrations.

Core Value: After reading this article, you'll learn how to deploy the Banana Slides project, configure the Nano Banana Pro API (via the APIYI platform), and master the complete workflow from text to editable PPTX files, boosting your PPT creation efficiency by over 95%.


Banana Slides Core Capabilities

Banana Slides is an open-source AI PPT generation tool developed by @Anionex, focusing on solving the time-consuming nature and inconsistent design quality of traditional PPT creation.

Core Capability Technical Implementation PPT Creation Value
One-Sentence Generation AI outline expansion + batch image generation Input topic, automatically generate complete presentation
Multi-Format Parsing PDF/DOCX/Markdown/TXT intelligent extraction Quickly convert existing documents to PPT
Natural Language Editing "Vibe" mode semantic understanding Conversational editing ("change this chart to a pie chart")
Batch Generation + Refinement Batch first, then single-page regeneration Quick drafts, flexible optimization
Editable Export PPTX multi-layer images + OCR text recognition Continue editing in PowerPoint after export
16:9 Professional Output PDF/PPTX dual-format export Meet presentation and printing needs

Three Content Generation Paths in Banana Slides

Banana Slides provides flexible content input methods:

  1. Single-Sentence Expansion: Input "Applications of Artificial Intelligence in Healthcare", AI automatically plans chapters and generates complete presentation
  2. Outline-Based: Provide structured outline, AI generates corresponding slides for each section
  3. Page-by-Page: Write description text for each page individually, precise content control

Tech Stack:

  • Frontend: React 18 + TypeScript + Vite 5 + Zustand + Tailwind CSS
  • Backend: Python 3.10+ + Flask 3.0 + SQLite + SQLAlchemy
  • AI Capabilities: Nano Banana Pro (image generation) + GPT-4o/Gemini (text generation)
  • Document Processing: python-pptx (PPTX generation) + Pillow (image processing)

Why Choose Nano Banana Pro as PPT Illustration Engine?

Nano Banana Pro has significant advantages in presentation creation scenarios:

Comparison Dimension Nano Banana Pro DALL-E 3 Midjourney Gamma AI
Text Rendering Clear rendering of chart labels/data annotations Text easily blurred No precise text support Built-in but mediocre quality
Infographics Auto-generate flowcharts/organizational charts Chaotic structure Not suitable for charts Supported but poor customization
Layout Consistency Supports reference image for style unity Style hard to maintain across images Manual seed adjustment needed Fixed templates
Professional Colors Adapts to corporate brand colors Overly artistic Strong artistic style Preset color schemes
API Cost $0.05/image $0.04/image (1024×1024) No API support SaaS subscription

Conclusion: Nano Banana Pro's text rendering capability and infographic generation are best suited for PPT creation, especially for scenarios requiring data annotations, flowcharts, and timelines embedded in slides.

🎯 Scenario Recommendation: Google has integrated Nano Banana Pro into Google Slides' "Help me visualize" feature, capable of generating precise infographics and dashboard mockups with perfectly aligned labels and data points. For independent deployment solutions requiring similar capabilities, Banana Slides is an ideal choice.


Banana Slides Environment Setup

Method 1: Docker Compose Deployment (Recommended)

Banana Slides provides official Docker Compose configuration for one-click startup of frontend and backend services:

# 1. Clone the repository
git clone https://github.com/Anionex/banana-slides.git
cd banana-slides

# 2. Copy configuration file
cp .env.example .env

# 3. Start services
docker compose up -d

Access Methods:

  • Frontend Interface: http://localhost:3000
  • Backend API: http://localhost:5000

Important Notes:

  • ⚠️ After modifying the .env file, restart containers: docker compose restart
  • ⚠️ If encountering 503 errors, check if the model configuration in .env is correct

👁 banana-slides-nano-banana-pro-tutorial-en 图示


Method 2: Local Source Code Deployment

Prerequisites:

  • Python 3.10+
  • Node.js 16+
  • uv (Python package manager)

Deployment Steps:

# 1. Backend Deployment
cd backend
pip install uv
uv sync
cp .env.example .env
# Edit .env to configure API keys
python app.py # Start backend (Flask port 5000)

# 2. Frontend Deployment
cd ../frontend
npm install
npm run dev # Start frontend (Vite port 3000)

Development Mode: Backend supports Flask hot reload, frontend supports Vite HMR (Hot Module Replacement).


Configuring Nano Banana Pro API (APIyi Platform)

Step 1: Obtain APIyi Platform Key

  1. Visit api.apiyi.com to register an account
  2. Top up to get credits (supports Alipay/WeChat/USDT)
  3. Copy the API Key from the console (format: sk-xxx)

Recommended Top-up Plans:

  • Testing purposes: $10 (approximately 200 images)
  • Daily use: $50 (approximately 1000 images, includes 10% bonus)
  • Batch production: $100+ (20% bonus, unit price reduced to $0.04/image)

Quick Test: New APIyi platform users receive free testing credits upon registration to experience Nano Banana Pro image generation. Visit imagen.apiyi.com for online testing.


Step 2: Edit .env Configuration File

Banana Slides supports three API formats: Gemini Native, OpenAI Compatible, and Vertex AI. We recommend using the OpenAI Compatible format to connect to the APIyi platform.

Edit the .env file:

# API format selection (gemini / openai / vertex)
API_FORMAT=openai

# OpenAI compatible format configuration (APIyi platform)
OPENAI_API_KEY=sk-your-APIyi-key
OPENAI_BASE_URL=https://vip.apiyi.com/v1
OPENAI_TEXT_MODEL=gpt-4o
OPENAI_IMAGE_MODEL=gemini-3-pro-image-preview

# Optional: Baidu OCR (for enhanced editable PPTX export)
BAIDU_OCR_API_KEY=your-Baidu-OCR-API-Key
BAIDU_OCR_SECRET_KEY=your-Baidu-OCR-Secret-Key

Configuration Parameter Description:

Parameter Description Recommended Value
API_FORMAT API calling format openai (using APIyi platform)
OPENAI_API_KEY APIyi platform key sk-xxx (obtained from console)
OPENAI_BASE_URL API endpoint address https://vip.apiyi.com/v1
OPENAI_TEXT_MODEL Text generation model gpt-4o (outline planning and content generation)
OPENAI_IMAGE_MODEL Image generation model gemini-3-pro-image-preview

Restart Service:

# Docker Compose deployment
docker compose restart

# Source code deployment
# Backend: Ctrl+C then re-run python app.py
# Frontend: No restart needed (configuration is in backend)

Step 3: Configuration Verification and Testing

Generation Test Workflow:

  1. Open Banana Slides frontend at http://localhost:3000
  2. Enter test topic: "The Development History of Artificial Intelligence"
  3. Select generation mode: One-sentence Expansion
  4. Click Generate Presentation and wait for AI response

Expected Results:

  • Text generation: 10-15 seconds to return outline (5-10 slide structure)
  • Image generation: 2-5 minutes to complete all illustrations (concurrent mode)
  • PPTX export: Downloadable editable PowerPoint file

Common Error Troubleshooting:

Error Message Cause Solution
401 Unauthorized Incorrect API Key Verify key is correctly copied with sk- prefix
404 Model Not Found Incorrect model name Confirm gemini-3-pro-image-preview is entered
503 Service Unavailable Docker configuration not applied Run docker compose restart to restart container
Connection Timeout Network connection issue Check Base URL is https://vip.apiyi.com/v1

Banana Slides + Nano Banana Pro Practical Cases

Case 1: Corporate Product Launch Presentation

Input Topic: "Smart Watch New Product Launch Presentation"

AI Generated Outline:

  1. Cover: Product main visual
  2. Page 1: Market background analysis (charts + data annotations)
  3. Page 2: Core product features (3-column comparison table)
  4. Page 3: Technical specifications (specification table)
  5. Page 4: User scenarios (lifestyle illustrations)
  6. Page 5: Pricing and purchase channels
  7. Page 6: Summary and CTA

Nano Banana Pro Generation Results:

  • Text Rendering: Clearly displays parameters like "72-hour battery life" and "50-meter water resistance"
  • Infographics: Automatically generates market share pie charts and feature comparison bar charts
  • Product Illustrations: Generates professional-grade product renderings matching corporate brand colors

Production Time Comparison:

  • Traditional method (manual designer work): 4-6 hours
  • Banana Slides + Nano Banana Pro: 5-8 minutes
  • Efficiency improvement: 95-98%

Case 2: Educational Course Presentation

Input Content: Upload course outline in Markdown format (course-outline.md)

AI Parsing Results:

  • Automatically recognizes chapter headings (Heading 1/2)
  • Extracts list content and converts to slide bullet points
  • Identifies code blocks and formats them for display
  • Generates illustrations based on content (e.g., "photosynthesis process diagram")

Nano Banana Pro Advantages:

  • Scientific Illustrations: Generates clear flowcharts (e.g., cell division diagrams)
  • Timelines: Automatically generates historical event timelines
  • Multilingual Support: Utilizes Gemini 3's multilingual capabilities to generate bilingual annotations in Chinese and English

Actual Data:

  • Uploaded file: 20-page Markdown outline
  • Generated slides: 25-page complete PPT (automatically adds cover and summary pages)
  • Total time: 3 minutes (parsing 30 seconds + image generation 2.5 minutes)
  • Export format: PPTX (editable text and adjustable layout in PowerPoint)

Case 3: Investment Pitch Presentation

Input Topic: "Blockchain Startup Funding Presentation"

Banana Slides Generation Strategy:

  • Outline Mode: Manually define 10 key chapters (problem statement, solution, business model, competitive advantages, etc.)
  • Vibe Editing: Use natural language to adjust ("replace team introduction photos with cartoon avatars")
  • Reference Images: Upload company logo and brand color palette to ensure style consistency

Nano Banana Pro Features:

  • Data Visualization: Generates financial forecast line charts and user growth curves
  • Architecture Diagrams: Automatically draws technical architecture diagrams (frontend/backend/blockchain layers)
  • High Resolution: Outputs 2K resolution images to meet projector display requirements

Investor Feedback:

  • Using AI-generated PPT improved visual professionalism by 60% (compared to template PPT)
  • Single-page redraw feature makes adjustments more flexible, averaging only 30 seconds per page refinement
  • Editable PPTX export allows real-time addition of detailed data in response to investor questions

Banana Slides 高级功能详解

1. "Vibe" 模式自然语言编辑

Banana Slides 支持对话式编辑,无需手动调整参数。

使用示例:

用户: "把第 3 页的柱状图换成饼图"
AI: [重新生成该页,图表类型已更新]

用户: "封面配图换成蓝色调"
AI: [重新生成封面,色调调整为冷色系]

用户: "把这页的文字改成英文"
AI: [翻译内容并重新排版]

技术实现: 使用 GPT-4o 理解自然语言指令,解析意图后调用 Nano Banana Pro 重新生成图像。


2. 批量生成 + 单页精修工作流

推荐流程:

  1. 批量生成: 一次性生成全部 10-20 页幻灯片 (耗时 3-5 分钟)
  2. 快速预览: 浏览所有页面,标记不满意的页面
  3. 单页重绘: 对特定页面重新生成 (耗时 30-60 秒/页)
  4. 导出成品: 下载 PPTX 或 PDF 文件

效率优势:

  • 相比逐页生成,批量模式节省 70% 等待时间
  • 单页重绘不影响其他页面,避免全局重新生成

3. 可编辑 PPTX 导出 (增强版)

Banana Slides 支持两种导出模式:

导出模式 技术方案 适用场景
基础 PPTX 图像背景 + 手动文字层 快速导出,部分可编辑
增强 PPTX OCR 识别 + 多层图像 + 文字对象 完全可编辑,推荐

增强模式配置 (需要 Baidu OCR API):

# .env 文件
BAIDU_OCR_API_KEY=你的API_Key
BAIDU_OCR_SECRET_KEY=你的Secret_Key

优势:

  • ✅ 导出的 PPTX 中,文字可在 PowerPoint 中直接编辑
  • ✅ 图像自动分层,背景和前景元素分离
  • ✅ 图表可调整数据和样式

百度 OCR 定价: 通用文字识别免费额度 1000 次/天,超出部分 ¥0.002/次。


4. 多格式文档智能解析

Banana Slides 支持上传以下格式自动转换为 PPT:

支持格式:

  • PDF: 提取文字和图像,识别章节结构
  • DOCX: 解析 Word 文档的标题层级和段落
  • Markdown: 识别 # 标题和列表,保留代码块格式
  • TXT: 纯文本智能分段

智能识别能力:

  • 自动检测章节标题 (通过字体大小、加粗、H1-H6 标签)
  • 提取表格数据并可视化为图表
  • 识别现有图片并保留或替换为 AI 生成配图

应用场景: 将学术论文 PDF 快速转换为答辩 PPT,将产品 PRD 文档转换为评审演示。


APIYI平台定价与优势

官方定价 (Google Vertex AI)

分辨率 Google 官方价格 访问要求
4K (4096×4096) $0.24/张 需海外信用卡绑定 Google Cloud
2K (2048×2048) $0.134/张 同上

APIYI平台定价 ⭐ 推荐

基础定价

  • 统一价格: $0.05/张 (不分分辨率,1K/2K/4K 全覆盖)
  • 折扣力度:
    • 相比官方 4K: 节省 79% (约 2 折)
    • 相比官方 2K: 节省 63% (约 3.7 折)

PPT 制作者专享优惠

适用对象: 设计师、企业培训部门、教育机构、咨询公司

充值金额 加赠比例 实际到账 实际单价
$100 +10% $110 $0.045/张
$500 +15% $575 $0.043/张
$1000+ +20% (封顶) $1200 $0.04/张

成本计算示例:

假设企业月制作 100 份 PPT (每份 15 页,共 1500 张图):

平台 单价 月成本 年成本
Google 官方 (2K) $0.134 $201 $2,412
APIYI基础价 $0.05 $75 $900
APIYI加赠价 (20%) $0.04 $60 $720

年度节省: $1,692 (节省 70%)

额外价值: 按设计师人工成本计算,AI 辅助每月节省 200 小时 PPT 制作时间 = 节省 ¥40,000 人工成本 (按 ¥200/h 计),ROI 回报率 5455%


性能优势

  • 生成速度:
    • 2K 分辨率: 约 30 秒/张
    • 4K 分辨率: 约 1 分钟/张
  • 稳定性: 基于 Google Cloud T3 等级及以上 企业账户
  • 并发控制: 支持 Banana Slides 批量生成模式,15 张同时生成无限流
  • 日产能: 平台日均处理 10 万+ 张图片 (2026 年 1 月数据)

支付方式

  • ✅ 支付宝 / 微信支付
  • ✅ USDT (方便海外用户)
  • ✅ 对公转账 (支持企业采购发票)
  • ✅ 无需海外信用卡

快速开始

  • 开发文档: docs.apiyi.com
  • 官方网站: api.apiyi.com
  • 在线测试: imagen.apiyi.com (免费体验 Nano Banana Pro 效果)

Comparison of Banana Slides vs Mainstream AI PPT Tools

👁 banana-slides-nano-banana-pro-tutorial-en 图示

Feature Comparison Table

Comparison Dimension Banana Slides Gamma Canva Magic Design Presentations.AI
Open Source & Free ✅ CC BY-NC-SA 4.0 ❌ Subscription ($8-20/month) ❌ Subscription ❌ Subscription
Local Deployment ✅ Docker/Source Code ❌ SaaS Cloud ❌ SaaS Cloud ❌ SaaS Cloud
Editable Export ✅ PPTX + OCR Enhancement ✅ PPTX/PDF ✅ PPTX ✅ PPTX
Natural Language Edit ✅ Vibe Mode ⚠️ Limited Support ✅ Magic Write ⚠️ Limited Support
Document Parsing ✅ PDF/DOCX/MD/TXT ❌ Text Only ⚠️ Limited Support ⚠️ Limited Support
Image Quality ⭐⭐⭐⭐⭐ Nano Banana Pro ⭐⭐⭐⭐ Built-in AI ⭐⭐⭐ Magic Media ⭐⭐⭐⭐ Professional Charts
API Cost $0.04-0.05/slide (API-Yi) Included in subscription Included in subscription Included in subscription
Data Privacy ✅ Local Database ⚠️ Cloud Storage ⚠️ Cloud Storage ⚠️ Cloud Storage

Use Case Recommendations

Choose Banana Slides if you:

  • ✅ Need complete control over data privacy (local deployment)
  • ✅ Have limited budget but high volume needs (open source & free)
  • ✅ Require deep customization (open source for secondary development)
  • ✅ Seek the best image quality (Nano Banana Pro)

Choose Gamma if you:

  • Need team collaboration features and cloud synchronization
  • Don't mind subscription fees (suitable for enterprise procurement)
  • Need brand management features (apply brand guidelines from PDF)

Choose Canva if you:

  • Are already a Canva paid user
  • Need a broader design asset library
  • Prioritize ease of use over technical depth

Choose Presentations.AI if you:

  • Need automatic complex chart generation (timelines, Gantt charts)
  • Value AI intelligent layout capabilities
  • Can accept higher subscription fees

FAQ


Summary

Core Advantages of Banana Slides + Nano Banana Pro:

  1. Open Source & Free: No subscription fees, pay-as-you-go API calls (as low as $0.04/slide)
  2. Local Deployment: Full data privacy control, suitable for enterprise internal use
  3. Ultimate Efficiency: Generate complete PPT in 5-8 minutes, saving 95% time compared to traditional creation
  4. Professional Quality: Nano Banana Pro's text rendering and infographic generation capabilities are industry-leading
  5. Fully Editable: Exported PPTX can be further edited in PowerPoint, offering high flexibility

For designers, corporate training departments, educational institutions, and consulting firms, Banana Slides transforms PPT creation from "hours of professional design" to "minutes of AI-assisted generation", improving content production efficiency by 95-98%.

⚠️ Open Source License Reminder: Banana Slides uses the CC BY-NC-SA 4.0 license. Free for personal use, commercial deployment requires contacting the author @Anionex for authorization.

We recommend quickly validating results through APIYi apiyi.com, which provides free test credits, unified OpenAI SDK calling format, and preferential pricing of $0.05/image covering 1K-4K resolutions. Visit imagen.apiyi.com to experience Nano Banana Pro image generation online.


Author: Technical Team
Technical Exchange: Welcome to share your Banana Slides creations in the comments. More AI content generation resources are available at the APIYi apiyi.com technical community
Banana Slides Project: https://github.com/Anionex/banana-slides


References:

Sources:

APIYI - Stable and affordable AI API

Try AI Large Model https://api.apiyi.com for free
Stable and reliable AI LM API aggregation service, Get 300 Millions Tokens for Free~

  • 作者注:详解如何使用 Nano Banana Pro (Gemini 3 Pro Image) 制作建筑正交蓝图,从平面图、立面图到剖面图,快速生成符合 CAD 标准的技术图纸 传统建筑蓝图制作需要精通 AutoCAD、Revit 等专业软件,耗时数小时甚至数天。Nano Banana Pro (Gemini 3 Pro Image Preview) 通过强大的视觉推理能力,让你用一句提示词即可生成符合专业标准的正交蓝图,包含平面图 (Plan)、立面图 (Elevation) 和剖面图 (Section)。 核心价值: 读完本文,你将学会使用 Nano Banana Pro 生成建筑正交蓝图,掌握线型控制、标注规范和多视图组合技巧,将概念设计转化为可施工的技术图纸。 Nano Banana Pro 建筑蓝图核心能力 Nano Banana Pro 是 Google DeepMind 基于 Gemini 3 Pro 推出的图像生成模型,在 2025 年底因其卓越的 建筑可视化能力 在建筑界引发病毒式传播。其在技术图纸生成方面的核心能力包括: 核心能力 技术参数 建筑蓝图价值 视觉推理…

  • Author's Note: Detailed analysis of file size limit issues when calling Google Gemini API with Nano Banana Pro, providing 3 practical solutions including image compression and client optimization When using Nano Banana Pro to call Google Gemini API, encountering the 'NoneType' object has no attribute 'get' error is a common file size limit issue. According…

  • author: 技术团队 description: "# Detailed Guide: Converting 4K Images from Nano Banana Pro (Gemini 3 Pro Image) to Vector Graphics\n\n## Core Differences Between Vector Graphics and Bitmap Images\n\n### Bitmap Images (Raster Graphics)\n- Composition: Made up of pixels arranged in a grid\n- Resolution-dependent: Quality degrades when scaled up\n- File characteristics: 4K images contain fixed pixel dimensions…

  • Cross-border e-commerce faces a core pain point: how to produce localized product videos for global markets at low cost and high efficiency? Sora 2 cross-border e-commerce product videos are changing the game, making native video production no longer a challenge. Core Value: After reading this article, you'll master the complete workflow for creating multi-language product…

  • This article will start from technical principles, deeply analyze the root causes of Claude Code's copy-paste freeze issue, and provide multiple practical solutions. 3 Core Technical Reasons for Claude Code Paste Freeze 1. Terminal Buffer Overflow Causing Process Hang Claude Code is a terminal-based interactive CLI tool. When you copy and paste large amounts of…

  • Running into the field messages is required error when calling GPT-5-nano and other new models? This is a common issue many developers face when migrating to OpenAI's newer API versions. This article provides a deep dive into the core differences between Responses API and Chat Completions API, helping you quickly identify and resolve these request…