openai-vision
Here are 21 public repositories matching this topic...
AI Agent capable of automating various tasks using MCP
- TypeScript
An AI-powered PlantUML editor app for iPad
- JavaScript
Generate DDLs from ER Diagrams using OpenAI Vision
- Python
A lightweight Vision-Language-Action (VLA) baseline for MetaWorld robot-arm tasks using a pretrained CLIP-ViT vision transformer (openai/clip-vit-base-patch32), a small text transformer, and robot-state fusion
- Python
This Python script processes a video file, generates a compelling description, creates a voiceover script in the style of David Attenborough, and synthesizes the voiceover using OpenAI's Text-to-Speech API.
- Python
🤖 🌐 Personal AI assistant that browses the web independently
- HTML
The AI Expense Tracker is an expense management tool designed to simplify the process of tracking expenses by automating the extraction of necessary data from receipt images.
- TypeScript
Generate a k6 test case from a web page screenshot
- Python
Multimodal Video Reasoning and Frame Analysis Platform for real-time intelligent monitoring.
An intelligent document navigation system that extracts topics, maps relationships, detects anomalies, and creates visual navigation tools for complex documents using LangGraph and OpenAI.
- Python
Smart Screen Reader: a Chrome extension screen reader for visually impaired people. It enhances web accessibility by generating image descriptions using AI, summarizing entire pages, and letting users ask questions and automatically scroll to the relevant sections.
- JavaScript
Web-based AI image recognition app.
- Python
Enterprise vision-query technical architecture focusing on scalability and high performance.
AI-powered cross-browser UI/UX testing tool that captures screenshots across devices and uses OpenAI vision models to analyze layout, responsiveness, and compatibility issues.
- Python
Supercharge legacy models with RAG, image analysis (GPT-4o vision), code execution, and web search. Transform them into lean, multimodal powerhouses: smarter, faster, and more versatile, without the heavyweight cost.
- Python
Just testing OpenAI Vision
- TypeScript
