URL: https://apify.com/funny_electrician/korak1902

Multimodal Dataset Scraper · Apify


Pricing

from $7.00 / 1,000 image-text pairs extracted

Multimodal Dataset Scraper

Collects images + their descriptive text from niche forums for vision model training.

Rating

0.0 (0)

Developer

Milton Gardener

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: a month ago

My beautiful actor

Contains documentation of what your Actor does and how to use it, which is then displayed in the app or library. It's always a good idea to write a good README.md; in a few months, not even you will remember all the details about the Actor.

You can use Markdown language for rich formatting.

You might also like

Google Forums API

johnvc/google-forums-search-api

Extract forum threads, Q&A, and community discussions from Google Search’s "Forums" tab. Supports 40+ countries, custom localization (GL/HL), and pagination. Get structured JSON data from Reddit, LinkedIn, Quora, and niche forums for sentiment analysis, market research, or AI training.

Google Forums Search

burbn/google-forums-search

Search discussions and forums across the web using Google Forums tab. Find forum posts from Reddit, Quora, Stack Overflow, Apple Support & more. Filter by time, country & language. Perfect for market research, brand monitoring & community insights.

Podcast-to-Text Dataset Scraper

funny_electrician/Korak1906

Podcast-to-Text Dataset Scraper: Scrapes transcriptions from niche industry podcasts.

Milton Gardener

2

AI Training Data Curator

ryanclinton/ai-training-data-curator

Crawl any website and extract clean, structured text data ready for LLM fine-tuning, RAG pipelines, and AI model training.

OpenRouter Model Scraper

datapilot/openrouter-model-scraper

OpenRouter Models Scraper extracts AI model metadata from OpenRouter API, including pricing, context length, providers, modalities, token limits, vision/tool support, JSON support, and model architecture. Supports keyword filtering, proxy rotation, and structured dataset

Google Images Scraper

scraper-engine/google-images-scraper

Google Images Scraper collects image URLs, alt text, source pages, and metadata from Google Images. Use it as an API, with Python or Node.js, or via npm. Ideal for datasets, AI training, research, and automation. Exports in JSON, CSV, or Excel.

Scraper Engine

358

5.0

Alt Text Generator

balt1794/alt-ai-generator

Alt Text Generator automatically generates descriptive alt text for images using AI to help you rank faster on Google. Rank faster on search engines with SEO optimized alt text descriptions and increase visibility.

Vision OCR MCP

accelerationengg/vision-ocr-mcp

Extract text from images instantly. Turn receipts, invoices, documents, and handwritten notes into structured data.

14

5.0

Google Forums Scraper

codingfrontend/google-forums-scraper

A robust, high-performance utility designed for developer automation, data integration, and AI training. Features built-in captcha bypass, headful/headless browser execution, and proxy support to scrape Google data seamlessly, reliably, and at scale.

Coding Frontned

2

Related articles

Multimodal AI: what can it do, and why is it a game-changer?
How to improve AI models with web scraping and data augmentation