VOOZH about

URL: https://apify.com/stanvanrooy6/audio-video-transcriber

โ‡ฑ Audio And Video Transcriber (OpenAI GPT-4o-transcribe) ยท Apify


๐Ÿ‘ Audio And Video Transcriber (OpenAI GPT-4o-transcribe) avatar

Audio And Video Transcriber (OpenAI GPT-4o-transcribe)

Pricing

$5.00/month + usage

Go to Apify Store

Audio And Video Transcriber (OpenAI GPT-4o-transcribe)

Downloads videos from public URLs, extracts audio, and transcribes them using OpenAI

Pricing

$5.00/month + usage

Rating

0.0

(0)

Developer

๐Ÿ‘ Stan Van Rooy

Stan Van Rooy

Maintained by Community

Actor stats

4

Bookmarked

49

Total users

2

Monthly active users

8 months ago

Last modified

Share

Video Transcriber Actor ๐ŸŽค๐ŸŽฌ

This Apify Actor automates the process of downloading videos from public URLs, extracting their audio content, and then transcribing the audio into text using OpenAI's powerful speech-to-text models (GPT-4o Mini Transcribe or GPT-4o Transcribe).

Use Cases

  • Transcribing lectures, talks, or presentations.
  • Generating subtitles or text content from video podcasts.
  • Making video content searchable by transcribing its audio.
  • Analyzing spoken content in a collection of videos.

Input Configuration

The actor requires the following input fields. Your OpenAI API key is essential for the transcription service to work.

FieldTypeDescriptionDefault Value
video_urlsArrayRequired. A list of public direct URLs to video files (e.g., MP4, MOV, AVI). Each URL will be processed.[] (Example prefilled)
openai_api_keyStringRequired. Your OpenAI API key. This is treated as a secret and stored securely.N/A
openai_modelStringThe OpenAI model for transcription. gpt-4o-mini-transcribe is fast & cost-effective; gpt-4o-transcribe may offer higher accuracy.gpt-4o-mini-transcribe
openai_transcription_languageStringOptional. Language of the audio in ISO-639-1 format (e.g., en for English). If omitted, OpenAI attempts auto-detection."" (Empty String)
openai_transcription_promptStringOptional. Text prompt to guide the model's style or vocabulary (e.g., for specific jargon or names).N/A
openai_transcription_temperatureStringSampling temperature (0.0-1.0, provided as a string e.g., "0.2"). Lower values are more deterministic."0.0"
max_concurrent_tasksIntegerMaximum number of videos to process in parallel.5
max_retriesIntegerNumber of times to retry processing a video if an error occurs.3

Example Input JSON:

{
"video_urls":[
"https://www.ffmpeg.org/example-assets/Counting_Atoms_preview.mp4",
"https://another-public-domain.com/another-video.mp4"
],
"openai_api_key":"sk-yourSecretOpenAiApiKeyGoesHere",
"openai_model":"gpt-4o-mini-transcribe",
"openai_transcription_language":"en",
"openai_transcription_prompt":"Focus on scientific terminology.",
"openai_transcription_temperature":"0.2",
"max_concurrent_tasks":5,
"max_retries":3
}

Output

The actor saves each transcription result as a separate item in the Apify Dataset. Each item will have the following structure:

{
"download_url":"https://www.example.com/video.mp4",
"transcription":"This is the transcribed text from the video...",
"status":"succeeded"// or "failed"
}

If a video fails to process after all retries, the transcription will be null, status will be failed, and an error field will contain the error message.

How to Use

  1. Go to the Actor page on the Apify Store.
  2. Click on "Try actor".
  3. Fill in the input configuration fields, especially video_urls and your openai_api_key.
  4. Click "Start" to run the actor.
  5. When the run finishes, you can find the results in the "Dataset" tab of the run console.

Limitations

  • URL Accessibility: Video URLs must be publicly accessible and direct links to video files. Redirects are followed, but complex authentication or sites requiring browser interaction are not supported.
  • OpenAI API Limits: Your OpenAI API usage is subject to your OpenAI account's rate limits and quotas. Long videos or large batches might take time or hit these limits.
  • Video Size/Length & Memory:
    • The actor downloads each video into the computer's memory before it can extract the audio.
    • If you set max_concurrent_tasks to a certain number (e.g., 5), the actor might try to hold up to that many videos in memory at the same time.
    • Very large video files (e.g., several gigabytes) combined with a high number of concurrent tasks can lead to high memory consumption. This could potentially cause the actor to run out of memory and stop, especially on plans with limited resources.
    • Recommendation: If you are processing very large video files, consider starting with a lower max_concurrent_tasks value (e.g., 1 or 2) and monitor its performance.
  • CDN Link Stability: If using temporary CDN links (e.g., from some social media platforms), they may expire. Prefer stable, direct URLs.

Support & Issues

If you encounter any issues or have suggestions for improvement, please open an issue on the GitHub repository for this actor (if applicable, or provide another contact method).


Happy Transcribing!

You might also like

Video Transcriber Ultimate

marielise.dev/video-transcriber-ultimate

Transcribe videos from 1000+ platforms including Vimeo, Dailymotion, Twitch, Rumble, TED, and Bitchute. Powered by Whisper AI with 50+ language support. Get full text with timestamps and segments. No API keys needed. Perfect for content creators, researchers, and accessibility compliance.

Video Transcript

agentx/video-transcript

Universal video-to-text API across YouTube, TikTok, Instagram, X, Facebook, Vimeo and 1000+ platforms. Returns the full transcript as timestamped segments with the source video metadata, optionally translated into 100+ target languages โ€” one endpoint replacing per-platform transcription stacks.

Tiktok Video Transcirpt Using OpenAI Whisper API

linen_snack/tiktok-video-transcirpt-using-openai-whisper-api

This Apify actor uses the OpenAI Whisper API to either transcribe Tiktok video into its original language or translate it into English. It's built to be robust, automatically handling video-to-audio conversion and compression to stay within API limits.

All Video Scraper

agentx/all-video-scraper

Universal video downloader API for YouTube, TikTok, Instagram, X, Vimeo, Twitch and 1000+ other sources. Pass any video URL and choose 4K, 720p, 480p, or metadata-only โ€” the response includes view/like/share/comment counts, author, duration, tags, plus the canonical hosted file URL.

๐• (twitter) Info Scraper

agentx/twitter-info-scraper

Bulk X/Twitter profile lookup โ€” send up to 1,000 usernames, @handles, or x.com URLs per run. Every row returns the full profile: display name, bio, follower and following counts, tweet count, verified type (blue/business/government), join date, location, website, and pinned tweet.

๐ŸŽง YouTube Music Downloader ๐ŸŒ

scrapearchitect/youtube-music-downloader

๐ŸŽง Instantly grab MP3s from YouTube Music URLs! ๐ŸŽง Extract audio + metadata (title, artist, duration) ๐ŸŽต, HD thumbnails ๐Ÿ–ผ๏ธ, and direct links. Bypass geo-restrictions with proxies ๐ŸŒ Perfect for playlists, bots, or offline jams! ๐Ÿ“ฅ JSON-ready โœ‰๏ธ ๐Ÿš€ YouTube Music Downloader ๐ŸŽง

๐Ÿ‘ User avatar

Scrape Architect

51

Instagram AI Transcript Extractor

sian.agency/instagram-ai-transcript-extractor

Instagram Transcript Generator โ€” ๐ŸŽฌ AI Reel Transcription | ๐Ÿ—ฃ๏ธ Speaker Diarization | ๐ŸŒ Language Detection | ๐Ÿ“Š 30+ Metrics | ๐Ÿ’ฐ Best Price. Extract entire channels with word-perfect transcripts and speaker identification. Try 5 reels free!

๐Ÿ‘ User avatar

SIรN Oรœ

2.2K

4.0

Instagram Transcript Scraper

crawlerbros/instagram-transcript-scraper

Extract transcripts from Instagram videos and reels using auto-generated captions or AI-powered speech-to-text. Returns clean, timestamped transcript segments with full video metadata.

582

4.4

Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.

invideoiq/video-transcript-scraper

Scrapes transcripts from online video/audio content on multiple plateforms (Youtube, X, ..) in any available language. It delivers outputs in both JSON and LLM-ready formats, making it ideal for analytics, and AI-based applications. Perfect for research and building intelligent conversational agents

1.5K

4.4

Instagram Video Transcript

truefetch/instagram-video-transcript

AI-transcribe any Instagram reel, story, or video โ€” timestamped captions, speaker diarization, and translation into 100+ languages from a single pasted link. $0.30 per video.

136

5.0