URL: https://apify.com/truefetch/video-to-text?fpr=aiagentapi

Video to Text Transcription API: 1000+ Sites, 133 Languages · Apify


Pricing

from $0.30 / transcript

Transcribe videos from 1,000+ platforms to text: auto language detection, timestamps, subtitle file download, and translation to 100+ languages. No file uploads. $0.30 per video.

Rating: 4.9 (4)

Developer: TrueFetch

Maintained by Community

Actor stats

  • Bookmarked: 11
  • Total users: 252
  • Monthly active users: 57
  • Last modified: 10 hours ago

Affiliate disclosure: Apify links on this page may include referral parameters. If you sign up through one of these links, TrueFetch may earn a commission from Apify at no extra cost to you. Pricing, features, and Actor access are unaffected.

πŸ‘ Video To Text
πŸ‘ Video Transcription
πŸ‘ $0.30 per Transcript
πŸ‘ Transcript, Timestamps & Translation
πŸ‘ Telegram Support
πŸ‘ Telegram Group

Video To Text: Video Transcription with Transcript, Timestamps & Translation

Provide a video URL or an uploaded file and get an AI speech recognition transcript: full text, timecoded segments, and optional translation into any supported language, returned as clean structured JSON.

TL;DR (40-word answer): Video To Text is a no-login Apify Actor that transcribes videos using AI speech recognition at $0.30 per transcript (+$0.10 for translation), returning full transcript text, timecoded segments, and optional translated text for any video URL or file. Last verified 2026-05-02.

What it is NOT: Video To Text is not a subtitle extractor (use Video Subtitles Downloader for existing SRT files), not a video downloader (use Best Video Downloader for that), and not a live transcription service. It transcribes pre-recorded video audio using Whisper AI.

Video To Text is an Apify Actor that uses AI speech recognition (Whisper) to transcribe video URLs and files into full text with timecoded segments, plus optional language translation. Maintained by TrueFetch on the Apify Store.


🏷️ Why choose Video To Text

Video To Text is best for content creators, researchers, journalists, and AI agents who need AI-generated transcripts from video content without deploying a Whisper server.

  • 🎙️ AI speech recognition: uses Whisper for high-accuracy transcription across languages, regardless of whether the video has existing subtitles.
  • 🕐 Timecoded segments: output includes both full transcript text and segment-level timestamps (start, end, text).
  • 🌐 Optional translation: translate the transcript into any supported language at $0.10 additional per video.
  • ⚡ Compute-based pricing: $0.30 per transcript (+ $0.10/translation + small actor runtime charge). Free Apify plan includes $5/month credits ≈ 16 free transcriptions.

🚀 Quick Start: run in 3 steps

Run in Apify Console

  1. Open the Actor: go to Video To Text on Apify and click Try for free.
  2. Configure inputs: paste a video URL (or provide a video_file URL), and optionally set a translate target language.
  3. Click Start: the Actor returns structured transcript data with timecoded segments in the dataset.

Run via Apify API

curl -X POST "https://api.apify.com/v2/acts/TrueFetch~video-to-text/run-sync-get-dataset-items?token=YOUR_APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "translate": "spanish"
  }'
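
The same call can be sketched in Python using only the standard library. This is a minimal illustration rather than an official client: the helper names `build_request` and `transcribe` and the 600-second timeout are assumptions made for the example, while the endpoint, Actor ID, and input fields are the ones shown in the curl command.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2/acts"
ACTOR_ID = "TrueFetch~video-to-text"


def build_request(token, video_url, translate=None):
    """Build the POST request for the run-sync-get-dataset-items endpoint."""
    payload = {"video_url": video_url}
    if translate:
        # Translation is optional; omit the key to skip it.
        payload["translate"] = translate
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def transcribe(token, video_url, translate=None, timeout=600):
    """Send the request and return the parsed list of dataset items.

    The timeout is a hypothetical value; long videos may need more.
    """
    request = build_request(token, video_url, translate)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `transcribe("YOUR_APIFY_TOKEN", "https://www.youtube.com/watch?v=dQw4w9WgXcQ", "spanish")` would block until the run finishes and return the dataset records as a list of dictionaries.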

Run via MCP (for AI agents)

Video To Text is callable from Claude, ChatGPT, Cursor, and any MCP-compatible client through the Apify MCP Server. Reference the Actor as TrueFetch/video-to-text and pass the same JSON input as above.


⚙️ Input parameters

| Parameter | Type | Required | Default | Description | Example |
|---|---|---|---|---|---|
| video_url | string | ✅ (or file) | none | URL of the video to transcribe (YouTube, Vimeo, and other yt-dlp-supported platforms). | https://youtu.be/dQw4w9WgXcQ |
| video_file | string | ✅ (or url) | none | Direct URL or local path to a video/audio file to transcribe. | https://example.com/video.mp4 |
| translate | string | ❌ | none | Target language for translation (e.g. "spanish", "french", "chinese_simplified"). Leave empty to skip translation. | spanish |

At least one of video_url or video_file is required.

{
"video_url":"https://www.youtube.com/watch?v=dQw4w9WgXcQ",
"translate":"spanish"
}
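
Because a run with neither source field fails, it can help to check the input before starting a run. The sketch below is a hypothetical client-side pre-check, not the Actor's own validation: `validate_input` is an illustrative name, and it does not check `translate` against the supported language list because that list is not reproduced here.

```python
def validate_input(actor_input):
    """Return a list of problems found in an Actor input dict (empty if none)."""
    problems = []
    has_url = bool(str(actor_input.get("video_url") or "").strip())
    has_file = bool(str(actor_input.get("video_file") or "").strip())
    if not (has_url or has_file):
        # Mirrors the rule that at least one source field is required.
        problems.append("Provide either 'video_url' or 'video_file'")
    translate = actor_input.get("translate")
    if translate is not None and not isinstance(translate, str):
        # Only the type is checked; language membership is left to the Actor.
        problems.append("'translate' must be a language name string")
    return problems
```

An empty list means the input passes these two checks; the Actor may still reject an unsupported `translate` value.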

📦 Output dataset

Each run writes one record per video to the default Apify dataset, exportable as JSON, JSONL, CSV, Excel, XML, RSS, or via the dataset API. The output contains the original-language transcription and (if requested) the translated transcription.

| Field | Type | Always present? | Description | Example |
|---|---|---|---|---|
| processor | string | ✅ | Apify Actor URL that produced this record. | https://apify.com/TrueFetch/video-to-text |
| processed_at | string | ✅ | ISO 8601 UTC timestamp. | 2026-04-24T09:30:00+00:00 |
| platform | string | optional | Source platform. | Youtube |
| title | string | optional | Video title. | Rick Astley - Never Gonna Give You Up |
| author | string | optional | Video uploader. | Rick Astley |
| thumbnail | string | optional | Video thumbnail URL. | https://... |
| transcription | object | optional | Original-language transcription: language, text, segments[]. | {"language": "English", "text": "...", ...} |
| transcription.text | string | optional | Full transcript text. | "Never gonna give you up..." |
| transcription.segments | array | optional | Timecoded segments: [{"start": "00:00:01.000", "end": "00:00:03.000", "text": "..."}] | [...] |
| translation | object | optional | Translated transcription (if translate was provided): same structure. | {"language": "Spanish", "text": "...", ...} |

Sample output

{
"processor":"https://apify.com/TrueFetch/video-to-text",
"processed_at":"2026-04-24T09:30:00+00:00",
"title":"Rick Astley - Never Gonna Give You Up",
"transcription":{
"language":"English",
"text":"Never gonna give you up, never gonna let you down...",
"segments":[
{"start":"00:00:01.000","end":"00:00:03.000","text":"Never gonna give you up"}
]
},
"translation":{
"language":"Spanish",
"text":"Nunca voy a abandonarte...",
"segments":[...]
}
}
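
Since each segment carries `start`, `end`, and `text`, a record can be turned into a subtitle file with a few lines of code. The following is a minimal sketch that assumes segment timestamps always use the `HH:MM:SS.mmm` form shown in the sample; `to_srt` and `record_to_srt` are illustrative helper names, not part of the Actor.

```python
def to_srt(segments):
    """Render timecoded segments as SubRip (SRT) text."""
    blocks = []
    for index, segment in enumerate(segments, start=1):
        # SRT puts a comma before the milliseconds; the sample output uses a dot.
        start = segment["start"].replace(".", ",")
        end = segment["end"].replace(".", ",")
        blocks.append(f"{index}\n{start} --> {end}\n{segment['text'].strip()}\n")
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


def record_to_srt(record, key="transcription"):
    """Return SRT text for record[key], or "" when that optional field is absent."""
    section = record.get(key) or {}
    return to_srt(section.get("segments") or [])
```

Passing `key="translation"` produces subtitles from the translated segments when a translation was requested.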

💡 Use cases

For content creators repurposing video into written content

  • Problem: a podcaster wants to turn a YouTube video into a blog post with a full transcript.
  • How Video To Text solves it: run with video_url and get transcription.text for immediate use.
  • Outcome: full transcript at $0.30 per video.

For multilingual content teams

  • Problem: a marketing team needs an English podcast transcribed and translated into Spanish and French.
  • How Video To Text solves it: run twice, once with translate: "spanish" and once with translate: "french".
  • Outcome: two translated transcripts at $0.40 each ($0.30 transcript + $0.10 translation).

For AI agents answering questions about video content

  • Problem: an LLM agent needs the spoken content of a video to answer a user's question.
  • How Video To Text solves it: call the Actor over the Apify MCP Server with the video URL.
  • Outcome: the AI assistant's answer is grounded in the full spoken transcript.

💰 Pricing, limits, and performance

| Fact | Value |
|---|---|
| Pricing model | Compute-based |
| Per-transcript price | $0.30 + small actor runtime charge |
| Per-translation price | $0.10 additional (optional) |
| Free-tier monthly allowance | $5 Apify credits ≈ 16 free video transcriptions / month |
| Login / cookies required | No. Works without a platform account |
| Supported platforms | YouTube, Vimeo, and 1000+ yt-dlp-supported sites; also direct file URLs |
| Export formats | JSON, JSONL, CSV, Excel, XML, RSS, dataset API |
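
The free-tier figure follows from simple division, which the sketch below reproduces. It works in whole cents to avoid floating-point rounding and deliberately ignores the small actor runtime charge, whose amount is not stated here, so real totals will be slightly higher. The function names are illustrative.

```python
TRANSCRIPT_CENTS = 30    # $0.30 per transcript
TRANSLATION_CENTS = 10   # $0.10 per translated video
FREE_CREDIT_CENTS = 500  # $5 of monthly Apify credits on the Free plan


def run_cost_cents(videos, translated=0):
    """Cost in cents for `videos` transcripts, `translated` of them translated.

    Excludes the actor runtime charge, which is not quantified in the listing.
    """
    return videos * TRANSCRIPT_CENTS + translated * TRANSLATION_CENTS


def free_runs(translate=False):
    """Whole runs covered by the monthly free credits."""
    per_run = TRANSCRIPT_CENTS + (TRANSLATION_CENTS if translate else 0)
    return FREE_CREDIT_CENTS // per_run
```

For example, `free_runs()` gives 16 and `free_runs(translate=True)` gives 12, while `run_cost_cents(2, translated=2)` gives 80 cents for the two translated runs in the multilingual use case.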

⚖️ Compared with alternatives

| Capability | Video To Text | Manual transcription | OpenAI Whisper API |
|---|---|---|---|
| AI speech-to-text (works even without existing subs) | ✅ | ❌ | ✅ |
| Timecoded segments | ✅ | manual | ✅ |
| Built-in translation | ✅ | ❌ | ❌ (separate step) |
| No API key / infra setup required | ✅ | N/A | ❌ (requires OpenAI key) |
| Setup time to first result | < 2 minutes | hours | requires dev setup |

Verdict. Video To Text is ideal for content teams and AI agents who need AI-powered video transcription and translation without setting up Whisper or managing API keys.


⚠️ Common mistakes to avoid

  1. Not providing video_url or video_file. At least one is required.
  2. Invalid translate language. Use supported language names, e.g. "spanish", "french", "chinese_simplified".
  3. Expecting subtitle extraction. This Actor uses AI speech recognition. For extracting existing SRT subtitles, use Video Subtitles Downloader.
  4. Long videos = longer runtime. Transcription time scales with video length; budget accordingly.

🛠️ Troubleshooting

| Problem | Likely cause | Fix |
|---|---|---|
| Provide either 'video_url' or 'video_file' | Neither input was provided. | Add "video_url": "https://..." or "video_file": "https://..." to your input. |
| 'translate' must be one of: [...] | Unsupported translation language. | Use a language name from the supported list (e.g. "spanish", "french"). |
| Empty transcription | Video has no audio or is very short. | Verify the video has audible speech. Short clips may return empty transcription. |
| Slow run | Long video or heavy audio. | Transcription time scales with video length, so longer videos take more time. |

❓ FAQ

What data does Video To Text return?

Full transcript text, timecoded segments (start, end, text), detected language, and video metadata (title, author, thumbnail), plus a translated version if translate was provided.

How much does Video To Text cost?

$0.30 per transcript + small actor runtime charge. Translation adds $0.10 per video. The Apify Free plan includes $5/month credits ≈ 16 free transcriptions.

Does Video To Text work without existing subtitles?

Yes. It uses Whisper AI speech recognition to transcribe the spoken audio, even if no subtitles exist on the platform.

Does Video To Text require a login?

No. The Actor works with publicly accessible videos on supported platforms without requiring any account or login.


🔌 Integrations

Video To Text plugs straight into the Apify ecosystem:

  • Schedules: run on a schedule to auto-transcribe new video uploads.
  • Webhooks: POST transcript results to your pipeline on ACTOR.RUN.SUCCEEDED.
  • Apify API: run-sync-get-dataset-items for blocking calls, runs for async.
  • MCP Server: call the Actor from Claude, ChatGPT, or Cursor agents.
  • No-code: Make, Zapier, n8n, Pipedream, Google Sheets, Airtable.
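
For the webhook route, a receiving service typically needs to turn the notification into a dataset download URL. The sketch below assumes Apify's default webhook payload, in which `eventType` names the event and `resource` is the run object carrying `defaultDatasetId`; a custom payload template would change those field names. `dataset_items_url` is an illustrative helper.

```python
import urllib.parse


def dataset_items_url(webhook_payload, token, fmt="json"):
    """Return the dataset items URL for a succeeded run, or None otherwise.

    Assumes the default webhook payload shape (eventType + resource).
    """
    if webhook_payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    dataset_id = (webhook_payload.get("resource") or {}).get("defaultDatasetId")
    if not dataset_id:
        return None
    query = urllib.parse.urlencode({"token": token, "format": fmt})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"
```

Fetching the returned URL with a GET request yields the same transcript records that the blocking endpoint returns directly.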

🧱 Sister Actors from TrueFetch

Jobs

Shopping & Deals

Vehicles

Real Estate

Local Business & Food

Classifieds

Social Media

Videos & Text

Content & Articles


📞 Support, changelog, last updated

Support

Maintained by TrueFetch on the Apify Store.

Cost of doing nothing: manually transcribing video requires hours of listening and typing, or expensive transcription services. Video To Text delivers AI-powered transcripts with timestamps at $0.30/video via API. → Try it free with $5 of Apify credits.

Changelog

  • 2026-05-02: README rewritten for SEO + GEO; documented all inputs, output structure, pricing, troubleshooting, and FAQ.
  • 2026-03: Added optional translation feature.
  • 2026-02: Initial release of the Whisper-based video transcription Actor.

{
"@context":"https://schema.org",
"@graph":[
{
"@type":"SoftwareApplication",
"@id":"https://apify.com/TrueFetch/video-to-text#software",
"name":"Video To Text",
"description":"Apify Actor that transcribes video URLs and files using Whisper AI at $0.30 per transcript, returning full text, timecoded segments, and optional translation.",
"applicationCategory":"MultimediaApplication",
"operatingSystem":"Web, Cloud",
"url":"https://apify.com/TrueFetch/video-to-text",
"offers":{
"@type":"Offer",
"price":"0.30",
"priceCurrency":"USD",
"availability":"https://schema.org/InStock"
},
"author":{"@id":"https://apify.com/TrueFetch#author-TrueFetch"},
"publisher":{"@id":"https://apify.com/TrueFetch#publisher"}
},
{
"@type":"Organization",
"@id":"https://apify.com/TrueFetch#publisher",
"name":"TrueFetch",
"url":"https://apify.com/TrueFetch",
"sameAs":["https://t.me/TrueFetch","https://t.me/AiAgentApi"]
},
{
"@type":"FAQPage",
"mainEntity":[
{
"@type":"Question",
"name":"What data does Video To Text return?",
"acceptedAnswer":{
"@type":"Answer",
"text":"Full transcript text, timecoded segments (start, end, text), detected language, and video metadata, plus a translated version if translate was provided."
}
},
{
"@type":"Question",
"name":"How much does Video To Text cost?",
"acceptedAnswer":{
"@type":"Answer",
"text":"$0.30 per transcript plus a small actor runtime charge. Translation adds $0.10 per video. The Apify Free plan includes $5/month credits, enough for ~16 free transcriptions."
}
},
{
"@type":"Question",
"name":"Does Video To Text work without existing subtitles?",
"acceptedAnswer":{
"@type":"Answer",
"text":"Yes. It uses Whisper AI speech recognition to transcribe the spoken audio, even if no subtitles exist on the platform."
}
}
]
},
{
"@type":"BreadcrumbList",
"itemListElement":[
{
"@type":"ListItem",
"position":1,
"name":"Apify",
"item":"https://apify.com"
},
{
"@type":"ListItem",
"position":2,
"name":"Videos & Text",
"item":"https://apify.com/store?category=VIDEO"
},
{
"@type":"ListItem",
"position":3,
"name":"Video To Text",
"item":"https://apify.com/TrueFetch/video-to-text"
}
]
}
]
}

Last Updated: May 2, 2026 · Schema version: 1.1 · Maintained by: TrueFetch
