YouTube Transcript API | Video to Text Scraper for AI
Pricing
from $1.00 / 1,000 transcripts
YouTube Transcript API | Video to Text Scraper for AI
Extract full transcripts and time-coded captions from any YouTube video. Build custom AI datasets, train LLMs, or repurpose video content.
Pricing
from $1.00 / 1,000 transcripts
Rating
0.0
(0)
Developer
Actor stats
1
Bookmarked
25
Total users
3
Monthly active users
3 months ago
Last modified
Categories
Share
YouTube Transcript Scraper for AI & RAG
Turn any YouTube video into clean, structured text ready for LLM context windows, RAG pipelines, or full-text search. Extracts captions directly from YouTube's internal player data β no browser, no API key, no rate-limit headaches. Process hundreds of videos in parallel with configurable concurrency.
Features
- Full transcript extraction β pulls the complete caption track from any video with available subtitles
- Timed segments β returns individual segments with offset and duration for precise referencing
- Bulk processing β extract transcripts from hundreds of videos in a single run
- Parallel execution β configurable concurrency to balance speed and reliability
- AI-ready output β concatenated plain text field perfect for LLM ingestion
- No API key required β works directly with YouTube's public caption data
Input
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
urls | array | Yes | β | YouTube video URLs to extract transcripts from |
concurrency | integer | No | 10 | Number of videos to process in parallel. Increase for large batches, decrease if you see errors. |
Input Example
{"urls":["https://www.youtube.com/watch?v=dQw4w9WgXcQ","https://www.youtube.com/watch?v=jNQXAC9IVRw"],"concurrency":10}
Output
Each video produces one dataset row containing the full transcript as a single text block, plus an array of timed segments for granular access.
Key output fields:
| Field | Type | Description |
|---|---|---|
inputUrl | string | The video URL you provided |
videoId | string | YouTube video ID |
fullText | string | Complete transcript as a single text block |
segments | array | Individual caption segments with timing data |
segments[].text | string | Text content of the segment |
segments[].offset | number | Start time in milliseconds |
segments[].duration | number | Segment duration in milliseconds |
Output Example
{"inputUrl":"https://www.youtube.com/watch?v=jNQXAC9IVRw","videoId":"jNQXAC9IVRw","fullText":"Welcome to the presentation. Today we will cover three key topics in machine learning. First, let us look at supervised learning and how it differs from unsupervised approaches...","segments":[{"text":"Welcome to the presentation.","offset":0,"duration":2400},{"text":"Today we will cover three key topics","offset":2400,"duration":3100},{"text":"in machine learning.","offset":5500,"duration":1800}]}
Pricing
Pay per event on Apify platform.
| Event | Description |
|---|---|
| Transcript | One video transcript extracted |
Use Cases
- RAG pipelines β feed video transcripts into vector databases for retrieval-augmented generation
- Content repurposing β convert video content into blog posts, summaries, or social media threads
- Research analysis β make lecture and conference talk content searchable and quotable
- SEO audits β analyze what competitors say in their videos without watching them
- Training data β build domain-specific datasets from educational YouTube content
Related Actors
| Actor | What it adds |
|---|---|
| YouTube Transcript MCP Server | AI-powered transcription via MCP β works even on videos without captions |
| YouTube Video Metadata Extractor | Get views, likes, tags, and publish dates alongside your transcripts |
| Markdown Extractor | Convert web pages to clean markdown β complement video transcripts with written sources |
