URL: https://apify.com/automation-lab/youtube-transcript-enhanced

YouTube Transcript Enhanced: SRT, VTT and Analytics · Apify


Pricing

Pay per event

YouTube Transcript Enhanced

Extract YouTube transcripts with SRT/VTT subtitle export, paragraph chunking, keyword search, time range filtering, and text analytics. Works with any public video.

Rating: 1.0 (1)

Developer: Stas Persiianenko (Maintained by Community)

Actor stats: 0 bookmarked, 16 total users, 2 monthly active users, last modified 2 months ago

What does YouTube Transcript Enhanced do?

YouTube Transcript Enhanced extracts transcripts from YouTube videos and adds powerful post-processing features. Beyond raw transcript segments, it provides ready-to-use subtitle files (SRT/VTT), intelligent paragraph grouping, keyword search across the transcript, time range filtering, and text analysis with word counts and keyword extraction.

It uses YouTube's public InnerTube API to access caption tracks, the same API YouTube's own player uses. Both manual captions and auto-generated subtitles are supported across 100+ languages.

Why use YouTube Transcript Enhanced?

  • Multiple output formats: Export as SRT subtitles, VTT subtitles, timestamped plain text, or JSON segments
  • Paragraph chunking: Groups small caption segments into coherent paragraphs using pause detection
  • Keyword search: Find specific content in transcripts with timestamp references
  • Time range filtering: Extract only the portion of the transcript you need
  • Text analytics: Word count, reading time, unique words, and top keyword extraction
  • Full metadata: Video title, channel, views, duration, keywords, thumbnail, publish date
  • Batch processing: Process multiple videos in a single run
  • Language selection: Choose preferred language with smart fallback logic

Use cases

  • Content repurposing: Convert video transcripts into blog posts, articles, or social media content
  • Subtitle generation: Get SRT/VTT files for videos that lack proper subtitles
  • Research: Search transcripts for specific topics or keywords across multiple videos
  • SEO analysis: Extract keywords and topics from video content
  • Accessibility: Generate formatted transcripts for hearing-impaired users
  • Education: Extract and chunk lecture transcripts into study materials

Input parameters

Parameter | Type | Required | Default | Description
urls | string[] | Yes | - | YouTube video URLs or 11-character video IDs
language | string | No | en | ISO 639-1 language code for preferred transcript
includeAutoGenerated | boolean | No | true | Allow auto-generated captions as fallback
outputFormat | string | No | json | Output format: json, srt, vtt, or text
chunkParagraphs | boolean | No | false | Group segments into paragraphs
searchKeywords | string | No | - | Comma-separated keywords to search in transcript
timeRangeStart | integer | No | - | Start time in seconds for time range filter
timeRangeEnd | integer | No | - | End time in seconds for time range filter
includeTextAnalysis | boolean | No | true | Include word count, reading time, top keywords
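
Since the urls field accepts either full URLs or bare 11-character video IDs, it can help to normalize inputs before starting a run. The page does not document how the actor itself parses URLs, so the following is only a sketch under assumptions: the helper name extract_video_id and the set of accepted URL shapes (watch, youtu.be, shorts, embed, live) are hypothetical choices, not the actor's implementation.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumption: a video ID is 11 characters from the URL-safe base64 alphabet.
VIDEO_ID_RE = re.compile(r"^[A-Za-z0-9_-]{11}$")


def extract_video_id(value: str) -> Optional[str]:
    """Return the 11-character video ID from a URL or bare ID, or None."""
    value = value.strip()
    if VIDEO_ID_RE.match(value):
        return value  # already a bare ID

    parsed = urlparse(value)
    host = (parsed.hostname or "").lower()
    candidate = ""
    if host == "youtu.be":
        # Short links carry the ID as the first path component.
        candidate = parsed.path.lstrip("/").split("/")[0]
    elif host == "youtube.com" or host.endswith(".youtube.com"):
        if parsed.path == "/watch":
            candidate = parse_qs(parsed.query).get("v", [""])[0]
        else:
            # Hypothetical support for /shorts/<id>, /embed/<id>, /live/<id>.
            parts = [p for p in parsed.path.split("/") if p]
            if len(parts) >= 2 and parts[0] in {"shorts", "embed", "live"}:
                candidate = parts[1]
    return candidate if VIDEO_ID_RE.match(candidate) else None
```

Inputs that return None can be dropped or reported before the run starts, which avoids paying the per-run start charge for a batch that contains only malformed entries.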

Output example

{
  "videoId": "jNQXAC9IVRw",
  "videoUrl": "https://www.youtube.com/watch?v=jNQXAC9IVRw",
  "videoTitle": "Me at the zoo",
  "channelName": "jawed",
  "language": "en",
  "isAutoGenerated": true,
  "segmentCount": 5,
  "fullText": "All right so here we are...",
  "srt": "1\n00:00:01,000 --> 00:00:04,000\nAll right so here we are...",
  "paragraphs": [
    {
      "text": "All right so here we are in front of the elephants...",
      "startTime": 1.0,
      "endTime": 12.5,
      "segmentCount": 3
    }
  ],
  "paragraphCount": 2,
  "textAnalysis": {
    "wordCount": 42,
    "uniqueWordCount": 30,
    "characterCount": 210,
    "readingTimeMinutes": 0.2,
    "topKeywords": [
      {"word": "elephants", "count": 2},
      {"word": "cool", "count": 2}
    ]
  },
  "wordCount": 42,
  "readingTimeMinutes": 0.2,
  "enrichedAt": "2026-03-01T12:00:00.000Z"
}

Output formats explained

Format | Description | Output field
json | Raw transcript segments with start time and duration | segments array (always included)
srt | SubRip subtitle format, ready to use with video players | srt string
vtt | WebVTT subtitle format, compatible with HTML5 video | vtt string
text | Plain text with [MM:SS] timestamps per line | formattedText string
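
To illustrate how the srt and vtt strings relate to the raw segments, here is a minimal sketch of the conversion. It is not the actor's code. The segment keys text, start, and duration are assumptions (the table only says each segment carries a start time and a duration), and the function names are hypothetical.

```python
def _timestamp(seconds: float, ms_separator: str) -> str:
    """Format seconds as HH:MM:SS<sep>mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{ms_separator}{ms:03d}"


def srt_timestamp(seconds: float) -> str:
    # SubRip separates milliseconds with a comma.
    return _timestamp(seconds, ",")


def segments_to_srt(segments: list) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        start = seg["start"]               # assumed key: start time in seconds
        end = start + seg["duration"]      # assumed key: duration in seconds
        cues.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{seg['text']}"
        )
    return "\n\n".join(cues) + "\n"


def segments_to_vtt(segments: list) -> str:
    """Render segments as WebVTT: a WEBVTT header, then cues using a dot separator."""
    cues = []
    for seg in segments:
        start = seg["start"]
        end = start + seg["duration"]
        cues.append(
            f"{_timestamp(start, '.')} --> {_timestamp(end, '.')}\n{seg['text']}"
        )
    return "WEBVTT\n\n" + "\n\n".join(cues) + "\n"
```

The only differences between the two formats in this sketch are the WEBVTT header, the millisecond separator, and the cue numbers, which SRT requires and WebVTT treats as optional.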

Enhancement features

Paragraph chunking

When chunkParagraphs is enabled, the actor groups small segments into paragraphs based on natural pause detection (gaps > 1.5 seconds between segments). Each paragraph includes start/end times and the merged text.
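
The pause-based grouping described above can be sketched as follows. This is an illustration under assumptions, not the actor's implementation: the segment keys text, start, and duration are assumed, and the gap is measured from the end of the previous segment to the start of the next. The paragraph keys mirror the output example.

```python
def chunk_paragraphs(segments: list, max_gap: float = 1.5) -> list:
    """Group consecutive segments; start a new paragraph when the pause exceeds max_gap."""
    paragraphs = []
    current = None
    previous_end = None
    for seg in segments:
        start = seg["start"]              # assumed key: start time in seconds
        end = start + seg["duration"]     # assumed key: duration in seconds
        if current is None or start - previous_end > max_gap:
            # First segment, or a pause longer than max_gap: open a new paragraph.
            current = {
                "text": seg["text"],
                "startTime": start,
                "endTime": end,
                "segmentCount": 1,
            }
            paragraphs.append(current)
        else:
            # Short pause: merge the segment into the current paragraph.
            current["text"] += " " + seg["text"]
            current["endTime"] = end
            current["segmentCount"] += 1
        previous_end = end
    return paragraphs
```

For example, three segments ending at 2.0 s, starting at 2.5 s, and starting at 6.0 s would yield two paragraphs, because only the second gap (2.0 s) exceeds the 1.5-second threshold.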

Keyword search

Set searchKeywords to a comma-separated list (e.g., "AI, machine learning, neural") to search the transcript. Returns matching segments with the keyword that triggered the match and the timestamp.
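
A rough sketch of this kind of search is shown below. The matching rule (case-insensitive substring match) and the result keys keyword, startTime, and text are assumptions for illustration; the page does not specify the actor's exact matching behavior or result shape.

```python
def search_keywords(segments: list, keywords: str) -> list:
    """Return one match per (segment, keyword) pair found in the transcript."""
    # Split the comma-separated input and drop empty entries.
    terms = [term.strip().lower() for term in keywords.split(",") if term.strip()]
    matches = []
    for seg in segments:
        text = seg["text"].lower()        # assumed key: segment text
        for term in terms:
            if term in text:              # assumption: plain substring match
                matches.append({
                    "keyword": term,
                    "startTime": seg["start"],   # assumed key: start in seconds
                    "text": seg["text"],
                })
    return matches
```

Because the match runs against caption text, a misspelled auto-generated caption will not match the correctly spelled keyword, which is the situation the FAQ entry on missing results describes.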

Time range filtering

Use timeRangeStart and timeRangeEnd (in seconds) to extract only a portion of the transcript. For example, timeRangeStart: 60, timeRangeEnd: 300 extracts only the portion from 1:00 to 5:00.
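
The filter can be pictured with the sketch below. How the actor treats segments that straddle a boundary is not documented, so this version makes an assumption: a segment is kept when its start time falls inside the range, with both bounds inclusive. The segment key start is also assumed.

```python
from typing import Optional


def filter_time_range(
    segments: list,
    start: Optional[float] = None,
    end: Optional[float] = None,
) -> list:
    """Keep segments whose start time lies within [start, end]; None means unbounded."""
    kept = []
    for seg in segments:
        seg_start = seg["start"]          # assumed key: start time in seconds
        if start is not None and seg_start < start:
            continue
        if end is not None and seg_start > end:
            continue
        kept.append(seg)
    return kept
```

Leaving either bound as None mirrors the optional parameters in the input table: only a start gives "from this point onward", only an end gives "up to this point".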

Text analysis

When includeTextAnalysis is enabled, the output includes word count, unique word count, character count, estimated reading time (~200 WPM), and the top 20 most frequent meaningful keywords (stop words excluded).
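
The statistics listed above can be approximated with a few lines of Python. This is a sketch under assumptions: the tokenizer, the tiny stop-word list, and the rule that skips words of two characters or fewer are illustrative choices, and the actor's own stop-word list and rounding may differ. The output keys follow the textAnalysis object in the output example.

```python
import re
from collections import Counter

# Assumption: a deliberately small stop-word list for illustration only.
STOP_WORDS = {
    "the", "and", "are", "is", "that", "this", "have", "has", "was",
    "were", "for", "with", "you", "but", "not", "they", "here", "there",
}


def analyze_text(text: str, words_per_minute: int = 200, top_n: int = 20) -> dict:
    """Compute word counts, reading time, and the most frequent non-stop-words."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    keyword_counts = Counter(
        word for word in words if word not in STOP_WORDS and len(word) > 2
    )
    return {
        "wordCount": len(words),
        "uniqueWordCount": len(set(words)),
        "characterCount": len(text),
        # Reading time at roughly 200 words per minute, rounded to one decimal.
        "readingTimeMinutes": round(len(words) / words_per_minute, 1),
        "topKeywords": [
            {"word": word, "count": count}
            for word, count in keyword_counts.most_common(top_n)
        ],
    }
```

At 200 words per minute, the 42-word transcript in the output example works out to 42 / 200 = 0.21 minutes, which rounds to the 0.2 shown there.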

How to extract enhanced YouTube transcripts

  1. Open YouTube Transcript Enhanced on Apify.
  2. Enter one or more YouTube video URLs or video IDs in the urls field.
  3. Choose your preferred outputFormat (json, srt, vtt, or text).
  4. Optionally enable chunkParagraphs, set searchKeywords, or specify a time range.
  5. Click Start and wait for the extraction to finish.
  6. Download results as JSON, CSV, or Excel from the Dataset tab.

How much does it cost to extract YouTube transcripts?

YouTube Transcript Enhanced uses pay-per-event pricing:

Event | Price | Description
Actor start | $0.035 | Charged once per run
Transcript enriched | $0.005 | Charged per successfully enriched transcript

Example costs:

  • 1 video: $0.035 + $0.005 = $0.04
  • 10 videos: $0.035 + (10 x $0.005) = $0.085
  • 100 videos: $0.035 + (100 x $0.005) = $0.535
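
The example costs follow one formula: the start charge plus the per-transcript charge times the number of successfully enriched transcripts. A small sketch for estimating a run, assuming the prices listed above still apply (the function name estimate_cost is hypothetical):

```python
from decimal import Decimal

# Prices as listed in the table above; check the Pricing tab for current rates.
ACTOR_START = Decimal("0.035")      # charged once per run
PER_TRANSCRIPT = Decimal("0.005")   # charged per successfully enriched transcript


def estimate_cost(enriched_transcripts: int, runs: int = 1) -> Decimal:
    """Estimate the cost in USD for a number of enriched transcripts across runs."""
    return ACTOR_START * runs + PER_TRANSCRIPT * enriched_transcripts
```

Decimal is used here to avoid floating-point rounding in currency arithmetic. Because the start charge applies per run, 100 videos in one run would cost $0.535, while the same 100 videos split across 10 runs would cost $0.85.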

Using the Apify API

Node.js

import fs from 'node:fs';
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

// Start the actor and wait for the run to finish
const run = await client.actor('automation-lab/youtube-transcript-enhanced').call({
    urls: ['https://www.youtube.com/watch?v=dQw4w9WgXcQ'],
    outputFormat: 'srt',
    chunkParagraphs: true,
    includeTextAnalysis: true,
});

// Read the results from the run's default dataset
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.log(`${item.videoTitle}: ${item.wordCount} words, ${item.readingTimeMinutes} min read`);
    console.log(`Top keywords: ${item.textAnalysis.topKeywords.map((k) => k.word).join(', ')}`);
    // Save SRT file
    if (item.srt) fs.writeFileSync(`${item.videoId}.srt`, item.srt);
});

Python

from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

# Start the actor and wait for the run to finish
run = client.actor("automation-lab/youtube-transcript-enhanced").call(run_input={
    "urls": ["https://www.youtube.com/watch?v=dQw4w9WgXcQ"],
    "outputFormat": "srt",
    "chunkParagraphs": True,
    "includeTextAnalysis": True,
})

# Iterate over the results in the run's default dataset
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(f"{item['videoTitle']}: {item['wordCount']} words")
    # Save SRT file
    if item.get("srt"):
        with open(f"{item['videoId']}.srt", "w", encoding="utf-8") as f:
            f.write(item["srt"])

Integrations

  • Google Sheets: Export transcript data and analytics to spreadsheets
  • Webhooks: Get notified when transcript extraction completes
  • Zapier / Make: Automate workflows triggered by new transcripts
  • Other Apify actors: Chain with scrapers that collect YouTube URLs

cURL:

curl -X POST "https://api.apify.com/v2/acts/automation-lab~youtube-transcript-enhanced/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://www.youtube.com/watch?v=dQw4w9WgXcQ"],"outputFormat":"srt","chunkParagraphs":true,"includeTextAnalysis":true}'

Use with AI agents via MCP

YouTube Transcript Enhanced is available as a tool for AI assistants via the Model Context Protocol (MCP).

Setup for Claude Code

$ claude mcp add --transport http apify "https://mcp.apify.com?tools=automation-lab/youtube-transcript-enhanced"

Setup for Claude Desktop, Cursor, or VS Code

{
  "mcpServers": {
    "apify": {
      "url": "https://mcp.apify.com?tools=automation-lab/youtube-transcript-enhanced"
    }
  }
}

Example prompts

  • "Get an enriched transcript with timestamps for this video"
  • "Extract and summarize this YouTube video's content"
  • "Download SRT subtitles for this lecture and find all mentions of 'neural network'"

Learn more in the Apify MCP documentation.

FAQ

What's the difference between this and the basic YouTube Transcript Scraper? YouTube Transcript Enhanced adds SRT/VTT subtitle export, paragraph chunking, keyword search, time range filtering, and text analytics (word count, reading time, top keywords). Use the basic scraper if you just need raw transcript segments; use Enhanced if you need formatted output or post-processing features.

Keyword search returns no results even though the word is in the video. Why? The keyword search matches against the transcript text, not audio. If the video's captions are auto-generated, words may be misspelled or split differently. Try searching for partial keywords or common variations. Also check that the transcript language matches: if captions are in Spanish but you're searching for English words, there won't be matches.

Tips and best practices

  • Use outputFormat: "srt" or "vtt" when you need subtitle files for video editing
  • Enable chunkParagraphs for content repurposing, since paragraphs are easier to read than raw segments
  • Use searchKeywords to quickly find relevant sections in long videos
  • Time range filtering is useful for extracting specific sections from lectures or interviews
  • The fullText field is always included and ready for further text processing
  • For batch processing, all videos share the same settings, so use separate runs for different configurations

Compliance

This actor uses YouTube's public InnerTube API to access caption tracks, the same API used by YouTube's own video player. It accesses only publicly available video metadata and captions. No login credentials or private data are used.

Who is it for

YouTube Transcript Enhanced is built for content creators repurposing video content into articles, researchers analyzing video transcripts at scale, developers building subtitle tools, and educators extracting lecture content for study materials.

Pricing

This actor uses pay-per-event pricing. See the Pricing tab on Apify Store for current rates.

API usage

You can run this actor programmatically using the Apify API or the Apify client libraries for Node.js and Python. See the code examples above for usage details.

MCP

This actor is compatible with Model Context Protocol (MCP). Use it with AI assistants via the Apify MCP server.

Legality

Scraping publicly available data is generally legal. This actor only accesses publicly available pages. Users are responsible for compliance with applicable laws and the target site's Terms of Service.
