URL: https://apify.com/crawlerbros/khanacademy-scraper

Khan Academy Scraper · Apify


Pricing

from $3.00 / 1,000 results

Khan Academy Scraper

Scrape Khan Academy's free, CC-licensed educational content. Search by keyword, fetch by path, URL, or subject, list all courses, and look up videos by YouTube ID. Returns videos with download URLs, articles, exercises, courses, and units.

Developer

๐Ÿ‘ Crawler Bros

Crawler Bros

Maintained by Community

Scrape free, CC-licensed educational content from Khan Academy. The scraper exposes search, the full topic browser, by-subject course listings, direct path/URL lookups for videos / articles / exercises / courses, and a YouTube-ID lookup for any video Khan Academy publishes.

Khan Academy is a freely accessible educational platform: no login or API key is required. The scraper uses Khan Academy's own public GraphQL endpoints (the same ones the website calls) and returns clean structured records with rich video metadata (download URLs, subtitles, durations) for downstream use in research, content discovery, dataset building, or curriculum tools.

Features

  • Search: full-text search across videos, articles, exercises and topics
  • Topic browser: list every course Khan Academy offers, grouped by category
  • By subject: list all courses under a major root subject (Math, Science, Computing, Test Prep, etc.)
  • By path / URL: fetch a single course, video, article or exercise directly
  • By YouTube video ID: look up any Khan Academy-published video by its YouTube ID
  • Optional unit expansion: emit a record per course unit alongside the course record
  • Filtering: restrict by subject, content kind, duration window, or keyword in title/description

Use cases

  • Educators: build curriculum playlists, find aligned exercises and articles for a topic
  • Researchers: assemble open educational resource (OER) datasets, all CC-licensed
  • Content creators: discover Khan Academy videos for a topic; pull download URLs and metadata
  • EdTech: map a topic graph, mirror lesson content into your platform
  • Translators / accessibility tools: detect subtitle availability per language

Input

The actor accepts a single input object. Required field: mode.

| Field | Type | Description |
| --- | --- | --- |
| mode | enum | One of search, byPaths, byUrls, bySubject, topicBrowser, byVideoIds |
| searchQuery | string | Free-text query (mode=search) |
| subject | enum | Root subject for mode=bySubject: math, science, computing, humanities, test-prep, ela, economics-finance-domain, partner-content, college-careers-more, khan-for-educators, ai-activities |
| subjects | enum[] | Restrict mode=search or mode=topicBrowser to these subjects |
| contentKinds | enum[] | Restrict mode=search to specific content kinds: Video, Article, Exercise, Topic |
| paths | string[] | Khan Academy content paths (e.g. math/algebra) |
| urls | string[] | Full khanacademy.org URLs |
| videoIds | string[] | YouTube IDs to resolve back into KA videos |
| containsKeyword | string | Drop records whose title/description/keywords do not contain this string (case-insensitive) |
| minDurationSeconds | integer | Drop videos shorter than this |
| maxDurationSeconds | integer | Drop videos longer than this |
| includeUnits | boolean | For course paths/URLs: emit one record per unit in addition to the course record |
| maxItems | integer | Hard cap on emitted records (default 50, max 5000) |
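The constraints in the field list above can be checked on the client before a run is started. The following is a minimal sketch and not part of the actor: `build_input` is a hypothetical helper, the pairing of each mode with the field it reads is inferred from the field descriptions, and the lower bound of 1 for maxItems is an assumption. The source itself only states that mode is required and that maxItems defaults to 50 with a maximum of 5000.

```python
# Hypothetical client-side helper; the actor itself may validate differently.
VALID_MODES = {"search", "byPaths", "byUrls", "bySubject", "topicBrowser", "byVideoIds"}

# Assumed pairing of each mode with the field it reads (inferred from the field list).
MODE_FIELD = {
    "search": "searchQuery",
    "byPaths": "paths",
    "byUrls": "urls",
    "bySubject": "subject",
    "byVideoIds": "videoIds",
}

MAX_ITEMS_DEFAULT = 50   # documented default
MAX_ITEMS_LIMIT = 5000   # documented maximum


def build_input(mode, max_items=MAX_ITEMS_DEFAULT, **fields):
    """Return an input dict for the actor, raising ValueError on obvious mistakes."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    needed = MODE_FIELD.get(mode)  # topicBrowser needs no extra field
    if needed and not fields.get(needed):
        raise ValueError(f"mode={mode} expects a non-empty {needed!r}")
    # Lower bound of 1 is an assumption; only the upper bound is documented.
    if not 1 <= max_items <= MAX_ITEMS_LIMIT:
        raise ValueError(f"maxItems must be between 1 and {MAX_ITEMS_LIMIT}")
    # Drop unset optional fields so the payload stays minimal.
    payload = {key: value for key, value in fields.items() if value is not None}
    payload["mode"] = mode
    payload["maxItems"] = max_items
    return payload
```

For instance, `build_input("bySubject", subject="math")` would produce the same object as the first example below, while `build_input("bySubject")` raises because no subject was given.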

Example: list all courses under Math

{
  "mode": "bySubject",
  "subject": "math",
  "maxItems": 50
}

Example: search videos about photosynthesis

{
  "mode": "search",
  "searchQuery": "photosynthesis",
  "contentKinds": ["Video"],
  "maxItems": 25
}

Example: fetch a specific course + its units

{
  "mode": "byUrls",
  "urls": ["https://www.khanacademy.org/math/algebra"],
  "includeUnits": true
}

Example: full topic-browser tree

{
  "mode": "topicBrowser",
  "maxItems": 500
}
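Any of the example inputs above could also be submitted programmatically. The sketch below uses only the Python standard library and assumes Apify's REST endpoint for running an actor synchronously and returning its dataset items, plus a personal Apify API token (the actor needs no Khan Academy credentials, but calls to the Apify API are authenticated). `build_run_request` and `run_actor` are illustrative names, and the actor ID is derived from this page's URL.

```python
import json
import urllib.request

# Actor ID taken from the store URL; the REST API writes "user/name" as "user~name".
ACTOR_ID = "crawlerbros~khanacademy-scraper"
API_BASE = "https://api.apify.com/v2"


def build_run_request(run_input, token, actor_id=ACTOR_ID):
    """Build a POST request that runs the actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(run_input, token, timeout=300):
    """Send the request and decode the JSON array of records (needs network access)."""
    request = build_run_request(run_input, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call such as `run_actor({"mode": "topicBrowser", "maxItems": 500}, token)` would then return the records as a list of dicts. A synchronous call may not suit long runs with a large maxItems; starting the run asynchronously and reading the dataset afterwards is the likely alternative.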

Output

Each record is pushed to the default dataset. Fields are emitted only when populated (no nulls). Common fields across record types:

  • id: Khan Academy content ID (e.g. 19647488, x2f8bb11595b61c86)
  • slug: URL-safe slug
  • kind: One of Video, Article, Exercise, Topic, Course, Unit, Project
  • title: Translated title
  • description: HTML-stripped description
  • url: Direct Khan Academy URL
  • subject: Primary root subject slug
  • recordType: Stable record-type label (video, article, exercise, course, unit, etc.)
  • scrapedAt: UTC ISO timestamp

Video-specific:

  • youtubeId + youtubeUrl
  • durationSeconds
  • thumbnailUrl
  • downloadUrls: {m3u8, mp4, mp4-low, mp4-low-ios, png} direct CDN URLs
  • subtitleLanguages: language codes with translated subtitles
  • authorNames, keywords, dateAdded, license, language, educationalLevel

Course-specific:

  • unitCount, lessonCount, masteryEnabled, curriculumKey, iconUrl

Sample video record

{
  "id": "19647488",
  "slug": "negative-numbers-introduction",
  "kind": "Video",
  "title": "Intro to negative numbers",
  "description": "Mysterious negative numbers! What ARE they? ...",
  "youtubeId": "Hlal9ME2Aig",
  "youtubeUrl": "https://www.youtube.com/watch?v=Hlal9ME2Aig",
  "durationSeconds": 576,
  "thumbnailUrl": "https://cdn.kastatic.org/googleusercontent/...",
  "authorNames": ["Sal Khan"],
  "downloadUrls": {
    "m3u8": "https://cdn.kastatic.org/ka-youtube-converted/Hlal9ME2Aig.m3u8/Hlal9ME2Aig.m3u8",
    "mp4": "https://cdn.kastatic.org/ka-youtube-converted/Hlal9ME2Aig.mp4/Hlal9ME2Aig.mp4"
  },
  "dateAdded": "2011-02-20T16:51:16Z",
  "language": "en",
  "license": "cc-by-nc-sa",
  "url": "https://www.khanacademy.org/math/arithmetic-home/negative-numbers/neg-num-intro/v/negative-numbers-introduction",
  "subject": "math",
  "recordType": "video",
  "scrapedAt": "2026-05-21T09:17:47Z"
}
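Because fields are emitted only when populated, downstream code should read records defensively rather than index keys directly. A minimal sketch, assuming records shaped like the sample above: `pick_download_url`, `videos_within`, and the format preference order are illustrative choices, not something the actor prescribes.

```python
# Assumed preference order; the keys come from the downloadUrls field description.
PREFERRED_FORMATS = ("mp4", "mp4-low", "mp4-low-ios", "m3u8")


def pick_download_url(record, formats=PREFERRED_FORMATS):
    """Return the first available download URL for a record, else None."""
    urls = record.get("downloadUrls") or {}
    for fmt in formats:
        if fmt in urls:
            return urls[fmt]
    return None


def videos_within(records, min_seconds=0, max_seconds=None):
    """Yield video records whose durationSeconds falls inside the window."""
    for record in records:
        if record.get("recordType") != "video":
            continue
        duration = record.get("durationSeconds")
        # Skip videos with no duration or one below the lower bound.
        if duration is None or duration < min_seconds:
            continue
        if max_seconds is not None and duration > max_seconds:
            continue
        yield record
```

Applied to the sample record, `pick_download_url` returns the mp4 URL, since mp4 precedes m3u8 in the assumed order. Non-video records, which carry no downloadUrls, yield None.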

Data source

Khan Academy is a non-profit educational organization that provides free, CC-licensed (CC-BY-NC-SA) lessons in math, science, the arts, computer programming, economics, and more.

This scraper hits Khan Academy's public GraphQL endpoints, the same ones used by their website. No login, cookies or API key are required, and the actor runs on the free Apify plan without any paid proxy add-ons.

FAQ

Does this require login or an API key? No. Khan Academy's content is freely accessible. The scraper uses public endpoints with no authentication.

Do I need to provide a proxy? No. The scraper works from datacenter IPs out-of-the-box.

What is the license of the returned data? Khan Academy content is licensed under CC-BY-NC-SA 3.0 unless otherwise noted. You can reuse it for non-commercial purposes with attribution. Verify the license field on each record.

Can I get transcripts? Khan Academy stores subtitle files per language; the actor exposes the available subtitle languages in subtitleLanguages. To fetch the actual transcript bodies, follow up with the YouTube ID against the YouTube transcript API or KA's subtitles endpoint.
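Subtitle coverage can be summarized from the dataset before any transcript is fetched. A minimal sketch, assuming subtitleLanguages is a list of language-code strings (the field is described only as "language codes") and that it is absent when a video has no subtitles; `subtitle_coverage` and `missing_language` are hypothetical helpers.

```python
from collections import Counter


def subtitle_coverage(records):
    """Count how many records list each subtitle language code."""
    counts = Counter()
    for record in records:
        # The field may be missing because the actor omits unpopulated fields.
        counts.update(set(record.get("subtitleLanguages", [])))
    return counts


def missing_language(records, language):
    """Return the youtubeId of each video record lacking subtitles in `language`."""
    return [
        record["youtubeId"]
        for record in records
        if "youtubeId" in record
        and language not in record.get("subtitleLanguages", [])
    ]
```

The IDs returned by `missing_language` are the ones a translation or accessibility workflow would follow up on.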

Why are some titles different from what I see on the site? Khan Academy localizes content per visitor region. The actor uses the en/US locale by default.

What if Khan Academy updates their GraphQL schema? The cacheable persisted queries are tied to a publish version (pcv) that the actor fetches dynamically. Search queries are sent as fully inlined GraphQL strings so they survive schema bumps that don't change field shape.

Can I run this on the free Apify plan? Yes. No proxy, no add-ons, no user-supplied credentials needed.
