VOOZH about

URL: https://apify.com/crawlerbros/edx-scraper

โ‡ฑ edX Course Scraper ยท Apify


Pricing

from $3.00 / 1,000 results

Go to Apify Store

Scrape edX - the world's leading MOOC platform. Search courses, browse by subject or university, fetch specific course URLs. Extracts title, institution, level, duration, effort, pricing, enrollment count, rating, skills, and more.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

22 days ago

Last modified

Share

Scrape online courses from edX โ€” one of the world's leading MOOC platforms featuring courses from MIT, Harvard, Stanford, Berkeley, IBM, Microsoft, and 200+ top institutions worldwide. Acquired by 2U, edX hosts thousands of courses, professional certificates, MicroMasters programs, and degree programs across dozens of subjects.

Features

  • Search mode โ€” find courses by keyword (e.g., "python programming", "machine learning", "data science")
  • Browse by subject โ€” get all courses for a specific topic (computer science, data analysis, AI, finance, etc.)
  • Browse by university โ€” get all courses from a specific institution (harvardx, mitx, berkeleyx, ibm, etc.)
  • Single course URL โ€” scrape one or more specific edX course pages for complete details
  • Filters: level (Introductory / Intermediate / Advanced), course type (Course / Professional Certificate / MicroMasters / etc.), free-only, language
  • Output includes: title, institution, subject, level, duration, effort, pricing, enrollment count, skills, availability, pacing, image URL, course URL, and more

Input

FieldTypeDescription
modeselectsearch (default), bySubject, byUniversity, byUrl
searchQuerystringText search query (mode=search)
subjectselectSubject slug โ€” e.g. computer-science, data-analysis (mode=bySubject)
universitySlugstringUniversity slug โ€” e.g. harvardx, mitx, ibm (mode=byUniversity)
courseUrlsarrayList of edX course URLs to scrape (mode=byUrl)
filterLevelselectFilter by level: Introductory / Intermediate / Advanced
filterCourseTypeselectFilter by type: Course / Professional Certificate / MicroMasters / etc.
filterIsFreebooleanOnly return free courses
filterLanguagestringISO 639-1 language code (e.g. en, es, fr)
maxItemsintegerMax number of records to return (1โ€“1000, default 50)

Example Inputs

Search for Python courses:

{
"mode":"search",
"searchQuery":"python programming",
"maxItems":20
}

Browse CS courses from Harvard:

{
"mode":"byUniversity",
"universitySlug":"harvardx",
"filterCourseType":"Course",
"maxItems":50
}

Browse AI/ML courses:

{
"mode":"bySubject",
"subject":"artificial-intelligence",
"filterLevel":"Intermediate",
"maxItems":30
}

Output

Each record contains:

FieldTypeDescription
courseIdstringedX product UUID
titlestringCourse title
descriptionstringFull course description (HTML stripped)
institutionstringPrimary university/institution name
institutionsarrayAll institution names (for multi-partner courses)
subjectstringPrimary subject
subjectsarrayAll subject categories
levelstringIntroductory / Intermediate / Advanced
durationWeeksintegerTotal duration in weeks
durationWeeksMinintegerMinimum weeks to complete
durationWeeksMaxintegerMaximum weeks to complete
effortHoursPerWeekMinnumberMin hours per week effort
effortHoursPerWeekMaxnumberMax hours per week effort
pacingstringself_paced or instructor_paced
languagestringPrimary language
availabilitystringCurrent / Archived / etc.
isFreebooleanWhether the course is free to audit
pricenumberFull price in USD
originalPricenumberOriginal price before discount
currencystringCurrency code (e.g. USD)
enrollmentCountintegerRecent enrollment count
skillsarraySkills covered in the course
courseTypestringCourse / Professional Certificate / MicroMasters / etc.
imageUrlstringCourse thumbnail URL
courseUrlstringFull edX course URL
startDatestringGo-live date (ISO format)
productSourcestringedX
sourceUrlstringURL that was scraped
scrapedAtstringUTC timestamp of scrape
recordTypestringAlways "course"

Sample Output

{
"courseId":"b3c02aea-cbf6-4fc4-a730-0433860e2a35",
"title":"Python for Data Science",
"description":"Learn to use powerful, open-source, Python tools including Pandas, Git and Matplotlib...",
"institution":"The University of California, San Diego",
"subject":"Computer Science",
"subjects":["Computer Science","Data Analysis & Statistics"],
"level":"Intermediate",
"durationWeeks":10,
"effortHoursPerWeekMin":3,
"effortHoursPerWeekMax":5,
"pacing":"self_paced",
"language":"English",
"availability":"Current",
"isFree":false,
"price":149.0,
"currency":"USD",
"enrollmentCount":333911,
"skills":["Python (Programming Language)","Data Science","Pandas"],
"courseType":"Course",
"imageUrl":"https://prod-discovery.edx-cdn.org/media/course/image/b3c02aea.jpg",
"courseUrl":"https://www.edx.org/learn/python/the-university-of-california-san-diego-python-for-data-science",
"sourceUrl":"https://www.edx.org/learn/python",
"scrapedAt":"2026-06-06T12:00:00+00:00",
"recordType":"course"
}

FAQ

What subjects are supported? All edX subjects are supported including Computer Science, Data Analysis, AI, Business & Management, Engineering, Healthcare, Math, Physics, Social Sciences, Language, and more.

Can I get courses by a specific university? Yes โ€” use mode=byUniversity with a slug like harvardx, mitx, berkeleyx, microsoft, ibm, googlecloud, etc.

Are free courses available? Many edX courses are free to audit. Use filterIsFree: true to return only free courses.

Does this include boot camps and degree programs? Yes โ€” edX hosts Boot Camps, MicroBachelors, MicroMasters, Professional Certificates, and full Degrees. Use filterCourseType to narrow down.

What is the daily test prefill?

{"mode":"search","searchQuery":"python programming","maxItems":5}

Data Source

edX embeds course data in its Next.js server-rendered HTML pages (RSC JSON payloads). This actor parses that embedded data without requiring authentication or API keys. Data is publicly accessible and does not require a proxy.

You might also like

EDX Discovery Scraper

getdataforme/edx-discovery-scraper

The EDX Discovery Scraper extracts detailed course data from EDX, including descriptions, pricing, and organization info, aiding market research and competitive analysis....

edX Scraper | University Courses and Programs

parseforge/edx-scraper

Extract edX course catalog data including title, university, instructors, level, duration, price, language, subject, prerequisites, and full description. Track MicroMasters, professional certificates, and degree programs for education analytics, lead generation, and market research.

edX Online Course Data Extractor

epctex/edx-scraper

Effortlessly scrape thousands of online courses from edX. Extract titles, images, details, owners, and all other course details. Customize your search with filters like language and more for precise results.

edX Scraper | All In One | $3 / 1k

fatihtahta/edx-scraper

Scrape edX into clean, structured course and program data. Capture titles, partners, descriptions, skills, level, language, pacing, duration, availability and enrollment signals. Perfect for curriculum research, catalog building, market analysis and competitive tracking.

EdX Course Scraper ๐ŸŽ“

shahidirfan/edx-course-scraper

Power your edtech insights with this ultimate EdX Course Scraper. Instantly extract detailed online course data, including syllabi, instructors, pricing, and reviews. Perfect for e-learning aggregators and market researchers. Streamline your education data collection today!

10

Edx Allcourse Details Spider

getdataforme/Edx-allcourse-details-spider

The Edx Allcourse Details Spider is an Apify Actor that scrapes comprehensive details on all edX courses, including titles, descriptions, partners, subjects, levels, and skills....

Coursera Scraper - Courses, Specializations & Certificates

thirdwatch/coursera-scraper

Scrape Coursera course listings by keyword. Get title, partner/university, rating, difficulty level, skills, duration, course type (course/specialization/professional-cert), and more.

Udemy Course Scraper - Search Courses by Keyword

thirdwatch/udemy-scraper

Scrape Udemy courses by keyword: title, headline, instructor(s), rating, enrollment count, level, duration, lectures, categories, and topics. No login required. Uses course landing pages for rich data.