Wayback Machine Historical Content Scraper

Pricing

$3.99 / 1,000 results

Compare archived website snapshots through the Wayback Machine and extract page-history change signals.

Rating

4.0 (1)

Developer

Kelsey Todd

Maintained by Community

Actor stats

Last modified: 2 months ago

What changed

Replaced the Puppeteer-based crawl flow with direct Wayback Machine API calls.
Reduced runtime cost and flakiness by comparing archive snapshots with plain HTTP requests.
Added archive-span, title-change, and word-count change signals for each URL.
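The signals listed above can be illustrated with a minimal Python sketch. It is not the Actor's actual implementation, which this page does not show. It assumes the public Wayback Machine CDX endpoint and the 14-digit snapshot timestamp format; the function names and the output field names (`archiveSpanDays`, `titleChanged`, `wordCountDelta`) are hypothetical and chosen only for illustration.

```python
import re
from datetime import datetime
from html import unescape
from urllib.parse import urlencode

# Public CDX index endpoint of the Wayback Machine.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def cdx_query_url(page_url: str, limit: int = 50) -> str:
    """Build a CDX query that lists successful snapshots of page_url as JSON."""
    params = {
        "url": page_url,
        "output": "json",
        "fl": "timestamp,original,statuscode",
        "filter": "statuscode:200",
        "limit": limit,
    }
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def snapshot_url(timestamp: str, page_url: str) -> str:
    """URL of the archived page; the id_ suffix asks for the unmodified capture."""
    return f"https://web.archive.org/web/{timestamp}id_/{page_url}"


def archive_span_days(first_ts: str, last_ts: str) -> int:
    """Whole days between two 14-digit Wayback timestamps (YYYYMMDDhhmmss)."""
    fmt = "%Y%m%d%H%M%S"
    return (datetime.strptime(last_ts, fmt) - datetime.strptime(first_ts, fmt)).days


def extract_title(html: str) -> str:
    """Return the whitespace-normalized <title> text, or '' if there is none."""
    match = re.search(r"<title[^>]*>(.*?)</title>", html, re.I | re.S)
    return unescape(" ".join(match.group(1).split())) if match else ""


def word_count(html: str) -> int:
    """Rough count of words after dropping script/style blocks and tags."""
    text = re.sub(r"<(script|style)\b.*?</\1>", " ", html, flags=re.I | re.S)
    text = re.sub(r"<[^>]+>", " ", text)
    return len(unescape(text).split())


def change_signals(first: tuple[str, str], last: tuple[str, str]) -> dict:
    """Compare two (timestamp, html) snapshots; output keys are hypothetical."""
    (first_ts, first_html), (last_ts, last_html) = first, last
    return {
        "archiveSpanDays": archive_span_days(first_ts, last_ts),
        "titleChanged": extract_title(first_html) != extract_title(last_html),
        "wordCountDelta": word_count(last_html) - word_count(first_html),
    }
```

In this sketch the HTML for each snapshot would be fetched with a plain HTTP GET on `snapshot_url(...)`, using the timestamps returned by the CDX query, so no headless browser is involved. The regex-based tag stripping is a deliberate simplification; an HTML parser would be more robust on malformed pages.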

Best fit

SEO teams researching page history
Agencies comparing historical messaging
Competitor-monitoring workflows
Founders reconstructing how a site evolved over time

Wayback Machine Scraper

glassventures/wayback-machine-scraper

Scrape Wayback Machine archive snapshots for any URL or domain. Get archived URLs, timestamps, status codes, MIME types. Export to JSON, CSV, Excel.

Glass Ventures

Wayback Machine Scraper - Track Website Changes Over Time

ryanclinton/wayback-machine-search

Search the Internet Archive's Wayback Machine for historical snapshots of any website. Retrieve archived page metadata -- including timestamps, URLs, MIME types, HTTP status codes, and content hashes -- for up to 10,000 snapshots per run.

Ryan Clinton

Wayback Machine Search

crawlerbros/wayback-machine-search

Query Internet Archive's Wayback Machine for historical snapshots of any URL or domain. Filter by date, HTTP status, MIME type, and deduplicate. Optionally fetch the archived page text. Free public CDX API, no authentication.

Crawler Bros

Wayback Machine Search

maximedupre/wayback-machine-search

Search Wayback Machine snapshots for URLs, hosts, and domains. Export archive dates, status codes, MIME types, digests, content text, version timelines, reports, and monitoring alerts.

Maxime Dupré

Wayback Machine URL Extractor - Archived URLs

logiover/wayback-machine-url-extractor

Extract every archived URL of any domain from the Internet Archive's Wayback Machine (CDX API). Recover lost or old pages, build redirect maps and run OSINT, with date and status filters. No API key, export to CSV or JSON.

Logiover

Wayback Machine Scraper

gio21/wayback-machine-scraper

List Internet Archive Wayback Machine snapshots for one or more URLs. Returns timestamp, snapshot URL, HTTP status, MIME type, digest. Useful for tracking website changes over time, OSINT research, content recovery, and brand monitoring.

Gio

Wayback Machine Checker

automation-lab/wayback-machine-checker

This actor checks if URLs are archived in the Internet Archive Wayback Machine. It retrieves snapshot counts, oldest and newest archive dates, and direct links to archived versions. Uses both the Availability API and CDX API for comprehensive results.

Stas Persiianenko

Wayback Cdx Scraper

fortuitous_pirate/wayback-cdx-scraper

Scrape the Internet Archive Wayback Machine CDX index: find all archived snapshots of any URL with timestamps, HTTP status codes, and MIME types.

Fortuitous Pirate

Internet Archive & Wayback Machine Scraper

cloud9_ai/internet-archive-scraper

Search Internet Archive and check Wayback Machine snapshots. Access 800B+ archived pages, books, movies, audio. Search items, get metadata, or check URL archive history. No API key needed. For SEO, OSINT, legal, and research.

cloud9

Wayback Machine CDX Bulk Extractor

automation-lab/wayback-machine-cdx-extractor

Bulk extract archived snapshot metadata from the Wayback Machine CDX API. Get every crawled URL, timestamp, HTTP status code, MIME type, and content digest for any domain or URL pattern. Export to JSON, CSV, or Excel.

Stas Persiianenko

URL: https://apify.com/happyfhantum/wayback-machine-historical-content-scraper