Wayback Machine Scraper

Pricing

from $1.00 / 1,000 snapshots scraped

Lists Internet Archive Wayback Machine snapshots for one or more URLs. Returns the timestamp, snapshot URL, HTTP status, MIME type, and digest of each snapshot. Useful for tracking website changes over time, OSINT research, content recovery, and brand monitoring.

Rating

0.0

(0)

Developer

Gio

Maintained by Community

Actor stats

Last modified: 24 days ago

Free vs. paid

Free plan: returns mock records for each URL.
Paid plan: returns real, live Wayback Machine data.

Input

Field	Type	Description
`urls`	Array (required)	List of URLs to look up.
`from`	String	Start date filter (`YYYY`, `YYYYMMDD`, or `YYYYMMDDhhmmss`).
`to`	String	End date filter, in the same formats as `from`.
`maxSnapshotsPerUrl`	Integer	Maximum number of snapshots returned per URL. Default 50, max 1000.
`debug`	Boolean	Enables verbose logs.
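
As a rough illustration of the fields above, the following Python sketch assembles an input object and checks the date filters against the three documented shapes. The helper names `is_valid_date_filter` and `build_input` are hypothetical and not part of the actor. The lower bound of 1 for `maxSnapshotsPerUrl` and the assumption that `to` accepts the same formats as `from` are guesses, not documented behavior.

```python
import re

# Matches the three documented shapes: YYYY, YYYYMMDD, YYYYMMDDhhmmss.
# This is a shape check only; it does not verify that the date exists.
_DATE_FILTER = re.compile(r"\d{4}(\d{4}(\d{6})?)?")


def is_valid_date_filter(value: str) -> bool:
    """Return True if value consists of exactly 4, 8, or 14 digits."""
    return bool(_DATE_FILTER.fullmatch(value))


def build_input(urls, date_from=None, date_to=None, max_snapshots=50, debug=False):
    """Assemble an input dict using the field names from the table above."""
    if not urls:
        raise ValueError("urls is required and must not be empty")
    # Assumption: the minimum is 1; only the default (50) and max (1000) are documented.
    if not 1 <= max_snapshots <= 1000:
        raise ValueError("maxSnapshotsPerUrl must be between 1 and 1000")
    # Assumption: `to` accepts the same formats as `from`.
    for name, value in (("from", date_from), ("to", date_to)):
        if value is not None and not is_valid_date_filter(value):
            raise ValueError(f"{name} must be YYYY, YYYYMMDD, or YYYYMMDDhhmmss")

    actor_input = {
        "urls": list(urls),
        "maxSnapshotsPerUrl": max_snapshots,
        "debug": debug,
    }
    # Optional filters are included only when provided.
    if date_from is not None:
        actor_input["from"] = date_from
    if date_to is not None:
        actor_input["to"] = date_to
    return actor_input
```

For example, `build_input(["apify.com"], date_from="2021", date_to="20211231")` would produce an object that scopes the lookup to snapshots from 2021.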

Output

```json
{
  "url": "apify.com",
  "timestamp": "20210105141317",
  "snapshotUrl": "https://web.archive.org/web/20210105141317/apify.com",
  "originalUrl": "https://apify.com/",
  "statusCode": "200",
  "mimeType": "text/html",
  "digest": "QPBSADYPYQEHJ4NTAXNCLN7QHFFROZHU",
  "length": 158034
}
```
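
In the sample record, `timestamp` and `statusCode` are strings while `length` is a number. A minimal Python sketch for normalizing such a record might look like the following. The function names are hypothetical. Treating the timestamp as UTC follows the Wayback Machine convention, and the handling of non-numeric status codes is a defensive assumption rather than documented actor behavior.

```python
from datetime import datetime, timezone


def parse_timestamp(ts: str) -> datetime:
    """Convert a 14-digit YYYYMMDDhhmmss timestamp into an aware UTC datetime."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def parse_status(status: str):
    """Return the status code as an int, or None if it is not numeric."""
    return int(status) if status.isdigit() else None


def normalize(record: dict) -> dict:
    """Return a copy of the record with typed timestamp and status fields."""
    return {
        **record,
        "timestamp": parse_timestamp(record["timestamp"]),
        "statusCode": parse_status(record["statusCode"]),
    }


sample = {
    "url": "apify.com",
    "timestamp": "20210105141317",
    "snapshotUrl": "https://web.archive.org/web/20210105141317/apify.com",
    "originalUrl": "https://apify.com/",
    "statusCode": "200",
    "mimeType": "text/html",
    "digest": "QPBSADYPYQEHJ4NTAXNCLN7QHFFROZHU",
    "length": 158034,
}

clean = normalize(sample)
print(clean["timestamp"].isoformat())  # 2021-01-05T14:13:17+00:00
print(clean["statusCode"])             # 200
```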

Pricing

$0.001/snapshot. 1,000 snapshots = $1.
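
To budget a run, the per-snapshot price can be combined with `maxSnapshotsPerUrl` to get an upper bound. The sketch below assumes that every URL returns its full quota and that only returned snapshots are billed. Neither assumption is confirmed by the listing, and `estimate_max_cost` is a hypothetical helper.

```python
from decimal import Decimal

# Price taken from the line above: $0.001 per snapshot.
PRICE_PER_SNAPSHOT = Decimal("0.001")


def estimate_max_cost(num_urls: int, max_snapshots_per_url: int = 50) -> Decimal:
    """Upper-bound cost in USD, assuming every URL returns its full quota."""
    return PRICE_PER_SNAPSHOT * num_urls * max_snapshots_per_url


# 20 URLs at the default of 50 snapshots each is at most 1,000 snapshots.
print(estimate_max_cost(20))  # 1.000
```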

Limitations

The Wayback Machine's CDX server has soft rate limits (about 1 request per second). The actor waits 400 ms between URL queries.
For very popular URLs, the number of snapshots can run into the millions. Use `from`/`to` to narrow the date range.
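
For readers curious how a scoped, throttled lookup against the public CDX server could look, here is a minimal Python sketch. It is not the actor's implementation. The endpoint and the `url`, `from`, `to`, `limit`, and `output` query parameters belong to the public Wayback CDX server API, while `build_cdx_query` and `throttled` are hypothetical helpers. The code only builds query URLs and paces iteration; it performs no network requests.

```python
import time
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(url, date_from=None, date_to=None, limit=50):
    """Build a CDX query URL scoped by an optional date range and a row limit."""
    params = {"url": url, "output": "json", "limit": limit}
    if date_from:
        params["from"] = date_from
    if date_to:
        params["to"] = date_to
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def throttled(items, delay_seconds=0.4, sleep=time.sleep):
    """Yield items one by one, pausing between consecutive items.

    The 0.4 s default mirrors the 400 ms delay described above.
    """
    for index, item in enumerate(items):
        if index > 0:
            sleep(delay_seconds)
        yield item


for target in throttled(["apify.com", "example.com"]):
    print(build_cdx_query(target, date_from="2021", date_to="2022", limit=10))
```

Passing `sleep` as a parameter keeps the pacing logic easy to check without actually waiting.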

If this actor helped you, please leave a review on the Apify Store.

URL: https://apify.com/gio21/wayback-machine-scraper
