VOOZH about

URL: https://apify.com/compute-edge/rfc-editor-scraper

⇱ IETF RFC Editor Index Scraper Β· Apify


Pricing

from $3.00 / 1,000 results

Go to Apify Store

IETF RFC Editor Index Scraper

Extract every published IETF RFC with metadata: title, authors, status, stream, obsoletes/updates relationships, DOI, and abstract. ~9,700 RFCs from RFC 1 to today, fully filterable.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Compute Edge

Compute Edge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Categories

Share

Extract structured metadata for every published IETF Request for Comments (RFC) β€” the foundational standards documents of the Internet, from RFC 1 (1969) through today's TLS, HTTP/3, OAuth, and DNS specifications. This Actor parses the official RFC Editor XML index and turns ~9,700 RFCs into clean, filterable JSON.

The IETF RFC corpus is the canonical source-of-truth for how the Internet works. Engineers, standards consultants, protocol researchers, compliance auditors, patent prosecutors, and AI agents reasoning about networking all need it as structured data β€” not a 20MB XML file. This Actor solves that.

Key Features

  • Every RFC β€” From RFC 1 to the latest publication, including obsoleted documents
  • Status & stream filters β€” Filter by Proposed Standard, Internet Standard, Best Current Practice, Informational, Experimental, Historic; or by stream (IETF / IAB / IRTF / Independent / Legacy)
  • Year range β€” Pull RFCs by publication year window
  • Keyword filter β€” Substring match across title, keywords, and abstract
  • Current-only mode β€” Skip RFCs that have been obsoleted by a newer document
  • Standards relationships β€” Captures obsoletes, obsoleted-by, updates, updated-by, and also-known-as (BCP/STD/FYI) cross-references
  • No authentication β€” Public IETF data source

Output Data Fields

FieldDescription
rfcNumberInteger RFC number
docIdDocument ID (e.g., RFC9293)
titleRFC title
authorsList of author names
month / yearPublication month and year
formatsAvailable file formats (ASCII, HTML, PDF, XML)
pageCountPage count
currentStatusCurrent status (Proposed Standard, Internet Standard, etc.)
publicationStatusOriginal publication status
streamPublishing stream (IETF, IAB, IRTF, Independent, Legacy)
doiDigital Object Identifier
keywordsAuthor-supplied keywords
abstractPlain-text abstract
obsoletesDoc-IDs of RFCs this one obsoletes
obsoletedByDoc-IDs of RFCs that obsolete this one
updatesDoc-IDs this one updates
updatedByDoc-IDs that update this one
alsoKnownAsBCP / STD / FYI document IDs
hasErrataWhether errata exist
rfcUrlCanonical rfc-editor.org URL

How to Scrape the RFC Index

  1. Open the RFC Editor Index Scraper on Apify Store
  2. (Optional) Filter by status, stream, year range, or keyword
  3. (Optional) Enable "Current RFCs Only" to skip obsoleted documents
  4. Click Start β€” every matching RFC is written to the default dataset

Pricing

This Actor uses pay-per-result pricing. A full ~9,700-RFC extract finishes in well under a minute since it parses a single XML file.

Use Cases

  • Standards tracking β€” Subscribe to a filter (e.g., new TLS RFCs) for compliance updates
  • Patent prior-art search β€” Filter by year/keyword to find pre-existing protocol disclosures
  • AI protocol assistants β€” Build RAG pipelines that answer questions about IETF standards
  • Vendor compliance audits β€” Confirm products implement current (not obsoleted) RFCs
  • Academic / education β€” Bulk metadata for citation tooling and curriculum design

Legal & Disclaimer

The IETF RFC series is published by the RFC Editor and is freely available under the IETF Trust Legal Provisions. This Actor reads the public rfc-index.xml from rfc-editor.org and parses it locally. No authentication is bypassed. Data is provided "as is" without warranty.

You might also like

RFC Editor Index Scraper

parseforge/rfc-editor-scraper

Export RFC documents from the RFC Editor index. Query 9,000+ Internet standards by RFC number, status, stream, or title keyword. Pull title, authors, status, stream, publish date, abstract, format URLs, obsoletes, updates.

IETF Datatracker Documents Scraper

parseforge/ietf-datatracker-drafts-scraper

Pull IETF Datatracker internet drafts and RFCs: document name, title, authors, abstract, working group, area, status, revision, dates, related drafts, and PDF or text URL. Export internet engineering standards to JSON, CSV, or Excel for protocol research and developer tooling.

Verificador RFC Mexico - SAT + Lista 69-B

leongael/verificador-rfc-mexico

Verify Mexican RFC tax IDs against SAT and check Lista 69-B (EFOS) blacklist. Batch support up to 100 RFCs. Returns status, taxpayer name, and blacklist flags. Essential for CFDI compliance and supplier verification.

βœ‰οΈ Bulk Email Validator

taroyamada/email-deliverability-checker

Verify email addresses using live DNS MX lookups and RFC syntax validation. Identify disposable providers and extract a 0-100 health score for any list.

SAT Mexico 69-B Taxpayer Blacklist Scraper

scrapers_lat/sat-69b-scraper

Extract Mexico SAT Lista 69-B (EFOS) blacklist of taxpayers presumed or confirmed to issue fake invoices. Scrape RFC, name, status (Presunto, Definitivo, Desvirtuado, Sentencia Favorable), oficio numbers and DOF publication dates, or screen any RFC for a clean or listed result.

2

5.0

Mexico RFC Validator | SAT Taxpayer ID Format Check

parseforge/mexico-rfc-scraper

Validate Mexican RFC taxpayer identifiers in bulk. Check format, embedded date, and homoclave checksum using the SAT modulo-11 algorithm for persona fisica and persona moral. Returns isValid plus per-check details. CSV, Excel, JSON, XML for KYC and CFDI invoice workflows.

arXiv Metadata Collectorβ€” Metadata, PDF, Authors & Abstract

scrapepilot/arxiv-metadata-collector---metadata-pdf-authors-abstract

Scrape arXiv research papers with metadata including title, authors, abstract, PDF links, DOI, and categories. Supports keyword search, proxy integration, and structured dataset output for AI, ML, and academic research use

arXiv Scraper

dami_studio/arxiv-scraper

Search arXiv via the official API and return structured paper metadata as JSON: title, abstract, authors, categories, DOI, dates, and abstract + PDF links. Best for literature reviews.

3

5.0

Email Validator β€” Bulk MX, Disposable & Score

khadinakbar/email-address-validator

Fast, cheap email list validator. RFC 5322 syntax, DNS MX lookup, disposable detection (3000+ domains), role & free-provider tagging, did-you-mean typo suggestions, 0–100 deliverability score. No external API key. ~$0.90 per 1,000 emails. MCP-ready.