VOOZH about

URL: https://apify.com/taroyamada/federal-register-digest

⇱ Federal Register Scraper Β· Apify


Pricing

from $8.00 / 1,000 results

Go to Apify Store

Federal Register Scraper

Extract daily government filings, proposed rules, and agency decisions from the Federal Register to build custom regulatory watchlists.

Pricing

from $8.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ naoki anzai

naoki anzai

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

Federal Register Rule & Notice Watch Scraper

Use this actor to monitor Federal Register rules, proposed rules, notices, agencies, and watch-term matches from the official API. Input agency slugs, document types, lookback windows, and watch terms; get source-linked digest rows for policy and regulatory monitoring. FederalRegister.gov provides an API view of Federal Register documents; verify important records with the linked official source before relying on them.

Store Quickstart

Run this actor with your target input. Results appear in the Apify Dataset and can be piped to webhooks for real-time delivery. Use dryRun to validate before committing to a schedule.

Key Features

  • πŸ›οΈ Government-sourced β€” Pulls directly from official agency feeds β€” no third-party aggregators
  • ⏱️ Timely digests β€” Daily/weekly rollups of new filings, rulings, or actions
  • πŸ” Keyword watchlists β€” Flag items matching your compliance/legal watch terms
  • πŸ“Š Structured metadata β€” Agency, date, docket, document type, link β€” all dataset-ready
  • πŸ“‘ Webhook alerts β€” Push to legal/compliance teams the moment new items match watchlist

Use Cases

WhoWhy
DevelopersAutomate recurring data fetches without building custom scrapers
Data teamsPipe structured output into analytics warehouses
Ops teamsMonitor changes via webhook alerts
Product managersTrack competitor/market signals without engineering time

Input

FieldTypeDefaultDescription
feedsarrayrequiredOne entry per agency/topic watch target. Each feed produces one summary digest row. Set agencySlug and documentTypes to
watchTermsstringβ€”Keywords, company names, or regulatory topics to flag in document titles and abstracts. Matching documents receive a wat
lookbackDaysinteger7Fetch documents published within this many days. Use 7–14 for recurring scheduled runs; widen to 30+ for initial discove
maxDocsPerFeedinteger50Upper bound on documents fetched per feed per run. Increase for broad discovery; keep low (50) for fast recurring digest
maxPagesPerFeedinteger5Hard page cap per feed to prevent runaway pagination. Each page fetches up to 100 documents.
deliverystring"dataset"dataset stores results in the Apify dataset. webhook posts the digest JSON to webhookUrl.
webhookUrlstringβ€”POST target for the digest payload. Leave empty for dataset delivery.
datasetModestring"all"all emits every feed digest row. action_needed emits only feeds with watch-term hits. new_only emits only feeds with doc

Input Example

{
"lookbackDays":7,
"maxDocsPerFeed":50,
"maxPagesPerFeed":5,
"delivery":"dataset",
"datasetMode":"all",
"snapshotKey":"federal-register-digest-state",
"federalRegisterApiUrl":"https://www.federalregister.gov/api/v1/documents.json",
"requestTimeoutSeconds":30,
"notifyOnNoNew":true,
"dryRun":false
}

Input Examples

Example: Single-target audit

{
"targets":[
"example-target-1"
],
"maxResultsPerTarget":30
}

Example: Bulk portfolio

{
"targets":[
"target-1",
"target-2",
"target-3"
],
"maxResultsPerTarget":50,
"snapshotKey":"federal-register-digest-state"
}

Example: Recurring delta watch

{
"targets":[
"target-1"
],
"snapshotKey":"federal-register-digest-state",
"emitChangedOnly":true
}

Output

FieldTypeDescription
metaobject
errorsarray
digestsarray
digests[].feedIdstring
digests[].feedNamestring
digests[].agencySlugsarray
digests[].documentTypesarray
digests[].checkedAttimestamp
digests[].statusstring
digests[].newDocCountnumber
digests[].totalDocCountnumber
digests[].changedSinceLastRunboolean
digests[].actionNeededboolean
digests[].recommendedActionstring
digests[].signalTagsarray
digests[].watchTermHitsarray
digests[].topDocTypesobject
digests[].documentsarray
digests[].errornull

Output Example

{
"meta":{
"generatedAt":"2024-02-15T09:00:00.000Z",
"now":"2024-02-15T09:00:00.000Z",
"lookbackDays":7,
"feedCount":2,
"totalDocs":5,
"newDocs":4,
"watchTermHitCount":2,
"actionNeededCount":1,
"snapshot":{
"key":"federal-register-digest-sample",
"loadedFrom":"local",
"savedTo":"local"
},
"warnings":[],
"executiveSummary":{
"overallStatus":"action_needed",
"brief":"1 feed(s) have watch-term hits requiring review.",
"watchTermHits":[
{
"term":"climate",
"docNumber":"2024-02974",
"title":"National Ambient Air Quality Standards for Particulate Matter",
"docType":"RULE",
"primaryAgency":"Environmental Protection Agency",
"publicationDate":"2024-02-07T00:00:00.000Z",
"htmlUrl":"https://www.federalregister.gov/documents/2024/02/07/2024-02974/national-ambient-air-quality-standards"
},
{
"term":"greenhouse",
"docNumber":"2024-02345",
"title":"Greenhouse Gas Reporting: Additions and Revisions",
"docType":"PRORULE",
"primaryAgency":"Environmental Protection Agency",
"publicationDate":"2024-02-12T00:00:00.000Z",
"htmlUrl":"https://www.federalregister.gov/documents/2024/02/12/2024-02345/greenhouse-gas-reporting"
}
],
"actionItems":[

API Usage

Run this actor programmatically using the Apify API. Replace YOUR_API_TOKEN with your token from Apify Console β†’ Settings β†’ Integrations.

cURL

curl-X POST "https://api.apify.com/v2/acts/taroyamada~federal-register-digest/run-sync-get-dataset-items?token=YOUR_API_TOKEN"\
-H"Content-Type: application/json"\
-d'{ "lookbackDays": 7, "maxDocsPerFeed": 50, "maxPagesPerFeed": 5, "delivery": "dataset", "datasetMode": "all", "snapshotKey": "federal-register-digest-state", "federalRegisterApiUrl": "https://www.federalregister.gov/api/v1/documents.json", "requestTimeoutSeconds": 30, "notifyOnNoNew": true, "dryRun": false }'

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("taroyamada/federal-register-digest").call(run_input={
"lookbackDays":7,
"maxDocsPerFeed":50,
"maxPagesPerFeed":5,
"delivery":"dataset",
"datasetMode":"all",
"snapshotKey":"federal-register-digest-state",
"federalRegisterApiUrl":"https://www.federalregister.gov/api/v1/documents.json",
"requestTimeoutSeconds":30,
"notifyOnNoNew": true,
"dryRun": false
})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

JavaScript / Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'YOUR_API_TOKEN'});
const run =await client.actor('taroyamada/federal-register-digest').call({
"lookbackDays":7,
"maxDocsPerFeed":50,
"maxPagesPerFeed":5,
"delivery":"dataset",
"datasetMode":"all",
"snapshotKey":"federal-register-digest-state",
"federalRegisterApiUrl":"https://www.federalregister.gov/api/v1/documents.json",
"requestTimeoutSeconds":30,
"notifyOnNoNew":true,
"dryRun":false
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Tips & Limitations

  • Run daily for active watchlists; weekly for passive monitoring.
  • Webhook delivery works well for compliance team Slack channels β€” include docket URL for 1-click access.
  • Use watchKeywords generously β€” false positives are cheap to triage, false negatives miss filings.
  • Pair with regulatory-change-monitor for cross-agency coverage.
  • Archive Dataset rows weekly for long-term compliance evidence retention.

FAQ

How far back does history go?

This actor monitors forward-only β€” new items since first run. For historical data, use the agency's own search tool.

What timezones are used?

All timestamps are UTC. Use your downstream pipeline to convert to agency-local time if needed.

Does it translate non-English content?

No β€” original language is preserved. Use downstream translation services if needed.

Is the data official?

Yes β€” sourced directly from official government websites and feeds. Not a third-party aggregator.

Can I use this for legal research?

For alerting and monitoring, yes. For litigation research, cross-verify with primary sources (agency websites) β€” this actor is a monitoring tool, not a legal database.

Related Actors

Government & Regulatory cluster β€” explore related Apify tools:

Cost

Pay Per Event:

  • actor-start: $0.01 (flat fee per run)
  • dataset-item: $0.003 per output item

Example: 1,000 items = $0.01 + (1,000 Γ— $0.003) = $3.01

No subscription required β€” you only pay for what you use.

⭐ Was this helpful?

If this actor saved you time, please leave a β˜… rating on Apify Store. It takes 10 seconds, helps other developers discover it, and keeps updates free.

Bug report or feature request? Open an issue on the Issues tab of this actor.

Compliance Guardrails

This actor is independently operated and has no source-agency affiliation, sponsorship, or approval. It uses public data from FederalRegister.gov API (https://www.federalregister.gov/developers/documentation/api/v1). Results are informational only and may be delayed, incomplete, amended, withdrawn, or superseded by the source. Verify all records with the linked official source before relying on them.

  • Use the documented API, not UI scraping.
  • Treat results as informational monitoring; link users to official document URLs/PDFs for verification.
  • Do not imply agency endorsement or legal advice.

You might also like

Federal Register Rules & Regulations Search

ryanclinton/federal-register-search

Search the US Federal Register for rules, proposed rules, notices, executive orders, and other presidential documents published by every federal agency.

Federal Register Monitor

george.the.developer/federal-register-monitor

Monitor the Federal Register for rules, proposed rules, notices, and presidential documents with normalized rows and full document detail.

Federal Register Scraper - Rules & Regulations API

pink_comic/federal-register-search

Scrape Federal Register rules, proposed rules, notices, and presidential documents by keyword, agency, date, CFR title/part, or comment deadline. Built for regulatory monitoring, compliance, lobbying, and policy research. No API key needed.

Federal Register Documents Scraper

compute-edge/federal-register-scraper

Extract U.S. Federal Register documents via the public API. Filter by query, document types (rules, notices, proposed rules), publication date, and agencies. Includes full document details, citations, and regulatory information.

Federal Register Intelligence

benthepythondev/federal-register-intelligence

Extract US federal regulations, rules, notices, and proposed regulations from the official Federal Register. Track regulatory changes affecting your industry, find public comment periods, monitor agency actions from EPA, SEC, FDA, and 400+ agencies.

Federal Register Crawler - US Regulatory Documents

jungle_synthesizer/federal-register-crawler

Crawl 800K+ regulatory documents from the Federal Register API. Extract rules, proposed rules, executive orders, and notices with titles, abstracts, agency info, CFR references, RIN numbers, and comment deadlines. Filter by agency, document type, date range, and significance.

πŸ‘ User avatar

BowTiedRaccoon

2