URL: https://apify.com/autofacts/sec-edgar-scraper

SEC EDGAR Scraper: Filings, Full-Text RAG & XBRL Financials

Pricing

from $1.00 / 1,000 filing metadata items

Extract SEC EDGAR filings metadata, full-text search results, section-parsed 10-K/10-Q text for RAG, and XBRL financial facts from official SEC public data. No API key required.

Developer

Richard Feng

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: a day ago

Extract everything from SEC EDGAR in one Actor: company filings metadata, EDGAR full-text search, section-parsed RAG-ready full text (Item 1A Risk Factors, Item 7 MD&A, …), and XBRL financial facts, with no API key required.

At a glance

| Field | Details |
| --- | --- |
| Source | Official U.S. SEC EDGAR public data |
| Authentication | No API key required |
| Coverage | U.S. public company and registrant filings, metadata, full text, and XBRL facts |
| Best for | SEC filings scraper, 10-K/10-Q full text extraction, financial facts API, investment research, compliance monitoring, financial RAG datasets |
| Pricing | $0.001 per metadata/search item, $0.005 per full-text document, $0.0002 per XBRL fact |

Features

| Feature | Description |
| --- | --- |
| 📁 Filings metadata | All form types (10-K, 10-Q, 8-K, S-1, DEF 14A, Form 4, 13F, …) for any company, with document & exhibit links |
| 🔎 Full-text search | EDGAR full-text search (EFTS) across all filings since 2001: phrases, form filters, date ranges |
| 🧠 RAG-ready full text | Full document text, parsed into Item sections and chunked for embeddings, ready to load into your vector DB |
| 💰 XBRL financials | Structured financial facts (Revenues, NetIncomeLoss, EPS, Assets, …) with fiscal period metadata |
| 🏷️ Smart company lookup | Search by ticker, CIK, or company name, with automatic CIK resolution |
| ⏱️ SEC-compliant rate limiting | Global throttle keeps requests safely under the SEC's 10 req/s limit |

Quick Start

Get Apple's latest annual and quarterly reports as RAG-ready chunks:

{
  "mode": "fulltext",
  "ticker": "AAPL",
  "formTypes": ["10-K", "10-Q"],
  "dateFrom": "2024-01-01",
  "chunking": "paragraph",
  "maxItems": 5
}
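One way to run this input programmatically is through the Apify API client for Python. The sketch below assumes the `apify-client` package is installed, that your API token lives in an `APIFY_TOKEN` environment variable (a name chosen here for illustration), and that the Actor ID matches the store URL (`autofacts/sec-edgar-scraper`). The `run_actor` helper is hypothetical, not part of the Actor.

```python
import os

ACTOR_ID = "autofacts/sec-edgar-scraper"  # assumed from the store URL

QUICK_START_INPUT = {
    "mode": "fulltext",
    "ticker": "AAPL",
    "formTypes": ["10-K", "10-Q"],
    "dateFrom": "2024-01-01",
    "chunking": "paragraph",
    "maxItems": 5,
}


def run_actor(run_input, token=None):
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported lazily so this module loads even without apify-client installed.
    from apify_client import ApifyClient

    client = ApifyClient(token or os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (needs network access and a valid token):
#   for item in run_actor(QUICK_START_INPUT):
#       print(item.get("title"), len(item.get("chunks", [])))
```

Keeping the input as a plain dict means the same object can be pasted into the Apify Console as JSON.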

Modes

| Mode | What you get | Charged per |
| --- | --- | --- |
| filings | Filing metadata records for a company | item |
| fulltext | Full document text + Item sections + chunks | document |
| search | EDGAR full-text search hits | item |
| facts | XBRL financial fact rows | fact |

Input

| Field | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| mode | string | yes | filings | One of filings, fulltext, search, facts |
| ticker | string | one of ticker/cik/companyName¹ | - | Stock ticker, e.g. AAPL |
| cik | string | ¹ | - | SEC Central Index Key, e.g. 320193 |
| companyName | string | ¹ | - | SEC registrant name (exact or partial) |
| query | string | search mode | - | Full-text search expression; quote phrases: "supply chain disruption" |
| formTypes | array | no | all | Form types to include, e.g. ["10-K", "10-Q"] |
| dateFrom / dateTo | string | no | - | Filing date range (YYYY-MM-DD) |
| chunking | string | no | section | One of section, paragraph (~2000 chars, best for embeddings), none |
| includeExhibits | boolean | no | false | Filings mode: list each filing's exhibit files |
| factTags | array | no | all | Facts mode: restrict to XBRL tags, e.g. ["Revenues", "NetIncomeLoss"] |
| dedupeFacts | boolean | no | true | Facts mode: collapse prior-period comparatives re-reported by later filings into one row per period & value |
| maxItems | integer | no | 100 | Maximum items to save |

¹ filings, fulltext and facts modes require a company; fulltext can alternatively run from a query.
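The requirement rules in the table and footnote can be checked before you start a run. The following is a minimal sketch of such a pre-flight check; `validate_input` is a hypothetical helper written from the rules documented here, and the Actor's own validation may differ.

```python
VALID_MODES = {"filings", "fulltext", "search", "facts"}
VALID_CHUNKING = {"section", "paragraph", "none"}
COMPANY_KEYS = ("ticker", "cik", "companyName")


def validate_input(run_input):
    """Return a list of problems found in an input dict (empty list = looks OK)."""
    problems = []
    mode = run_input.get("mode", "filings")  # documented default
    if mode not in VALID_MODES:
        return [f"unknown mode: {mode!r}"]

    has_company = any(run_input.get(key) for key in COMPANY_KEYS)
    has_query = bool(run_input.get("query"))

    if mode == "search" and not has_query:
        problems.append("search mode needs a query")
    if mode in ("filings", "facts") and not has_company:
        problems.append(f"{mode} mode needs ticker, cik or companyName")
    if mode == "fulltext" and not (has_company or has_query):
        problems.append("fulltext mode needs a company or a query")
    if run_input.get("chunking", "section") not in VALID_CHUNKING:
        problems.append("chunking must be section, paragraph or none")
    return problems
```

Returning a list instead of raising on the first error lets you report every problem with an input at once.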

Output

Example: fulltext item

{
  "itemType": "filing-fulltext",
  "title": "Apple Inc. \u2014 10-K 2025-10-31",
  "company": "Apple Inc.",
  "ticker": "AAPL",
  "cik": "0000320193",
  "formType": "10-K",
  "filedAt": "2025-10-31",
  "accessionNo": "0000320193-25-000123",
  "documentUrl": "https://www.sec.gov/Archives/edgar/data/320193/...",
  "sections": [
    {"name": "Item 1A \u2014 Risk Factors", "charCount": 38241},
    {"name": "Item 7 \u2014 Management's Discussion and Analysis", "charCount": 21077}
  ],
  "chunks": [
    {"text": "The Company's business, reputation, results of operations...", "section": "Item 1A \u2014 Risk Factors", "order": 12}
  ],
  "textLength": 220151
}
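Because every chunk names its section, you can narrow a filing to one Item before embedding it. The sketch below works on the item shape shown above; the helper names and the `accessionNo:order` ID scheme are illustrative assumptions (it also assumes `order` is unique within a filing), and the second chunk's text is a placeholder.

```python
def chunks_for_section(item, section_prefix):
    """Return chunks whose section name starts with the prefix, in document order."""
    matching = [
        chunk for chunk in item.get("chunks", [])
        if chunk.get("section", "").startswith(section_prefix)
    ]
    return sorted(matching, key=lambda chunk: chunk.get("order", 0))


def to_embedding_records(item, section_prefix):
    """Pair each chunk's text with metadata useful for retrieval filtering."""
    return [
        {
            "id": f"{item['accessionNo']}:{chunk['order']}",  # hypothetical ID scheme
            "text": chunk["text"],
            "metadata": {
                "ticker": item["ticker"],
                "formType": item["formType"],
                "filedAt": item["filedAt"],
                "section": chunk["section"],
            },
        }
        for chunk in chunks_for_section(item, section_prefix)
    ]


sample_item = {
    "ticker": "AAPL",
    "formType": "10-K",
    "filedAt": "2025-10-31",
    "accessionNo": "0000320193-25-000123",
    "chunks": [
        {"text": "(placeholder MD&A text)",
         "section": "Item 7 \u2014 Management's Discussion and Analysis", "order": 40},
        {"text": "The Company's business, reputation, results of operations...",
         "section": "Item 1A \u2014 Risk Factors", "order": 12},
    ],
}
records = to_embedding_records(sample_item, "Item 1A")
```

Matching on the `Item 1A` prefix avoids depending on the exact punctuation of the section name.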

Example: facts item

{
  "itemType": "fact",
  "company": "Apple Inc.",
  "taxonomy": "us-gaap",
  "tag": "RevenueFromContractWithCustomerExcludingAssessedTax",
  "unit": "USD",
  "value": 111184000000,
  "start": "2025-12-28",
  "end": "2026-03-28",
  "fiscalYear": 2026,
  "fiscalPeriod": "Q2",
  "form": "10-Q"
}
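Fact rows like this one group naturally into one time series per XBRL tag. Here is a minimal sketch, assuming each row carries the `itemType`, `tag`, `end` and `value` fields shown above; `facts_to_series` is a hypothetical helper, and the sample rows use placeholder values and a made-up non-fact row.

```python
from collections import defaultdict


def facts_to_series(rows):
    """Group fact rows into {tag: [(period_end, value), ...]} sorted by period end."""
    series = defaultdict(list)
    for row in rows:
        if row.get("itemType") != "fact":
            continue  # ignore any other item types in a mixed dataset
        series[row["tag"]].append((row["end"], row["value"]))
    # ISO dates (YYYY-MM-DD) sort chronologically as plain strings.
    return {tag: sorted(points) for tag, points in series.items()}


# Placeholder rows for illustration only; the values are not real financials.
rows = [
    {"itemType": "fact", "tag": "NetIncomeLoss", "end": "2025-06-30", "value": 2},
    {"itemType": "fact", "tag": "NetIncomeLoss", "end": "2025-03-31", "value": 1},
    {"itemType": "other", "title": "not a fact row"},
]
series = facts_to_series(rows)
```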

Recipes

1. Build a RAG knowledge base from SEC filings

Pull two years of 10-K/10-Q full text for your watchlist, chunked for embeddings. Each chunk carries its Item section, so you can filter retrieval to Risk Factors or MD&A only.

{
  "mode": "fulltext",
  "ticker": "NVDA",
  "formTypes": ["10-K", "10-Q"],
  "dateFrom": "2024-01-01",
  "chunking": "paragraph",
  "maxItems": 12
}

2. Monitor risk-factor language across an industry

Run a full-text search for a phrase in recent annual reports to see who is talking about it, and since when.

{
  "mode": "search",
  "query": "\"supply chain disruption\"",
  "formTypes": ["10-K"],
  "dateFrom": "2025-01-01",
  "maxItems": 200
}
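To answer "who, and since when" from the results, you can reduce the hits to the earliest matching filing per company. The search-hit schema is not documented on this page, so the sketch below assumes each hit carries `company` and `filedAt` fields like the fulltext item does; `first_mentions` is a hypothetical helper and the sample hits are placeholders.

```python
def first_mentions(hits):
    """Return (company, earliest filing date) pairs, oldest first mention first."""
    earliest = {}
    for hit in hits:
        company, filed = hit["company"], hit["filedAt"]  # assumed field names
        # ISO dates compare chronologically as strings.
        if company not in earliest or filed < earliest[company]:
            earliest[company] = filed
    return sorted(earliest.items(), key=lambda pair: pair[1])


# Placeholder hits for illustration only.
hits = [
    {"company": "Company B", "filedAt": "2025-03-01"},
    {"company": "Company A", "filedAt": "2025-02-10"},
    {"company": "Company B", "filedAt": "2025-01-15"},
]
mentions = first_mentions(hits)
```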

3. Time-series financials without parsing a single PDF

Quarterly revenue, net income and diluted EPS as clean rows, ready for a spreadsheet or BI tool:

{
  "mode": "facts",
  "ticker": "MSFT",
  "factTags": ["RevenueFromContractWithCustomerExcludingAssessedTax", "NetIncomeLoss", "EarningsPerShareDiluted"],
  "dateFrom": "2023-01-01",
  "maxItems": 500
}

4. Track insider activity and ownership filings

{
  "mode": "filings",
  "ticker": "TSLA",
  "formTypes": ["Form 4", "SC 13D", "SC 13G"],
  "dateFrom": "2026-01-01",
  "maxItems": 100
}

Pricing

Pay-per-event: you only pay for what gets saved.

| Event | Price |
| --- | --- |
| Filing metadata / search hit | $0.001 per item |
| Full-text document (sections + chunks) | $0.005 per document |
| XBRL financial fact | $0.0002 per fact |

Example: a RAG corpus of 50 full 10-K documents costs $0.25. A 1,000-row financial time series costs $0.20.
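The arithmetic behind these examples is a price-times-count sum over the three event types. Below is a small estimator using the prices listed above; `estimate_cost` is a hypothetical helper, and it ignores any platform usage fees that Apify may bill separately.

```python
from decimal import Decimal

# Per-event prices in USD, copied from the pricing table.
PRICES = {
    "item": Decimal("0.001"),      # filing metadata record or search hit
    "document": Decimal("0.005"),  # full-text document
    "fact": Decimal("0.0002"),     # XBRL fact row
}


def estimate_cost(items=0, documents=0, facts=0):
    """Return the estimated charge in USD for the given saved-event counts."""
    return (
        PRICES["item"] * items
        + PRICES["document"] * documents
        + PRICES["fact"] * facts
    )


print(f"${estimate_cost(documents=50):.2f}")  # $0.25
print(f"${estimate_cost(facts=1000):.2f}")    # $0.20
```

`Decimal` keeps the sub-cent prices exact, which plain floats would not.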

FAQ

Do I need an SEC API key?

No. SEC EDGAR is a free public data source. The Actor identifies itself with a compliant User-Agent and stays under the SEC's rate limit automatically.
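If you call SEC endpoints from your own code as well, the same idea applies: space requests so that you stay under 10 per second. The following is a minimal sketch of one common approach, a minimum-interval limiter; it is not the Actor's actual implementation, and the `Throttle` class and its parameters are illustrative.

```python
import time


class Throttle:
    """Space calls at least 1/max_per_second seconds apart."""

    def __init__(self, max_per_second, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self._clock = clock    # injectable for testing
        self._sleep = sleep    # injectable for testing
        self._next_allowed = 0.0

    def wait(self):
        """Block until the next request slot is free, then reserve it."""
        now = self._clock()
        if now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            now = self._next_allowed
        self._next_allowed = now + self.min_interval


# Usage: stay a little under the limit, e.g. 8 requests per second.
#   throttle = Throttle(8)
#   throttle.wait()  # call before each HTTP request
```

Choosing a rate below 10 leaves headroom for clock jitter and retries.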

How far back does full-text search go?

EDGAR full-text search covers filings from 2001-05-04 onward. Filing metadata goes back further, to 1994. XBRL facts are available only from about 2009, when the SEC began phasing in XBRL reporting.

Why are some filings skipped in fulltext mode?

Forms whose primary document is not HTML (e.g. a Form 4 filed as XML) can't be section-parsed and are skipped. You are not charged for them.

Why did facts mode return the same period twice (with dedupeFacts: false)?

XBRL filings repeat prior-period comparatives: a Q2 10-Q also reports last year's Q2 column. With deduplication off you get one row per filing that reported the fact (accessionNo/filed tell you which one); with the default on, only the original disclosure is kept.
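If you run with dedupeFacts off and want to collapse the rows yourself, the idea can be sketched as: group by tag, unit, period and value, then keep the earliest-filed row in each group. This is an illustration of the behavior described above, not the Actor's code; `dedupe_facts` is a hypothetical helper, and it assumes each row has a `filed` date in YYYY-MM-DD form. The sample rows are placeholders.

```python
def dedupe_facts(rows):
    """Keep one row per (tag, unit, period, value): the earliest-filed disclosure."""
    kept = {}
    for row in rows:
        key = (row["tag"], row["unit"], row.get("start"), row["end"], row["value"])
        current = kept.get(key)
        # ISO dates compare chronologically as strings.
        if current is None or row["filed"] < current["filed"]:
            kept[key] = row
    return sorted(kept.values(), key=lambda row: (row["tag"], row["end"]))


# Placeholder rows: the same Q2 value reported by two different filings.
rows = [
    {"tag": "Revenues", "unit": "USD", "start": "2024-04-01", "end": "2024-06-30",
     "value": 100, "filed": "2025-08-01", "form": "10-Q"},
    {"tag": "Revenues", "unit": "USD", "start": "2024-04-01", "end": "2024-06-30",
     "value": 100, "filed": "2024-08-01", "form": "10-Q"},
]
unique_rows = dedupe_facts(rows)
```

Including the value in the key means a later filing that reports a changed figure for the same period stays as its own row.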

Which chunking should I use for embeddings?

paragraph produces ~2000-character chunks aligned to paragraph boundaries with section attribution, which suits most embedding models. Use section if you want one chunk per Item section.
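To make "aligned to paragraph boundaries" concrete, here is a rough sketch of greedy paragraph packing. It is not the Actor's chunker: `chunk_paragraphs` is a hypothetical function that assumes paragraphs are separated by blank lines, and a single paragraph longer than the target simply becomes one oversized chunk.

```python
def chunk_paragraphs(text, target_chars=2000):
    """Greedily pack whole paragraphs into chunks of roughly target_chars."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], ""
    for paragraph in paragraphs:
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if current and len(candidate) > target_chars:
            # Adding this paragraph would overflow: close the current chunk.
            chunks.append(current)
            current = paragraph
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


# Three 900-character paragraphs: the first two fit together, the third starts a new chunk.
text = "\n\n".join(["a" * 900] * 3)
sizes = [len(chunk) for chunk in chunk_paragraphs(text)]
```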

Legal & Compliance

This Actor accesses only public data published by the U.S. Securities and Exchange Commission, which mandates open access to EDGAR. Requests are rate-limited below SEC's published threshold (10 req/s) and carry an identifying User-Agent, per SEC developer guidelines. No personal data is collected.
