SEC EDGAR Scraper β Filings, Full-Text RAG & XBRL Financials
Pricing
from $1.00 / 1,000 filing metadata items
SEC EDGAR Scraper β Filings, Full-Text RAG & XBRL Financials
Extract SEC EDGAR filings metadata, full-text search results, section-parsed 10-K/10-Q text for RAG, and XBRL financial facts from official SEC public data. No API key required.
Pricing
from $1.00 / 1,000 filing metadata items
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
a day ago
Last modified
Categories
Share
Extract everything from SEC EDGAR in one Actor: company filings metadata, EDGAR full-text search, section-parsed RAG-ready full text (Item 1A Risk Factors, Item 7 MD&A, β¦), and XBRL financial facts β with no API key required.
At a glance
| Field | Details |
|---|---|
| Source | Official U.S. SEC EDGAR public data |
| Authentication | No API key required |
| Coverage | U.S. public company and registrant filings, metadata, full text, and XBRL facts |
| Best for | SEC filings scraper, 10-K/10-Q full text extraction, financial facts API, investment research, compliance monitoring, financial RAG datasets |
| Pricing | $0.001 metadata/search item, $0.005 full-text document, $0.0002 XBRL fact |
Features
| Feature | Description |
|---|---|
| π Filings metadata | All form types (10-K, 10-Q, 8-K, S-1, DEF 14A, Form 4, 13F, β¦) for any company, with document & exhibit links |
| π Full-text search | EDGAR full-text search (EFTS) across all filings since 2001 β phrases, form filters, date ranges |
| π§ RAG-ready full text | Full document text, parsed into Item sections and chunked for embeddings β feed it straight into your vector DB |
| π° XBRL financials | Structured financial facts (Revenues, NetIncomeLoss, EPS, Assets, β¦) with fiscal period metadata |
| π·οΈ Smart company lookup | Search by ticker, CIK, or company name β automatic CIK resolution |
| β±οΈ SEC-compliant rate limiting | Global throttle keeps requests safely under the SEC's 10 req/s limit |
Quick Start
Get Apple's latest annual and quarterly reports as RAG-ready chunks:
{"mode":"fulltext","ticker":"AAPL","formTypes":["10-K","10-Q"],"dateFrom":"2024-01-01","chunking":"paragraph","maxItems":5}
Modes
| Mode | What you get | Charged per |
|---|---|---|
filings | Filing metadata records for a company | item |
fulltext | Full document text + Item sections + chunks | document |
search | EDGAR full-text search hits | item |
facts | XBRL financial fact rows | fact |
Input
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
mode | string | yes | filings | filings | fulltext | search | facts |
ticker | string | one of ticker/cik/companyNameΒΉ | β | Stock ticker, e.g. AAPL |
cik | string | ΒΉ | β | SEC Central Index Key, e.g. 320193 |
companyName | string | ΒΉ | β | SEC registrant name (exact or partial) |
query | string | search mode | β | Full-text search expression; quote phrases: "supply chain disruption" |
formTypes | array | no | all | Form types to include, e.g. ["10-K", "10-Q"] |
dateFrom / dateTo | string | no | β | Filing date range (YYYY-MM-DD) |
chunking | string | no | section | section | paragraph (~2000 chars, best for embeddings) | none |
includeExhibits | boolean | no | false | Filings mode: list each filing's exhibit files |
factTags | array | no | all | Facts mode: restrict to XBRL tags, e.g. ["Revenues", "NetIncomeLoss"] |
dedupeFacts | boolean | no | true | Facts mode: collapse prior-period comparatives re-reported by later filings into one row per period & value |
maxItems | integer | no | 100 | Maximum items to save |
ΒΉ filings, fulltext and facts modes require a company; fulltext can alternatively run from a query.
Output
Example: fulltext item
{"itemType":"filing-fulltext","title":"Apple Inc. β 10-K 2025-10-31","company":"Apple Inc.","ticker":"AAPL","cik":"0000320193","formType":"10-K","filedAt":"2025-10-31","accessionNo":"0000320193-25-000123","documentUrl":"https://www.sec.gov/Archives/edgar/data/320193/...","sections":[{"name":"Item 1A β Risk Factors","charCount":38241},{"name":"Item 7 β Management's Discussion and Analysis","charCount":21077}],"chunks":[{"text":"The Company's business, reputation, results of operations...","section":"Item 1A β Risk Factors","order":12}],"textLength":220151}
Example: facts item
{"itemType":"fact","company":"Apple Inc.","taxonomy":"us-gaap","tag":"RevenueFromContractWithCustomerExcludingAssessedTax","unit":"USD","value":111184000000,"start":"2025-12-28","end":"2026-03-28","fiscalYear":2026,"fiscalPeriod":"Q2","form":"10-Q"}
Recipes
1. Build a RAG knowledge base from SEC filings
Pull two years of 10-K/10-Q full text for your watchlist, chunked for embeddings. Each chunk carries its Item section, so you can filter retrieval to Risk Factors or MD&A only.
{"mode":"fulltext","ticker":"NVDA","formTypes":["10-K","10-Q"],"dateFrom":"2024-01-01","chunking":"paragraph","maxItems":12}
2. Monitor risk-factor language across an industry
Full-text search for a phrase in recent annual reports β who is talking about it, and since when?
{"mode":"search","query":"\"supply chain disruption\"","formTypes":["10-K"],"dateFrom":"2025-01-01","maxItems":200}
3. Time-series financials without parsing a single PDF
Quarterly revenue, net income and diluted EPS as clean rows, ready for a spreadsheet or BI tool:
{"mode":"facts","ticker":"MSFT","factTags":["RevenueFromContractWithCustomerExcludingAssessedTax","NetIncomeLoss","EarningsPerShareDiluted"],"dateFrom":"2023-01-01","maxItems":500}
4. Track insider activity and ownership filings
{"mode":"filings","ticker":"TSLA","formTypes":["Form 4","SC 13D","SC 13G"],"dateFrom":"2026-01-01","maxItems":100}
Pricing
Pay-per-event β you only pay for what gets saved:
| Event | Price |
|---|---|
| Filing metadata / search hit | $0.001 per item |
| Full-text document (sections + chunks) | $0.005 per document |
| XBRL financial fact | $0.0002 per fact |
Example: a RAG corpus of 50 full 10-K documents costs $0.25. A 1,000-row financial time series costs $0.20.
FAQ
Do I need an SEC API key?
No. SEC EDGAR is a free public data source. The Actor identifies itself with a compliant User-Agent and stays under the SEC's rate limit automatically.
How far back does full-text search go?
EDGAR full-text search covers filings from 2001-05-04 onward. Filing metadata and XBRL facts go back much further (metadata to 1994).
Why are some filings skipped in fulltext mode?
Forms whose primary document is not HTML (e.g. XBRL-only Form 4 XML) can't be section-parsed and are skipped β you are not charged for them.
Why did facts mode return the same period twice (with dedupeFacts: false)?
XBRL filings restate prior-period comparatives β a Q2 10-Q also reports last year's Q2 column. With deduplication off you get one row per filing that reported the fact (accessionNo/filed tell you which one); with the default on, only the original disclosure is kept.
Which chunking should I use for embeddings?
paragraph produces ~2000-character chunks aligned to paragraph boundaries with section attribution β ideal for most embedding models. Use section if you want one chunk per Item section.
Legal & Compliance
This Actor accesses only public data published by the U.S. Securities and Exchange Commission, which mandates open access to EDGAR. Requests are rate-limited below SEC's published threshold (10 req/s) and carry an identifying User-Agent, per SEC developer guidelines. No personal data is collected.
