VOOZH about

URL: https://apify.com/paxiq/us-biz-filings-scraper

โ‡ฑ US Business Entity Filings โ€” New LLC & Corp Registrations Daily ยท Apify


๐Ÿ‘ US Business Entity Filings โ€” New LLC & Corp Registrations Daily avatar

US Business Entity Filings โ€” New LLC & Corp Registrations Daily

Pricing

Pay per usage

Go to Apify Store

US Business Entity Filings โ€” New LLC & Corp Registrations Daily

Scrape fresh LLC, Corp, and LP filings from US Secretary of State portals daily. NY, FL, CO, CT today โ€” all 50 states coming. Filter by filing date to get new businesses the moment they register. For sales leads, KYC, compliance, and due diligence.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

๐Ÿ‘ PaxIQ

PaxIQ

Maintained by Community

Actor stats

0

Bookmarked

33

Total users

8

Monthly active users

3 months ago

Last modified

Share

Get fresh business filings (LLC, Corp, LP, LLP, etc.) from US Secretary of State portals the moment they're registered โ€” ideal for sales prospecting, KYC, compliance monitoring, and competitive intelligence.

Why this actor? Every new business is a warm lead. Solar installers, insurance agents, accountants, lawyers, and SaaS companies all use new LLC registrations to find prospects before the competition does. Filter by filing date to pull only today's new registrations.

Coverage today: NY, FL, CO, CT (4 states, ~4,000+ new entities/day). All 50 states in progress.

Phase 1 States

StateSourcePlatformNotes
NYdata.ny.govSocrata SODADaily filing data
CTdata.ct.govSocrata SODABusiness registry
COsos.state.co.usHTML form scrapeDate-filter POST
FLdos.fl.gov/sunbizJSON APIsunbiz.org

Output Schema (14 fields)

FieldDescription
entity_nameBusiness name
entity_typeLLC, Corporation, LP, LLP, etc.
state2-letter state code
state_entity_idState's internal ID/entity number
filing_dateYYYY-MM-DD formation/filing date
statusActive, Inactive, Dissolved, etc.
registered_agentRegistered agent name
principal_addressPrincipal business address
agent_addressRegistered agent address
ownersComma-separated officer/member names
countyCounty if available
source_platformsocrata / portal_scrape / fl_sunbiz
source_urlPortal URL
scraped_atISO UTC timestamp

Apify Input

{
"states":["NY","FL"],
"startDate":"2025-01-01",
"endDate":"2025-01-31",
"maxResults":10000,
"socrataAppToken":"optional-token"
}
FieldTypeDefaultDescription
statesarrayall2-letter state codes to scrape
startDatestringyesterdayYYYY-MM-DD start date
endDatestringyesterdayYYYY-MM-DD end date
maxResultsinteger10000Max records per state
socrataAppTokenstringโ€”Socrata app token (boosts rate limits for NY/CT)

Local Development

# Install deps
pip install-r requirements.txt
# Run all configured states (yesterday's filings)
python src/main.py
# Run specific states
python src/main.py --states NY CT --start2025-01-01 --end2025-01-31
# Test individual scrapers
python src/socrata_scraper.py --state NY --start2025-01-01 --end2025-01-02 --max5
python src/colorado_scraper.py --start2025-01-01 --end2025-01-02 --max10
python src/florida_scraper.py --start2025-01-01 --end2025-01-02 --max10

File Structure

biz-filings/
โ”œโ”€โ”€ .actor/
โ”‚ โ”œโ”€โ”€ actor.json Apify actor metadata + dataset schema
โ”‚ โ””โ”€โ”€ input_schema.json Apify UI input form
โ”œโ”€โ”€ src/
โ”‚ โ”œโ”€โ”€ main.py Async Apify entry point
โ”‚ โ”œโ”€โ”€ router.py STATE_REGISTRY + scraper dispatch
โ”‚ โ”œโ”€โ”€ normalize.py Raw โ†’ 14-field normalized schema
โ”‚ โ”œโ”€โ”€ socrata_scraper.py NY + CT (Socrata SODA API)
โ”‚ โ”œโ”€โ”€ colorado_scraper.py CO (HTML form POST + table parse)
โ”‚ โ””โ”€โ”€ florida_scraper.py FL (sunbiz.org JSON API)
โ”œโ”€โ”€ Dockerfile apify/actor-python:3.11 base
โ”œโ”€โ”€ requirements.txt httpx, beautifulsoup4, apify
โ””โ”€โ”€ README.md

Architecture

main.py
โ””โ”€ router.py(STATE_REGISTRY dispatch)
โ”œโ”€ socrata_scraper.py โ†’ NY,CT
โ”œโ”€ colorado_scraper.py โ†’ CO
โ””โ”€ florida_scraper.py โ†’ FL
โ”‚
โ–ผ raw dicts
normalize.py โ†’ 14-field standard record
โ”‚
โ–ผ
Apify dataset / output/XX_filings.json

Rate Limiting

  • Between pages: 0.5s delay
  • Between states: 2.0s delay
  • Colorado detail pages: 0.3s delay per entity (fetch_details=True)

Adding New States (Phase 2)

  1. Add entry to STATE_REGISTRY in router.py
  2. Add a fetch_filings() function in a new <state>_scraper.py
  3. Add field aliases to normalize.py's pick() chains
  4. Test with python src/<state>_scraper.py --start ... --end ...

You might also like

CA Business Leads - SOS Entity Search

pink_comic/california-business-leads

Verify California companies via Secretary of State. Search any LLC, Corp, LP by name. Get formation date, status, officers, registered agent, addresses. Includes CSLB contractor license data. For sales teams, KYC compliance, and due diligence workflows. $0.002/result.

Florida Sunbiz Scraper - Business Entity & LLC Leads

pink_comic/sunbiz-florida-business-leads

Florida Sunbiz scraper for business entity filings, LLC/corp lookup data, registry verification, and new-company leads. Extract names, status, officers, registered agents, addresses, FEI/EIN fields, and filing dates from Florida Division of Corporations. For KYC, prospecting, and due diligence.

Colorado & Oregon Business Leads

harbinger/new-business-leads-scraper

Scrape new LLC, Corp & LP filings from Colorado & Oregon open-data APIs. Fresh B2B leads daily โ€” names, addresses, registered agents, filing dates. No proxies needed. Perfect for insurance agents, accountants, marketers, and sales teams.

๐Ÿ›๏ธ Business Registration Lookup โ€” State Filing Records

nexgendata/business-registration-lookup

Search U.S. state business registrations, corporate filings, LLC records and entity-status data. Verify businesses, surface officers and registered agents, and pull filing history for KYC, due diligence and sales-prospecting workflows. Returns structured JSON ready for downstream pipelines.

New Business Filings Scraper โ€” Daily LLC & Corporation Leads

4l3c/new-business-filings-scraper

Daily new LLC and corporation registrations from official state sources: Florida Sunbiz (with officers + registered agent), Colorado, Oregon, and New York. Fresh B2B leads for banks, insurance agents, accountants, and agencies. Official government data, no scraping fragility.

US Business Formation Scraper โ€” New LLC & Company Leads

scrapesage/us-business-formation-scraper

Scrape newly registered US businesses & LLCs from state open-data registries (CO, OR, CT, PA). Get business name, entity type, formation date, status, registered agent and address โ€” filter by date, type & keyword, with monitoring for only-new filings. Keyless, no browser.

Florida Business Leads

great_pistachio/florida-business-leads

Get fresh Florida business filings daily โ€” LLC/Corp registrations with officer names, addresses, registered agents, and filing details. Sourced directly from Florida Division of Corporations. Ideal for insurance agents, B2B sales, and lead generation.

๐Ÿ‘ User avatar

Saturnin Pugnet

28

Us Business Search

great_pistachio/us-business-search

Search official Secretary of State databases across US states. Look up businesses by name, get entity type, status, formation date, officers, registered agent, addresses. Supports NY and FL. Public government data.

๐Ÿ‘ User avatar

Saturnin Pugnet

67

5.0