ImportYeti Scraper - US Customs Importer & Supplier Data
Pricing
from $2.50 / 1,000 results
ImportYeti Scraper - US Customs Importer & Supplier Data
Pull structured US sea-shipment, importer, and supplier records from ImportYeti.com. Returns company addresses, country-of-origin breakdowns, top HS codes, monthly shipment history, trademarks, and tags.
Pricing
from $2.50 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
11
Total users
5
Monthly active users
4 days ago
Last modified
Categories
Share
π’ ImportYeti Scraper β US Customs Importer & Supplier Data
π€ Structured supply-chain intelligence in seconds. Pull every public datapoint from ImportYeti β importer addresses, country-of-origin breakdowns, monthly shipment history, top HS codes, trademarks and tags β straight into a JSON / CSV / Excel dataset.
β¨ Why this scraper
- β‘οΈ Fast β concurrent extraction, 5 records / second under default settings
- π¦ Complete β every visible field from each company/supplier page in one flat record
- π·οΈ Both sides of the trade β US importer "companies" and their foreign "supplier" counterparts
- π Country-of-origin rollups β pre-aggregated per country & continent
- π Monthly history β shipments, weight (kg) and TEU broken down per month
- π Incremental mode β
since:parameter skips records older than your last run - π‘οΈ Cloudflare-resilient β browser-fingerprint impersonation + your own residential/mobile proxy (via env var)
- π§Ύ 3 export formats β JSON, CSV, Excel
π Quick start
- Click Try for free
- Type a search term (a company name, brand, product, or keyword) β or paste a list of
/company/<slug>//supplier/<slug>URLs - Hit Start and grab a coffee
- Download the result as JSON, CSV, or Excel from the Storage tab
π οΈ Input
| Field | Type | Description |
|---|---|---|
query | string | Free-text search term (brand, importer, HS-code keyword) |
entityType | enum | company (US importer), supplier (foreign manufacturer), or both |
startUrls | string[] | Optional explicit list of /company/<slug> or /supplier/<slug> URLs |
maxItems | integer | Hard cap on records; 0 = unlimited (default 50) |
since | datetime | Skip records whose most recent shipment is older than this ISO timestamp |
concurrency | integer | Parallel HTTP requests (default 5, max 25) |
fetchDetails | boolean | true (default) enriches each result from its detail page. false = list-only mode: search-surface fields only, no detail-page traffic (records flagged partial: true). |
π Proxy: set via the
PROXY_URL(orHTTPS_PROXY/HTTP_PROXY) environment variable on the actor β every request is routed through it. A residential / mobile proxy is recommended (ImportYeti is behind Cloudflare and blocks datacenter IPs). Format:http://user:pass@host:port.
πΈ Saving proxy traffic
Residential/mobile proxies bill by the gigabyte, and detail pages are the cost driver. The actor minimises bytes for you, and you can trade depth for cost:
| Lever | How | Effect |
|---|---|---|
| RSC fetch (automatic) | Detail pages are pulled as the Next.js RSC payload, brotli-compressed | ~34 KB/record on the wire vs ~60 KB for full HTML β ~44% less, no data loss |
| List-only mode | fetchDetails: false | Skips detail pages entirely β ~99% less traffic; keeps name, address, country, totals, trademarks |
| Incremental | since: <last run ISO> | Old records are filtered from the search seed before any detail fetch β repeat runs only pay for new data |
| Cap volume | maxItems, narrower query/entityType | Fewer detail fetches = fewer bytes |
Tip for recurring monitoring: run
fetchDetails: falseto list what's changed cheaply, then re-runfetchDetails: truewithstartUrlsfor just the records you care about.
π€ Sample output
{"url":"https://www.importyeti.com/company/apple","id":"company/apple","scraped_at":"2026-05-10T20:00:10Z","type":"company","title":"Apple","address":"568 Aldi Blvd, Mount Juliet, Tn 37122, Us","country_code":"US","phone_number":"XXX-XXX-X000","website":"apple.com","other_names_count":2,"other_addresses_count":89,"most_recent_shipment":"01/21/2026","total_sea_shipments":2449,"total_shipping_cost":101746.81,"shipping_cost_coverage":2.69,"avg_teu_per_month":0.39,"avg_teu_per_shipment":0.77,"database_updated":"05/06/2026","multi_address":true,"dataset":"us","uflpa":false,"location":{"state":"Tennessee","county":"Wilson County","city":"Mount Juliet","district":"Mount Juliet"},"tags":[{"tag":"puter","shipments":617},{"tag":"computer","shipments":617},{"tag":"lithium","shipments":388}],"trademarks":[{"name":"Apple","trademarks_count":1620}],"imports_per_country":[{"country":"China","continent":"Asia","shipments":2292},{"country":"Hong Kong","continent":"Asia","shipments":62},{"country":"Vietnam","continent":"Asia","shipments":37},{"country":"Germany","continent":"Europe","shipments":11}],"top_hs_codes":["8504.40","8517.62","8471.30","8544.42","8473.30"],"shipments_time_series":[{"period":"01/01/2024","shipments":32,"weight":89421,"teu":84},{"period":"01/02/2024","shipments":41,"weight":117850,"teu":102}]}
πΌ Use cases
| Who | What for |
|---|---|
| Procurement teams | Find alternative suppliers for components shipped from specific countries |
| B2B sales | Build prospect lists by HS code, trade volume, or country of origin |
| Market research | Quantify competitor sourcing strategies β who buys what, from where, how often |
| Investment analysts | Track import volume as a leading indicator for retail / consumer-goods companies |
| Logistics & freight | Identify high-volume lanes and consolidation opportunities |
| Trade compliance | Screen suppliers against UFLPA and other forced-labour flags |
π‘ Tips & tricks
- Free-text queries match titles, addresses and aliases β
"apple"returns 27 hits across companies and suppliers, not just Apple Inc. - For company-only datasets, set
entityType: "company". Suppliers are foreign manufacturers and have a different shape (no trademarks, often no website). - The
sincefilter uses the most recent shipment date on each record β perfect for nightly incremental runs. - Want comparable competitors? Run one query per company name, then merge β each result carries a
tagsarray you can use for affinity clustering. - Large runs (10k+ records) benefit from
concurrency: 10and unlimitedmaxItems: 0.
β FAQ
Q: What data is in each record? A: Every field shown in the sample above β importer name, primary and alternate addresses, phone, website, total sea-shipment count, country-of-origin breakdown, top HS codes, monthly time series, trademarks and keyword tags.
Q: How fresh is the data?
A: ImportYeti's database update timestamp travels with each record in the database_updated field, so you always know how recent the underlying customs data is.
Q: Can I get individual shipment-level records? A: This actor returns the public aggregated view (count + breakdowns + monthly history). Individual bill-of-lading records aren't part of the public page surface.
Q: Will this work for non-US companies? A: Yes β the dataset itself is US sea imports, so US-side companies are importers and foreign-side companies are suppliers. Both show up.
Q: How are records de-duplicated?
A: By the canonical id field (company/<slug> or supplier/<slug>) β the same company appearing on multiple search pages or via multiple startUrls is pushed once.
Q: What happens if a search slug doesn't have a real page? A: ImportYeti's search occasionally surfaces orphan slugs. The actor detects and silently skips them β they don't count against your run.
π Output
Three checkmarks live in the Storage tab β the dataset is exportable as JSON, CSV, and Excel out of the box. Use the Open in Apify Console link for an interactive table view with filtering and sort.
π Found a bug or want a field that's missing? Open an issue from the actor page β we ship fixes fast.
