VOOZH about

URL: https://apify.com/washed_fun/business-contact-extractor

⇱ Business Contact Extractor: AI-enabled Phones and Emails Β· Apify


Pricing

from $6.00 / 1,000 results

Go to Apify Store

Business Contact Extractor

Extract business emails and phone numbers from company websites. High-accuracy AI-enabled business contact extractor using smart crawling, LLM and AI extraction, heuristics, and PDF extraction. Finds emails/phones even on complex sites. Supports CSV bulk input and outputs clean, CRM-ready data.

Pricing

from $6.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Sue

Sue

Maintained by Community

Actor stats

5

Bookmarked

107

Total users

0

Monthly active users

6 months ago

Last modified

Share

The most thorough and accurate contact extractor available β€” powered by AI and built with enterprise-grade verification.

Extract verified business emails and phone numbers from company websites, even when contact info is buried in PDFs, hidden behind navigation menus, or scattered across multiple pages. This Actor combines smart multi-page crawling, strict multi-layer validation, and optional AI-powered extraction to deliver results that other scrapers simply miss.

Why AI Makes the Difference

When you provide a Gemini API key, this Actor uses LLM-powered extraction to:

  • Understand page context β€” AI reads the page like a human, identifying contact sections even on modern, JavaScript-heavy sites with minimal visible text
  • Extract from complex layouts β€” Finds emails and phones embedded in stylized designs, image-based text, or unconventional formatting
  • Validate intelligently β€” Cross-references extracted contacts against the page content to reject false positives
  • Achieve 100% email accuracy β€” In real-world benchmarks, LLM mode eliminated all email errors

Even without AI, this Actor runs 50+ validation rules on every email and phone number β€” rejecting hex IDs, placeholder text, vendor emails, invalid formats, and other junk that pollutes typical scraper output.

Ideal for lead generation, B2B prospecting, data enrichment, and CRM automation. Works as a business email scraper, website contact finder, and phone number extraction API.

Pricing

This Actor uses Apify's pay-per-event model:

Fee TypeCost
Actor Start$0.05 per run
Result$0.006 per domain

Example: Processing 100 domains costs approximately $0.65 ($0.05 start fee + $0.60 for 100 results).

LLM Cost (Optional)

The optional LLM feature uses Gemini 2.0 Flash, which has an extremely generous free tier. For most users, the LLM cost is completely free or negligible β€” typically just a few cents even for thousands of domains.

Get a free Gemini API key at https://aistudio.google.com/


Why This Scraper Is Different

Most contact scrapers only scan a single page or rely on simple pattern matching. That misses a huge amount of real business contact information.

This Actor is designed to be far more thorough and reliable, using a hybrid system that dramatically improves both coverage and accuracy:

πŸ” Smart Multi-Page Crawling

Automatically looks for:

  • Contact pages
  • About/team/support pages
  • Footer links
  • Auto-discovered subpages

This avoids the β€œhomepage only” limitation of basic scrapers.

🧠 AI-Powered Extraction + Strict Verification

Every contact goes through a multi-stage validation pipeline:

  • Format validation β€” Rejects malformed emails, hex IDs, UUID fragments, and datetime strings
  • Domain matching β€” Prioritizes emails matching the company's own domain
  • Vendor filtering β€” Excludes generic vendor emails (e.g., orders@toasttab.com)
  • Phone normalization β€” Converts all numbers to E.164 international format
  • Duplicate detection β€” Removes redundant entries across all pages

With the optional Gemini AI integration, the Actor can also:

  • Parse JavaScript-rendered content that basic scrapers miss
  • Understand semantic context to find contacts on unconventional page layouts
  • Cross-validate LLM findings against strict rules to eliminate hallucinations

πŸ“„ PDF Contact Extraction

Many companies hide contact details inside:

  • brochures
  • catalogs
  • downloadable spec sheets

This Actor automatically fetches and scans PDFs for emails and phone numbers β€” a major upgrade over typical HTML-only scrapers.

🧹 Enterprise-Grade Data Quality

This isn't just extraction β€” it's verification at scale. Every result passes through 50+ validation rules:

  • Rejects placeholder emails (test@, noreply@, example@)
  • Filters out vendor/third-party emails (toasttab, squarespace, wix)
  • Removes invalid phone patterns (hex IDs, tracking codes, dates)
  • Normalizes all phones to E.164 international format
  • Deduplicates across all crawled pages
  • Prioritizes brand-matching emails as the primary contact

The result: CRM-ready data you can trust, not a list of garbage to clean up manually.

πŸ“¦ Bulk CSV Upload

Upload a CSV of domains and process hundreds of websites in one run.


Performance (Based on Real-World Benchmarking)

Testing with 30 trade show exhibitor domains:

MetricWithout LLMWith LLM
Email accuracy92%100%
Email coverage80%83%
Phone coverage87%90%

These results are significantly higher than traditional scrapers.


Performance Guarantees & Crawling Limits

To ensure fast and reliable performance on the Apify platform, this Actor enforces strict limits:

LimitValueReason
Max pages per domain3Ensures fast completion and high health score
Per-domain timeout6 secondsPrevents slow sites from blocking the queue
Global run timeout5 minutesEnsures runs always complete within Apify limits

Important notes:

  • User requests for deeper crawling (>3 pages) are automatically capped for stability
  • PDF extraction is disabled by default because it significantly increases run times. Enable it via enablePdfExtraction: true if needed.
  • Each domain is time-limited to ~6 seconds, ensuring the Actor can process large batches efficiently

Recommended batch sizes: 50–250 domains per run for optimal performance.

This is not a full-site spider. The Actor is optimized for fast, targeted contact extraction from the homepage and key contact pages β€” not deep crawling entire websites.


Input Options

Option 1 β€” Domain List

{
"domains":["example.com","another-company.com"],
"maxPagesPerDomain":3,
"llmApiKey":"your-gemini-api-key"
}

Option 2 β€” CSV Upload

Upload a CSV with a domain, website, or url column:

domain
example.com
another-company.com
acme-corp.net

Parameters

ParameterTypeDescription
domainsarrayList of domains or URLs to scrape
csvFilefileCSV file with domain/website/url column
maxPagesPerDomainintegerMax pages to crawl per domain (default: 3, max: 3)
enablePdfExtractionbooleanEnable PDF extraction (default: false, slower but more thorough)
llmApiKeystringGemini API key for enhanced accuracy

Get a free Gemini API key at https://aistudio.google.com/


Output Format

Each domain produces one result:

{
"domain":"example.com",
"primary_email":"contact@example.com",
"primary_phone":"+14155551234",
"supplemental_emails":["sales@example.com","support@example.com"],
"supplemental_phones":["+14155555678"]
}

Output Fields

FieldDescription
domainDomain that was scraped
primary_emailBest email found (prefers brand domain)
primary_phoneBest phone found (E.164 format)
supplemental_emailsAll other valid emails found
supplemental_phonesAll other valid phones found

Usage Tips

  • Add a Gemini API key for maximum coverage and 100% email accuracy
  • Use CSV upload for large batches (50–250 domains recommended)
  • Enable enablePdfExtraction if contact info is often in PDF brochures/catalogs
  • Find results in the Dataset tab after the run completes

Limitations

  • Cannot extract contacts behind login walls
  • Cannot retrieve contacts locked behind form submissions

If you need high-quality business contact data at scale, this Actor provides the most robust and accurate extraction method available on Apify.

You might also like

Company Contact Enricher - Website to B2B Leads

alizarin_refrigerator-owner/company-contact-enricher

Transform company website URLs into enriched B2B contact data. Automatically scrapes team pages, detects email patterns, cross-references LinkedIn & identifies decision makers. - Website Scanning - Contact Extraction - Email Pattern Detection - LinkedIn Integration - Title Filtering - Webhooks

151

1.0

Decision Maker Name & Email Extractor

dominic-quaiser/decision-maker-name-email-extractor

An actor that crawls a website to identify key decision‑maker names and job titles, then uses NER‑powered matching to extract and pair their email addresses for streamlined lead generation and B2B data enrichment.

πŸ‘ User avatar

Dominic M. Quaiser

298

1.0

Ai Email Contact Finder

menus-r-us/ai-email-contact-finder

Find business emails, phone numbers, and social links from any website using AI. Unlike regex scrapers, we actually read the page β€” finding emails that pattern matchers miss. Works with any site. Feed URLs directly or enrich a Google Maps Scraper dataset. ~70-90% email hit rate.

45

5.0

Email Finder By Name

gp1991/email-finder-by-name

Find anyone's business email from their name and company domain. Tries common patterns (john.doe@, jdoe@, etc.) and verifies each via real SMTP - no stale database lookups. $1.00/1,000 leads.

127

Monster | $0.99πŸ’° | Search | Details | Scraper [/w EMAILS]

memo23/monster-scraper

πŸ’° $0.99 per 1,000 results. Collects Monster.com job data including IDs, titles, company metadata, salary ranges, employment types, posting dates, HTML descriptions, normalized locations, and apply URLs for recruitment analytics.

πŸ‘ User avatar

Muhamed Didovic

136

5.0

Email Finder

automation-lab/email-finder

Find business emails for any person given their name and company domain. Uses waterfall verification: website scraping, GitHub commits, Gravatar checks, and smart pattern detection.

πŸ‘ User avatar

Stas Persiianenko

37

Email Finder-Find Verified Emails by Name & Company Domain

davidsharadbhatt/email-finder-find-verified-emails-by-name-and-company-domain

Find professional emails instantly. Enter first name, last name, and company domain to get verified business emails for leads and outreach.

Us Business Search

great_pistachio/us-business-search

Search official Secretary of State databases across US states. Look up businesses by name, get entity type, status, formation date, officers, registered agent, addresses. Supports NY and FL. Public government data.

πŸ‘ User avatar

Saturnin Pugnet

68

5.0

Business Entity Search

openactor/business-entity-search

Business Entity Search delivers compliance-ready company profiles from UK, Australia, Canada, and US registries. Built for legal, risk, procurement, and revenue teams that need fast answers, clear evidence, and transparent pricing.

Contact Info Scraper with Emails and Phones

intelecta/fast-contact-info-scraper-with-emails

A powerful Apify actor that scrapes emails, phone numbers, and social media profiles from a list of websites, following internal links for thorough contact extraction. Ideal for lead generation, research, and building structured contact databases.

158

3.3