VOOZH about

URL: https://apify.com/timo.sieber/website-lead-scraper

⇱ Website Contact Scraper - AI-Powered Lead Finder Β· Apify


πŸ‘ Website Contact Scraper - AI-Powered Lead Finder avatar

Website Contact Scraper - AI-Powered Lead Finder

Pricing

$55.00 / 1,000 results

Go to Apify Store

Website Contact Scraper - AI-Powered Lead Finder

AI-powered website scraper that extracts real contact data from company sites! Finds people, positions, emails & phone numbers using LLM technology. Scans team pages, contact sections & company info. Perfect for B2B lead generation and sales research.

Pricing

$55.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Timo Sieber

Timo Sieber

Maintained by Community

Actor stats

8

Bookmarked

60

Total users

3

Monthly active users

4 months ago

Last modified

Share

LLM-Guided Corporate Website Scraper

An advanced Apify actor that uses LLMs (Large Language Models) to identify and extract high-value business contact information from corporate websites.

πŸš€ Overview

This scraper goes far beyond traditional crawling. It:

  • Uses GPT (OpenAI) to intelligently rank internal URLs based on their relevance to contact data
  • Maximizes content extraction, including hidden and modal content
  • Parses and validates contact fields using LLMs and custom regex preprocessing
  • Aggregates data across multiple pages for higher confidence

πŸ’‘ Key Features

  • 🧰 LLM-based URL Evaluation: Scores and selects only the most promising URLs per domain
  • πŸ” Maximum Content Extraction: Scrapes visible and hidden elements, emails, phone numbers, and text sections
  • πŸ”§ Custom Prompt Engineering: Tailored prompts for URL scoring and field extraction
  • πŸ“Š Smart Aggregation: Merges multiple extractions into one confident, enriched result per domain
  • πŸšͺ Resilient Parsing: Handles edge cases, malformed responses, and fallback scoring
  • βœ… GDPR-friendly Proxy Support: With optional German residential proxies

βš™οΈ Input

This actor expects the following input:

{
"urls":["https://example.com"],
"openaiApiKey":"sk-...",
"maxRequests":50,
"useProxy":true,
"enableUrlEvaluation":true,
"aggregateResults":true,
"includeExtendedFields":true,
"costLimit":1.0
}

πŸ”„ Workflow

  1. Main page is loaded
  2. LLM evaluates internal links for contact relevance
  3. Top N URLs are crawled (contact, impressum, team, etc.)
  4. Content is extracted (even from modals, hidden fields, footers)
  5. Text is preprocessed for LLM efficiency
  6. LLM parses the data into a structured JSON object
  7. Data is validated, weighted, and aggregated into one high-confidence result

🌐 Output Format

Each record pushed to the dataset contains:

{
"executive_name":"Max Mustermann",
"executive_title":"GeschΓ€ftsfΓΌhrer",
"company_email":"info@example.com",
"company_phone":"+41 44 123 45 67",
"company_address":"Musterstrasse 1, 8000 ZΓΌrich",
"confidence_score":0.92,
"sources":[...],
"aggregated_from_pages":6,
"domain":"example.com"
}

πŸ“ˆ Performance & Cost

  • Average ~40 websites for 0.07 $ (at gpt-3.5-turbo rates)
  • Each domain result is based on up to 8 evaluated subpages
  • Internal cost tracking included

πŸ” Notes

  • Requires valid OpenAI API key (gpt-3.5-turbo)
  • Proxy use is optional, but recommended for stable scraping
  • Works well for DE/CH/Austria-based companies (Impressum detection)

πŸšͺ Limitations

  • Not optimized for dynamic SPAs
  • Some LLM responses may still need fallback handling (included)

🚧 Future Improvements

  • Add multilingual prompt switching (based on targetLanguage input)
  • Upgrade to gpt-4-turbo for more robust data quality
  • Add custom scoring model for aggregation weighting

🌟 Created by Timo Sieber β€” for smarter, LLM-powered scraping at scale.

You might also like

Website Email Phone Finder

scrapier/website-email-phone-finder

πŸ”Ž Website Email & Phone Finder finds and verifies contact details from any website or domain β€” emails, phone numbers & contact pages. πŸš€ Ideal for B2B prospecting, lead gen, and outreach. Bulk search, validation, CSV export & API.

Website Phone Number Contact Finder

scraper-engine/website-phone-number-contact-finder

Website Phone Number Contact Finder scans websites to extract publicly listed phone numbers. Build accurate contact lists from company sites at scale. Ideal for sales teams, marketers, and agencies running outbound calling campaigns.

πŸ‘ User avatar

Scraper Engine

27

Contact Details Scraper – Emails, Phone Numbers & Social Media

davidsharadbhatt/socialprofilescrapper

Extract verified emails, phone numbers, and social media profiles from any website using this Contact Details Scraper. Perfect for lead generation, sales outreach, and business data collection. Automatically find contact info, LinkedIn, Twitter, and company profiles from multiple domains with ease.

86

1.0

Website Email & Contact Extractor: Lead Generation Tool

scrapepilot/website-email-contact-extractor

Extract emails, phone numbers and social media links from any website. Auto-scans homepage plus contact and about pages. Returns verified leads with LinkedIn, Twitter, Instagram profiles. Perfect for B2B outreach and lead generation.

17

3.0

B2B Website Contact & Company Intelligence Extractor (CRM-Ready

adinfosys-labs/b2b-website-contact-company-intelligence-extractor-crm-ready

Extract emails, phone numbers, and social links from thousands of websites. Automatically scans contact pages and returns clean, export-ready contact data.

πŸ‘ User avatar

Artashes Arakelyan

37

5.0

Website Contact Finder

prodiger/website-contact-finder

Website emails scraper and contact finder for lead generation. Extract email addresses, phone numbers, social profiles, and optional email verification from company websites in bulk. CRM-ready output, no browser required.