VOOZH about

URL: https://apify.com/codescraper/website-emails-and-socials-scraper

⇱ Website Contact Email & Social Links Scraper Β· Apify


πŸ‘ Website Emails And Socials Scraper avatar

Website Emails And Socials Scraper

Pricing

$3.99/month + usage

Go to Apify Store

Website Emails And Socials Scraper

Only $3.99/month β€” Extract verified contact emails and social media links from any website. Scans up to 20 contact-related pages per url with smart detection. Returns structured data: URL info, prioritized emails, social profiles, source pages, counts, and timestamps perfect for B2B lead generation

Pricing

$3.99/month + usage

Rating

5.0

(2)

Developer

πŸ‘ CodeScraper

CodeScraper

Maintained by Community

Actor stats

2

Bookmarked

12

Total users

1

Monthly active users

4 months ago

Last modified

Share

⭐ Website Contact Email & Social Scraper – B2B Lead Generation

This Apify actor scrapes contact email addresses and up to 8 official social media profile links from business websites with high accuracy and intelligent filtering.

It utilizes network request interception, smart contact page detection, and advanced social profile filtering to extract emails and official social accounts from homepage DOMs, API responses, and specific "Contact Us" pages β€” all packaged into a structured dataset ideal for lead generation and outreach.


πŸš€ What It Does

For every website URL provided, the actor extracts:

🏒 Site Overview

  • 🌐 Input URL (As entered)
  • πŸ”— Normalized URL (Standardized format)
  • πŸ“§ Total Emails Found
  • 🌍 Social Media Profiles (Up to 8 platforms)
  • ⏱️ Processing Duration
  • πŸ“… Scrape Timestamp

πŸ“© Email Data

For each email found:

βœ‰οΈ Email Address

🚦 Priority Classification

  • High: info@, contact@, support@ (or matches domain name)
  • Medium: sales@, jobs@, press@
  • Low: Personal/Specific names

πŸ”— Source URL (Exact page where the email was found)


🌐 Social Media Data (Up to 8 Platforms)

The actor intelligently extracts one best official profile link per platform, filtering out share links, login pages, and post URLs.

Supported Platforms

  • πŸ“˜ Facebook
  • 🐦 Twitter / X
  • πŸ“Έ Instagram
  • πŸ’Ό LinkedIn
  • ▢️ YouTube
  • πŸ“Œ Pinterest
  • 🎡 TikTok
  • πŸ’¬ Discord

Smart Filtering Rules

  • Ignores share links (e.g., intent/tweet, share.php)
  • Ignores login pages
  • Ignores post/video URLs (e.g., Instagram reels, YouTube watch links)
  • Selects the cleanest and shortest valid profile URL
  • Deduplicates automatically
  • Returns only one best candidate per platform

If a platform is not found, it simply will not appear in the socials object.


⚑ It Handles

  • βœ… Multiple Website URLs (Batch processing)
  • πŸ•΅οΈ Smart Contact Page Detection (Finds /contact, /about, /support etc.)
  • πŸ•ΈοΈ Network Interception (Catches emails hidden in API JSON responses)
  • 🌐 Social Profile Extraction (Up to 8 platforms)
  • πŸ›‘οΈ Domain Scoping (Ignores 3rd party emails like gmail.com unless matched)
  • 🧹 Deduplication (Ensures unique emails & social links per domain)
  • 🚦 Intelligent Prioritization (Sorts emails by relevance)

⚠️ The actor crawls the Homepage and up to 20 candidate contact pages per domain to ensure deep coverage without getting stuck.


🧠 How It Works

  • Normalizes input URLs to ensure valid formats.
  • Crawls the homepage to extract visible emails.
  • Intercepts background network requests (XHR/Fetch) to find hidden emails in JSON data.
  • Detects & navigates to "Contact Us" or "About Us" pages automatically.
  • Extracts and filters official social media profile links.
  • Classifies emails by priority (High/Medium/Low).
  • Deduplicates results to provide a clean structured dataset.
  • Saves structured data to Apify Dataset.

βš™οΈ Input Configuration

FieldTypeDescriptionExample
startUrlsArrayList of target websites["example.com"]
maxConcurrencyIntegerMax pages processed in parallel (Default: 10)10
maxRequestRetriesIntegerRetries per page on failure (Default: 2)2

🧩 Example Input

{
"startUrls":["https://www.iana.org","apify.com","https://inoleds.com"],
"maxConcurrency":10,
"maxRequestRetries":2
}

πŸ“Š Example Output

{
"inputUrl":"https://friendlyglitter.de/",
"normalizedUrl":"https://friendlyglitter.de",
"emails":[
{
"email":"info@glitzer-tattoo.net",
"priority":"high",
"source":"https://friendlyglitter.de/pages/impressum"
}
],
"socials":{
"facebook":"https://www.facebook.com/YbodyGlitzerTattoo/",
"instagram":"https://www.instagram.com/friendly.glitter",
"youtube":"https://www.youtube.com/channel/UC8S3IUHg1xozhVe84rK-Xaw"
},
"emailCount":1,
"processTimeMs":35013,
"scrapedAt":"2026-02-16T08:14:26.284Z"
}

🧠 Features

  • ⭐ Strict Domain Matching (Only extracts emails belonging to the target website)
  • πŸ“Š Priority Sorting (Get the most relevant contact info first)
  • 🌐 Extracts Up To 8 Official Social Profiles
  • 🧹 Automatic Deduplication (Emails & Social Links)
  • πŸ•’ Fast Execution (Optimized navigation limits)
  • 🌍 Universal Support (Works on .com, .de, .org, and all TLDs)
  • πŸ“¦ Structured JSON Output

πŸ’‘ Use Cases

  • B2B Lead Generation
  • Sales Outreach Automation
  • Influencer Outreach
  • Brand Research
  • Market Research
  • Recruitment & HR Sourcing
  • Directory Building

❓ FAQs

1. Does this scrape generic emails like Gmail or Yahoo?

No. The scraper enforces Strict Domain Matching. If you scrape example.com, it will only return emails ending in @example.com to ensure lead quality.


2. How does it prioritize emails?

It uses a prefix classification system:

  • High: Generic contacts (info, hello) or matches the domain name.
  • Medium: Department specific (sales, hr).
  • Low: Everything else.

3. Does it return social share links?

No. The actor filters out share links, login pages, and post URLs. It returns only official profile URLs.


4. Why didn't it find an email?

Some sites protect emails using:

  • Images instead of text
  • Complex JavaScript obfuscation (though network interception catches many of these)
  • Contact forms without visible email addresses

πŸ§‘β€πŸ’» Developer Info

Author: codescraper Email: codescraper011@gmail.com


🏷️ Tags

website-emails-scraper Β· social-media-scraper Β· email-scraper Β· contact-extractor Β· lead-generation Β· b2b-leads Β· crawler Β· web-scraping Β· marketing-automation Β· sales-tools

You might also like

Website Emails Scraper(Contact+OutReach)

codescraper/website-contact-email-scraper

Only $3/month β€” Get verified contact emails from any website. Scans up to 20 contact pages per domain. Returns structured data: URL info, prioritized emails, source pages, counts, and timestamps. Perfect for B2B lead generation, outreach, and prospecting.

31

5.0

Website Email & Contact Extractor: Lead Generation Tool

scrapepilot/website-email-contact-extractor

Extract emails, phone numbers and social media links from any website. Auto-scans homepage plus contact and about pages. Returns verified leads with LinkedIn, Twitter, Instagram profiles. Perfect for B2B outreach and lead generation.

19

3.0

Extract Emails Contacts Socials From Any Website

scrapio/extract-emails-contacts-socials-from-any-website

βœ‰οΈ Extract emails, phones, contact pages & social profiles from any website. πŸ”Ž Lightning-fast email extractor & contact scraper for lead gen, sales outreach, and research. πŸš€ Clean, structured export to power your CRM and campaigns.

Contact Details Scraper – Emails, Phone Numbers & Social Media

davidsharadbhatt/socialprofilescrapper

Extract verified emails, phone numbers, and social media profiles from any website using this Contact Details Scraper. Perfect for lead generation, sales outreach, and business data collection. Automatically find contact info, LinkedIn, Twitter, and company profiles from multiple domains with ease.

87

1.0

Website Lead Extractor, Emails, Phones & Social Profiles

george.the.developer/website-contact-scraper

Extract contact information from any website. Finds emails, phone numbers, social media profiles, and contact forms automatically. Perfect for building prospect lists, lead generation, and sales outreach. Handles JavaScript rendered pages.

25