👁 ✨ German Imprint Scraper & Leads Finder (Google Search) avatar

✨ German Imprint Scraper & Leads Finder (Google Search)

Pricing

from $3.90 / 1,000 successful data extractions

👁 ✨ German Imprint Scraper & Leads Finder (Google Search)

✨ German Imprint Scraper & Leads Finder (Google Search)

AI-powered Apify Actor that scrapes Impressum pages on German websites and extracts decision-makers (Geschäftsführer, Vorstand), validated B2B emails, company addresses, VAT IDs, and Handelsregister numbers. Structured JSON output for B2B lead generation, sales prospecting, and CRM enrichment.

Pricing

from $3.90 / 1,000 successful data extractions

Rating

5.0

(4)

Developer

👁 Winning Solutions

Winning Solutions

Maintained by Community

Actor stats

Bookmarked

152

Total users

Monthly active users

15 hours

Issues response

17 days ago

Last modified

German Impressum Scraper & Leads Finder – Decision Makers, Email Verification & Company Data

An AI-powered Apify Actor that finds and scrapes German website Impressum pages to extract contact details at scale. Start from your own URL list or simply enter Google search terms - the built-in Lead Finder turns queries like marketing agency berlin into ready-to-scrape leads automatically.

It detects imprint links, navigates pages reliably, and returns structured JSON with company name, all decision-makers (Geschäftsführer, Vorstand, Inhaber, ...), address, phone, email (with optional validation), social media profile URLs (Facebook, Instagram, LinkedIn, and more), VAT ID, and Handelsregister numbers.

Designed for lead generation, CRM enrichment, and compliance workflows, the scraper ensures high data accuracy through intelligent page detection, robust parsing logic, and optional email validation. It supports batch processing, handles edge cases like non-standard layouts, and includes retry mechanisms for scalable data extraction.

🚀 New in version 2.0: Lead Finder

No URL list? No problem.

Just type Google search terms like marketing agency berlin or steuerberater münchen and the Actor turns them into ready-to-use leads. It pulls the organic Google results for each query, visits every website, and extracts the full Impressum contact data automatically - from keyword to verified B2B lead in a single run.

Use Cases

Generate B2B leads with verified contact data
Enrich your CRM with German company information
Build outreach lists with validated emails
Run market and competitor research
Automate data collection workflows

Index

Release Notes

v2.3.1 - fixed an issue where the actor could time-out when scraping 800+ domains

v2.3 - Social media link extraction

New social_media_links field - an object with keys facebook, instagram, linkedin, xing, youtube, twitter, tiktok, pinterest, whatsapp. Each key holds the first profile URL found on the homepage or imprint page; empty string when not found.
New disableSocialMediaLinks input flag - set to true to omit social_media_links from each dataset row.

v2.2.1 - fixed an issue where the actor could time-out

v2.2 - All decision makers extracted

New decision_makers array - every responsible person found in the imprint (Geschäftsführer, Gesellschafter, Vorstand, Vertretungsberechtigte, Inhaber, Ansprechpartner, ...) is now returned as a structured array. Each entry contains first_name, last_name, salutation, academic_title (e.g. Dr., Prof. Dr.), profession (e.g. Steuerberater, Rechtsanwalt), and roles (array of section headings that person appears under).
Backward compatible - the existing singular contact_person field is unchanged and always contains the highest-priority person (Geschäftsführer → Verantwortlicher → Ansprechpartner → Inhaber).
New disableDecisionMakers input flag - set to true to omit the decision_makers array from output rows.
New All Decision Makers dataset view - shows the full array per company in the Apify Console.

v2.1 - Domain blacklist for lead discovery

Domain blacklist - new domainBlacklist input field (comma-separated domains) for Google Search lead discovery mode; matching discovered URLs are still charged (website-processed) but not scraped or written to the dataset

🚀 v2.0 - Lead discovery via Google Search

Lead discovery via Google Search - new inputMode: searchTerms mode to discover URLs from Google queries, then scrape their imprints automatically
New input fields - searchTerms, resultsPerSearchTerm, locationName, languageCode, device
New output field - search_term on every row (null when using Target URLs mode)
New pay-per-event - serp-search-page at $0.005 per SERP page ($5.00 / 1,000 pages, approximately 10 results per page)

v1.3 - API and MCP readiness

API and MCP usage guide - new README section with copyable curl, apify-client (JavaScript), and Apify MCP examples
Improved schema descriptions - input, output, and dataset descriptions sharpened for programmatic and AI tool callers; no field names or runtime behavior changed

v1.2 - Faster processing & smarter AI analysis

Improved parallel processing - Actor is now 50% faster
Improved imprint analysis with fallbacks - LLM is always used for the best result
Added "PerformanceMax" option - for faster processing (requires at least 2 GB RAM)
Improved log formatting
**imprint_url is null when not found** - if the imprint URL is not found, it will now show as null
New imprint_status field - indicates the outcome of imprint URL discovery (FOUND, NOT_FOUND, FETCH_ERROR, URL_ERROR, ERROR, UNKNOWN)

v1.1 - Bug fixes & reliability improvements

Reliability fixes - graceful charge-limit handling under concurrency, input-schema cleanup, and several scraper edge-case fixes
Improved AI analysis reliability - more robust handling of LLM responses with multi-provider fallback
Slimmer Build image - faster cold starts and smaller pulls

🎉 v1.0 - Initial public release

AI-powered imprint detection - finds Impressum links even when traditional parsing fails
Intelligent scraping engine - automatically switches between a lightweight fast mode and a full browser mode depending on the website's complexity
Comprehensive email validation - syntax, MX records, disposable filter, role-based detection, alias resolution, typo suggestions and live status tracking
Per-field privacy toggles - disable any output field you don't need (e.g. disableEmail, disableVatId)
Pay-per-event pricing - only pay for processed websites and (optionally) validated emails

Lead Discovery with Google Search

Instead of providing URLs manually, you can let the Actor discover leads automatically via Google Search. Set Input Mode to Google Search Terms and enter one search query per line (e.g. zahnarzt berlin, rechtsanwalt münchen). The Actor fetches organic Google results for each term and scrapes the imprint of every discovered website.

Each SERP page returned (approximately 10 organic results per page) is billed as one Google SERP page (Lead Discovery) pay-per-event charge of $0.005 per page ($5.00 / 1,000 pages) on top of the usual imprint-scraping fees.

Features

🤖 AI-Powered Link Detection: Uses AI to find imprint page links when traditional parsing fails

🧠 Smart Contact Extraction: Extracts comprehensive contact and company information

🇩🇪 German Website Optimization: Specifically optimized for German website structures and imprint pages

🛡️ Robust Error Handling: Built-in retry mechanisms and comprehensive error handling

📊 Detailed Output: Structured data with contact person, company details, and legal information

📱 Social Media Profiles: Extracts profile URLs for nine platforms from the homepage and imprint page

🌐 Proxy Support: Optional proxy configuration for enhanced reliability

Advanced Email Validation

When enabled, the Actor performs comprehensive email validation using multiple verification methods:

Syntax validation - Ensures proper email format
Domain existence check - Verifies the domain is active and reachable
MX record validation - Confirms the domain can receive emails
Disposable email detection - Filters out temporary/disposable email addresses
Role-based email detection - Identifies generic addresses (info@, admin@, etc.)
Email alias detection - Detects aliases for major providers (Gmail, Yahoo, Outlook/Hotmail)
Typo suggestions - Provides corrections for common email typos
Real-time monitoring - Live validation status tracking

This ensures you only get high-quality, deliverable email addresses for your lead generation campaigns.

Pricing

Cost item	Rate
Successful data extraction	$4.50 / 1,000 results
Website processing fee	$0.60 / 1,000 websites
Email validation service	$2.50 / 1,000 validations
Google SERP page (Lead Discovery)	$5.00 / 1,000 SERP pages (~10 results each)
Actor start	$0.04 (infrequent)
Apify platform compute (RAM/time)	Billed by Apify platform pricing

Cost per lead: ~$0.005 - stays constant regardless of run size.

Email validation is billed separately at $0.0025 per validated address when validateEmail is enabled.

Cost Examples

Scenario A: 100 URLs, ~90 successful imprints

Actor start: $0.04
100 websites processed: $0.06
90 successful results: $0.41
Total: ~$0.51

Scenario B: 500 URLs, ~430 successful imprints

Actor start: $0.04
500 websites processed: $0.30
430 successful results: $1.94
Total: ~$2.28

Scenario C: Lead Discovery, 50 search terms (10 results each), ~350 successful imprints

Actor start: $0.04
50 SERP pages: $0.25
~500 websites processed: $0.30
350 successful results: $1.58
Total: ~$2.17

Input

The Actor accepts the following input parameters (see the Input tab in the Apify Console for the full, interactive schema):

Parameter	Type	Required	Default	Description
`inputMode`	string	no	`urls`	Input mode: `urls` (Target URLs) or `searchTerms` (Google Search lead discovery).
`targetUrls`	array	when `inputMode=urls`	-	Array of `{ "url": "..." }` objects. Invalid URLs are skipped and reported as input warnings.
`searchTerms`	string	when `inputMode=searchTerms`	-	One Google query per line. Used only in searchTerms mode.
`resultsPerSearchTerm`	integer	no	`10`	Organic results to fetch per search term (1–100).
`locationName`	string	no	`Germany`	Region for Google search results. Enter a country or city spelled out in full, e.g. `Germany` or `Berlin`.
`languageCode`	string	no	`de`	ISO language code for Google results.
`device`	string	no	`desktop`	Device for SERP requests: `desktop` or `mobile`.
`domainBlacklist`	string	no	-	Comma-separated domains to exclude from output in searchTerms mode. Matching URLs are still charged but not written to the dataset. Ignored when `inputMode` is `urls`.
`skipResultsWithoutEmail`	boolean	no	`false`	When enabled, results without an email address are not written to the dataset.
`maxRetries`	integer	no	`3`	Maximum retry attempts for failed requests (0–10).
`timeout`	integer	no	`5`	Per-request timeout in seconds (5–60).
`validateEmail`	boolean	no	`false`	Enable email validation; extra usage fees apply when on.
`proxyConfiguration`	object	no	-	Apify Proxy or custom proxy URLs (`useApifyProxy`, optional `proxyUrls`, `groups`, `countryCode`).
`performanceMax`	boolean	no	`false`	Higher parallelism; use only with at least 2 GB Actor memory.
`disableImprintUrl`	boolean	no	`false`	Omit `imprint_url` and `imprint_status` from each row.
`disableContactPerson`	boolean	no	`false`	Omit `contact_person` from each row.
`disableDecisionMakers`	boolean	no	`false`	Omit `decision_makers` array from each row. Does not affect `contact_person`.
`disableCompanyName`	boolean	no	`false`	Omit `company_name` from each row.
`disableCompanyAddress`	boolean	no	`false`	Omit `company_address` from each row.
`disablePhoneNumber`	boolean	no	`false`	Omit `phone_number` from each row.
`disableEmail`	boolean	no	`false`	Omit `email` and `email_status` from each row.
`disableRegisterNumber`	boolean	no	`false`	Omit `register_number` from each row.
`disableVatId`	boolean	no	`false`	Omit `vat_id` from each row.
`disableSocialMediaLinks`	boolean	no	`false`	Omit `social_media_links` from each row.
`useHeadlessBrowser`	boolean	no	`false`	Not shown in Console. If true, force browser-based scraping for all URLs.
`enableAxiosRetry`	boolean	no	`true`	Not shown in Console. If true, axios-first scraping with browser fallback.
`retryOptions`	object	no	-	Not shown in Console. Retry trigger criteria for axios-first mode (`requireEmail`, `requireContactPerson`, `requireAnyField`, `minFieldsRequired`).

Input Example (Target URLs)

{
"inputMode":"urls",
"targetUrls":[
{"url":"https://example.de"},
{"url":"https://another-site.de"}
],
"skipResultsWithoutEmail":false,
"maxRetries":1,
"timeout":5,
"validateEmail":true,
"proxyConfiguration":{
"useApifyProxy":false
}
}

Input Example (Lead Discovery)

{
"inputMode":"searchTerms",
"searchTerms":"zahnarzt berlin\nrechtsanwalt münchen",
"resultsPerSearchTerm":10,
"locationName":"Germany",
"languageCode":"de",
"device":"desktop",
"validateEmail":false
}

API and MCP usage

Runs write dataset rows for each valid input URL that reaches processing. Every row includes target_url for deterministic input-output mapping. The Actor does not write a separate key-value OUTPUT record. Fetch rows from the default dataset after the run succeeds (or use the synchronous endpoint below).

REST (sync, returns dataset items): replace YOUR_USERNAME, YOUR_API_TOKEN, and use the same JSON body as in Input Example.

curl"https://api.apify.com/v2/acts/YOUR_USERNAME~german-imprint-scraper/run-sync-get-dataset-items?token=YOUR_API_TOKEN"\
-X POST \
-H"Content-Type: application/json"\
-d'{"targetUrls":[{"url":"https://www.winning-solutions.de"}],"validateEmail":false}'

JavaScript (apify-client):

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token: process.env.APIFY_TOKEN});
const input ={
targetUrls:[{url:'https://www.winning-solutions.de'}],
validateEmail:false,
};
const run =await client.actor('YOUR_USERNAME~german-imprint-scraper').call(input);
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Apify MCP server (AI agents): configure your MCP client with URL https://mcp.apify.com?tools=YOUR_USERNAME~german-imprint-scraper (you can combine multiple tools per Apify MCP docs). Pass the API token via your client (for example an Authorization: Bearer ... header), not inside the Actor input JSON.

Output Structure

The Actor returns structured data for each processed URL. The table below lists the main fields. Optional dataset views in the Apify Console may show a subset or flattened columns. When disable options are enabled, the corresponding keys are omitted from each row.

Field	Type	Description	Example Value
`search_term`	string	null	Google query that surfaced this URL; null in Target URLs mode
`target_url`	string	Original input URL that produced this row	`"https://example.de"`
`imprint_url`	string	Found impressum page URL, `null` when not found or an error occurred	`"https://example.de/impressum"`
`imprint_status`	string	Result of imprint URL discovery: `FOUND`, `NOT_FOUND`, `FETCH_ERROR`, `URL_ERROR`, `ERROR`, or `UNKNOWN`	`"FOUND"`
`contact_person.first_name`	string	First name of the primary contact (backward compatible, always present)	`"Max"`
`contact_person.last_name`	string	Last name of the primary contact	`"Mustermann"`
`contact_person.salutation`	string	Salutation (Herr/Frau)	`"Herr"`
`decision_makers`	array	All responsible persons found in the imprint. Each entry has `first_name`, `last_name`, `salutation`, `academic_title` (e.g. `"Dr."`), `profession` (e.g. `"Steuerberater"`), and `roles` (array, e.g. `["Geschäftsführer", "Gesellschafter"]`).	see below
`company_name`	string	Full official company name	`"Example GmbH"`
`company_address.street`	string	Street name	`"Musterstraße"`
`company_address.house_number`	string	House number	`"123"`
`company_address.postalcode`	string	Postal code	`"12345"`
`company_address.city`	string	City name	`"Musterstadt"`
`phone_number`	string	Contact phone number	`"+49 123 456789"`
`email`	string	Contact email address	`"info@example.de"`
`email_status`	string	With validation: `DELIVERABLE`, `UNDELIVERABLE`, `UNKNOWN`, or `MISSING_EMAIL` if no email was found	`"DELIVERABLE"`
`register_number`	string	Commercial register number / Handelsregisternummer	`"HRB 12345"`
`vat_id`	string	VAT ID number / Umsatzsteuer-Id	`"DE123456789"`
`social_media_links`	object	Profile URLs for `facebook`, `instagram`, `linkedin`, `xing`, `youtube`, `twitter`, `tiktok`, `pinterest`, `whatsapp`. Empty string per key when not found. Detected from homepage and imprint HTML - no extra AI cost.	see below
`retryTriggered`	boolean	Whether retry was triggered during scraping	`false`
`retryReasons`	array	Array of reasons why retry was triggered (optional)	`["email is empty"]`
`_metadata.websiteProcessed`	boolean	Whether the website was successfully processed	`true`
`_metadata.resultCharged`	boolean	Whether this result was charged	`true`
`_metadata.emailValidated`	boolean	Whether email validation was performed	`true`
`_metadata.limitReached`	boolean	Whether usage limit was reached	`false`
`_metadata.error`	string	Present when the row records a top-level run failure	(varies)
`_metadata.errorContext`	mixed	Extra debugging context when `error` is set	(varies)

Output Example

{
"search_term":"steuerberater münchen",
"target_url":"https://example.de",
"imprint_url":"https://example.de/impressum",
"imprint_status":"FOUND",
"contact_person":{
"first_name":"Felix",
"last_name":"Keß",
"salutation":"Herr"
},
"decision_makers":[
{
"first_name":"Felix",
"last_name":"Keß",
"salutation":"Herr",
"academic_title":"",
"profession":"Steuerberater",
"roles":["Vertretungsberechtigter","Gesellschafter"]
},
{
"first_name":"Marcus",
"last_name":"Stein",
"salutation":"Herr",
"academic_title":"Dr.",
"profession":"Steuerberater",
"roles":["Vertretungsberechtigter","Gesellschafter"]
},
{
"first_name":"Jutta",
"last_name":"Keß",
"salutation":"Frau",
"academic_title":"",
"profession":"Steuerberaterin",
"roles":["Gesellschafterin"]
}
],
"company_name":"Keß & Partner Steuerberatungsgesellschaft mbB",
"company_address":{
"street":"Musterstraße",
"house_number":"123",
"postalcode":"80331",
"city":"München"
},
"phone_number":"+49 89 123456",
"email":"info@example.de",
"email_status":"DELIVERABLE",
"register_number":"HRB 12345",
"vat_id":"DE123456789",
"social_media_links":{
"facebook":"https://www.facebook.com/example",
"instagram":"https://www.instagram.com/example",
"linkedin":"https://www.linkedin.com/company/example",
"xing":"",
"youtube":"",
"twitter":"",
"tiktok":"",
"pinterest":"",
"whatsapp":""
},
"retryTriggered":false,
"retryReasons":[],
"_metadata":{
"websiteProcessed":true,
"resultCharged":true,
"emailValidated":true,
"limitReached":false
}
}

👁 German Imprint Scraper with Decision Makers Names Extraction avatar

German Imprint Scraper with Decision Makers Names Extraction

dominic-quaiser/imprint-contact-scraper

An Actor that automatically locates and scrapes key contact details from German website imprint pages (Impressum). It extracts information such as company name, address, phone numbers, emails, and decision-makers (Entscheider, Entscheidungsträger)

👁 User avatar

Dominic M. Quaiser

503

4.0

👁 German Imprint Scraper avatar

German Imprint Scraper

codescraper/german-imprint-scraper

A powerful Actor scraper to find and extract legal "Impressum" data from German websites. Get company names, addresses, decision-makers, legal IDs, and more, all automatically.

👁 User avatar

CodeScraper

105

5.0

👁 German Impressum Scraper (Bulk) avatar

German Impressum Scraper (Bulk)

luca-artur/german-impressum-scraper-bulk

Scrape german website imprints for: Company data, decision maker, phone, mail, social profiles, register number, meta description, and more.

👁 User avatar

Luca S.

👁 Scale Google Maps Scraper avatar

Scale Google Maps Scraper

huncho/google-maps-scraper

The most efficient way to scrape Google Maps at scale - $0.0004/result

👁 User avatar

Rahul Singh

240

3.1

👁 Google Search Scraper — Organic SERP, PAA & Related avatar

Google Search Scraper — Organic SERP, PAA & Related

automation-lab/google-search-scraper

Scrape Google organic results, People Also Ask questions, and related searches for SEO research, rank tracking, and content gap workflows.

👁 User avatar

Stas Persiianenko

115

👁 German Company Registry Scraper avatar

German Company Registry Scraper

dataharvest/handelsregister-scraper

Scrape German company data from Handelsregister.de.

👁 User avatar

Alex v

👁 German Imprint Scraper (Contact+Social Links) avatar

German Imprint Scraper (Contact+Social Links)

codescraper/german-impressum-scraper-fast

Very fast actor, Get Impressum data for just $1.5/1000 Results. This powerful scraper finds any German impressum page and extracts key company data: companyName, address, registerNumber, taxId, emails, phones, socialLinks, and page metadata. Get clean, reliable B2B data in seconds.

👁 User avatar

CodeScraper

👁 Handelsregister Scraper avatar

Handelsregister Scraper

dominic-quaiser/handelsregister-scraper

Leistungsstarker Actor zur Identifikation von Firmen und Entscheidungsträgern aus dem deutschen Handelsregister. Zugriff auf Geschäftsführer, Vorstände und vertretungsberechtigte Personen per Echtzeit-API — ideal für Lead-Enrichment, Marktanalysen, KYC und Due-Diligence.

👁 User avatar

Dominic M. Quaiser

👁 Instagram Hashtag Scraper avatar

Instagram Hashtag Scraper

scrapesmith/instagram-hashtag-scraper

⚡ Instagram Hashtag Scraper – Extract posts & reels from any public hashtag. Get captions, likes, views, comments, timestamps & media URLs. Bulk hashtag support, no login needed. Export to JSON, CSV or Excel. $0.50/1,000 results.

👁 User avatar

Scrape Smith

657

3.7

CEO Finder

mystical_waterbird/ceo-finder

Extract CEOs, founders & decision-makers from EN websites with high accuracy. Input data as text or JSON. Integrates with Clay.com and other tools. Async, fast, validated output, handles /impressum, /team, LinkedIn & email patterns. Perfect for B2B lead enrichment.

👁 User avatar

Shahrukh Yahh

URL: https://apify.com/winningsolutions/german-imprint-scraper

⇱ ✨ German Imprint Scraper & Leads Finder (Google Search) · Apify

✨ German Imprint Scraper & Leads Finder (Google Search)

German Impressum Scraper & Leads Finder – Decision Makers, Email Verification & Company Data

🚀 New in version 2.0: Lead Finder

Use Cases

Index

Release Notes

v2.3.1 - fixed an issue where the actor could time-out when scraping 800+ domains

v2.3 - Social media link extraction

v2.2.1 - fixed an issue where the actor could time-out

v2.2 - All decision makers extracted

v2.1 - Domain blacklist for lead discovery

🚀 v2.0 - Lead discovery via Google Search

v1.3 - API and MCP readiness

v1.2 - Faster processing & smarter AI analysis

v1.1 - Bug fixes & reliability improvements

🎉 v1.0 - Initial public release

Lead Discovery with Google Search

Features

Advanced Email Validation

Pricing

Cost Examples

Input

Input Example (Target URLs)

Input Example (Lead Discovery)

API and MCP usage

Output Structure

Output Example

You might also like

German Imprint Scraper with Decision Makers Names Extraction

German Imprint Scraper

German Impressum Scraper (Bulk)

Scale Google Maps Scraper

Google Search Scraper — Organic SERP, PAA & Related

German Company Registry Scraper

German Imprint Scraper (Contact+Social Links)

Handelsregister Scraper

Instagram Hashtag Scraper

CEO Finder