German Imprint Leads Scraper

Pricing

Pay per event

Extract German Impressum legal contacts, company details, VAT IDs, HRB records, emails, and decision-makers from domains.

Developer

Stas Persiianenko

Maintained by Community

Last modified

8 days ago

What does German Imprint Leads Scraper do?

It visits each submitted domain, checks common German legal-contact pages such as /impressum, /service/impressum, /imprint, and /kontakt, follows likely footer links, and saves one structured lead record per domain.
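
The path-probing step described above can be sketched as follows. This is only an illustration: the helper name `candidate_imprint_urls` is hypothetical, and the actor's real path list, ordering, and footer-link scoring are not documented here.

```python
from urllib.parse import urljoin, urlsplit

# Paths named in the description above. The actor's actual list and order
# are an assumption, not documented behaviour.
COMMON_IMPRINT_PATHS = ["/impressum", "/service/impressum", "/imprint", "/kontakt"]


def candidate_imprint_urls(start_url: str) -> list[str]:
    """Return the site root joined with each common legal-contact path."""
    # Accept bare domains such as "dm.de" by assuming HTTPS.
    if "://" not in start_url:
        start_url = "https://" + start_url
    parts = urlsplit(start_url)
    root = f"{parts.scheme}://{parts.netloc}"
    return [urljoin(root, path) for path in COMMON_IMPRINT_PATHS]


for url in candidate_imprint_urls("https://www.rewe.de"):
    print(url)
```

Any path component in the submitted URL is dropped, so a deep link and a homepage yield the same candidates.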

Who is it for?

🧑‍💼 Sales teams enriching German B2B account lists
🧾 Compliance teams checking public company disclosures
🧲 Lead-generation agencies building Germany-specific datasets
🧑‍💻 Recruiters finding company decision makers
🧹 CRM operations teams normalizing German legal contacts

Why use it?

German websites often place high-value company data in the Impressum instead of on a marketing contact page. This actor targets that legal-contact workflow directly instead of returning generic page text.

What data can it extract?

Field	Description
`inputUrl`	Submitted domain or URL
`imprintUrl`	Best Impressum/contact page found
`companyName`	Legal company name when detected
`legalForm`	GmbH, AG, KG, UG, e.K., and similar forms
`address`	Registered or legal address snippet
`emails`	Public email addresses
`phoneNumbers`	Public phone numbers
`vatId`	German VAT ID / USt-IdNr
`registrationCourt`	Amtsgericht / register court
`registrationNumber`	HRB/HRA registration number
`managingDirectors`	Geschäftsführer, Vorstand, or similar names
`responsiblePerson`	Responsible person when disclosed
`confidenceFlags`	Flags showing which important fields were found
`sourceSnippets`	Text snippets for verification
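
Before loading records into a CRM, it can help to sanity-check the shape of `vatId` and `registrationNumber`. The sketch below is a format check only and is not part of the actor: the function names and the register-number pattern are assumptions, and a passing value is not verified against any tax authority or register.

```python
import re

# German VAT IDs (USt-IdNr) are "DE" followed by nine digits.
VAT_ID_RE = re.compile(r"^DE\d{9}$")
# Assumed pattern: HRA/HRB, a number, and an optional short letter suffix.
REGISTER_RE = re.compile(r"^HR[AB]\s?\d+(\s?[A-Z]{1,2})?$")


def looks_like_vat_id(value: str) -> bool:
    """True if the value has the shape of a German VAT ID."""
    return bool(VAT_ID_RE.match(value.replace(" ", "").upper()))


def looks_like_register_number(value: str) -> bool:
    """True if the value has the shape of an HRA/HRB register number."""
    return bool(REGISTER_RE.match(value.strip().upper()))


print(looks_like_vat_id("DE812706034"))
print(looks_like_register_number("HRB 66773"))
```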

How much does it cost to extract German Impressum leads?

The actor uses pay-per-event pricing with a small start fee and a per-result fee. Current configured pricing is:

Event	Free	Bronze	Silver	Gold	Platinum	Diamond
Run start	$0.005	$0.005	$0.005	$0.005	$0.005	$0.005
Result extracted	$0.0006508	$0.00056591	$0.00044141	$0.00033955	$0.00022636	$0.00015845

Example estimates before Apify platform fees: 100 extracted domains cost about $0.070 on Free, $0.062 on Bronze, and $0.039 on Gold, including the start event. The default two-domain prefill costs about $0.0063 on Free, so it stays suitable for a quick first test.
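
The estimates above follow from one start fee plus a per-result fee. A small sketch for other volumes, using the fees from the table; `estimate_cost` is a hypothetical helper, and the figures exclude Apify platform fees.

```python
# Fees copied from the pricing table above (USD).
RUN_START_FEE = 0.005
RESULT_FEE = {
    "Free": 0.0006508,
    "Bronze": 0.00056591,
    "Silver": 0.00044141,
    "Gold": 0.00033955,
    "Platinum": 0.00022636,
    "Diamond": 0.00015845,
}


def estimate_cost(results: int, tier: str = "Free") -> float:
    """One run-start event plus one result event per extracted domain."""
    return RUN_START_FEE + results * RESULT_FEE[tier]


for tier in ("Free", "Bronze", "Gold"):
    print(f"{tier}: ${estimate_cost(100, tier):.3f}")
# Free: $0.070
# Bronze: $0.062
# Gold: $0.039
```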

Input

Provide domains or URLs in startUrls.

{
  "startUrls": [
    { "url": "https://www.rewe.de" },
    { "url": "https://www.dm.de" }
  ],
  "maxPagesPerDomain": 8,
  "includeSubpages": true,
  "proxyConfiguration": { "useApifyProxy": false }
}
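
When the domains come from a spreadsheet or CRM export, the input object can be built programmatically. A minimal sketch, assuming bare domains should be given an `https://` prefix; `build_run_input` is a hypothetical helper, and the other values mirror the example input above.

```python
def build_run_input(domains, max_pages_per_domain=8):
    """Turn a list of domains or URLs into an actor input dictionary."""
    start_urls = []
    seen = set()
    for raw in domains:
        url = raw.strip()
        if not url:
            continue  # skip empty rows
        if "://" not in url:
            url = "https://" + url  # assumption: bare domains use HTTPS
        if url not in seen:  # drop exact duplicates, keep first-seen order
            seen.add(url)
            start_urls.append({"url": url})
    return {
        "startUrls": start_urls,
        "maxPagesPerDomain": max_pages_per_domain,
        "includeSubpages": True,
        "proxyConfiguration": {"useApifyProxy": False},
    }


print(build_run_input(["rewe.de", "https://www.dm.de", "rewe.de"]))
```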

Output

Each dataset item represents one submitted domain or URL.

{
  "inputUrl": "https://www.rewe.de",
  "inputDomain": "rewe.de",
  "imprintUrl": "https://www.rewe.de/service/impressum/",
  "status": "found",
  "companyName": "REWE Markt GmbH",
  "legalForm": "GmbH",
  "emails": ["impressum@rewe.de"],
  "vatId": "DE812706034",
  "registrationNumber": "HRB 66773",
  "confidenceFlags": ["company_name_found", "email_found"]
}

How to use it

1. Prepare a list of German domains or websites.
2. Paste them into the Start URLs field.
3. Keep `maxPagesPerDomain` low for quick enrichment.
4. Run the actor.
5. Export the dataset as JSON, CSV, Excel, or via API.
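
If you fetch the records as JSON and want your own CSV layout, list fields such as `emails` need flattening. A minimal sketch using only the standard library; the column selection, the `"; "` separator, and the helper name `records_to_csv` are assumptions, not the platform's export format.

```python
import csv
import io

# Columns chosen for illustration, taken from the example output record.
COLUMNS = ["inputUrl", "imprintUrl", "status", "companyName",
           "legalForm", "emails", "vatId", "registrationNumber"]


def records_to_csv(records) -> str:
    """Serialize lead records to CSV text, joining list values with '; '."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, extrasaction="ignore")
    writer.writeheader()
    for record in records:
        row = {}
        for column in COLUMNS:
            value = record.get(column, "")
            row[column] = "; ".join(value) if isinstance(value, list) else value
        writer.writerow(row)
    return buffer.getvalue()


print(records_to_csv([{
    "inputUrl": "https://www.rewe.de",
    "status": "found",
    "emails": ["impressum@rewe.de"],
}]))
```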

Tips for better results

Submit homepages, not random blog posts.
Keep includeSubpages enabled so footer Impressum links are followed.
Start without a proxy; most public legal pages are accessible directly.
Increase maxPagesPerDomain only for sites with unusual navigation.

Status values

`found` means an Impressum/contact page was located and parsed.
`not_found` means pages were checked but no legal-contact page scored high enough.
`error` means the domain could not be processed due to a network or parsing error.
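
A quick way to see how a run went is to count records per status. A minimal sketch, assuming the records are already loaded as dictionaries; `summarize_status` is a hypothetical helper.

```python
from collections import Counter


def summarize_status(records) -> Counter:
    """Count dataset records per status value."""
    # Records without a status key are counted as "unknown" (an assumption).
    return Counter(record.get("status", "unknown") for record in records)


sample = [
    {"inputUrl": "https://www.rewe.de", "status": "found"},
    {"inputUrl": "https://www.dm.de", "status": "found"},
    {"inputUrl": "https://example.invalid", "status": "error"},
]
print(summarize_status(sample))
```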

Confidence flags

Confidence flags help filter records:

company_name_found
address_found
email_found
phone_found
vat_id_found
registration_found
decision_maker_found
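
To see what a record lacks, compare its flags against the full list above. A minimal sketch; `missing_flags` is a hypothetical helper.

```python
# The seven flags listed above.
ALL_FLAGS = [
    "company_name_found",
    "address_found",
    "email_found",
    "phone_found",
    "vat_id_found",
    "registration_found",
    "decision_maker_found",
]


def missing_flags(record: dict) -> list[str]:
    """Return the flags that are absent from a record's confidenceFlags."""
    present = set(record.get("confidenceFlags") or [])
    return [flag for flag in ALL_FLAGS if flag not in present]


print(missing_flags({"confidenceFlags": ["company_name_found", "email_found"]}))
```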

Integrations

Use the output with:

HubSpot or Salesforce enrichment workflows
Clay tables and lead-routing systems
Google Sheets lead lists
Compliance review queues
Internal data-quality checks

API usage: Node.js

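import { ApifyClient } from 'apify-client';

// Read the API token from the environment rather than hard-coding it.
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

// Start the actor and wait for the run to finish.
const run = await client.actor('automation-lab/german-imprint-leads-scraper').call({
    startUrls: [{ url: 'https://www.rewe.de' }],
    maxPagesPerDomain: 8,
});

// The extracted records are stored in the run's default dataset.
console.log(run.defaultDatasetId);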

API usage: Python

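from apify_client import ApifyClient

# Replace the placeholder with your Apify API token.
client = ApifyClient('YOUR_APIFY_TOKEN')

# Start the actor and wait for the run to finish.
run = client.actor('automation-lab/german-imprint-leads-scraper').call(run_input={
    'startUrls': [{'url': 'https://www.rewe.de'}],
    'maxPagesPerDomain': 8,
})

# The extracted records are stored in the run's default dataset.
print(run['defaultDatasetId'])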
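
The example above prints only the dataset ID. A minimal sketch of reading the records back, assuming the `dataset(...).iterate_items()` method of the `apify-client` package; the helper name `fetch_found_leads` is hypothetical, and the client is passed in so the function can be tried without network access.

```python
def fetch_found_leads(client, dataset_id: str) -> list[dict]:
    """Return dataset items whose status is "found".

    `client` is expected to be an ApifyClient instance (assumption: it
    exposes dataset(id).iterate_items(), as in the apify-client package).
    """
    items = client.dataset(dataset_id).iterate_items()
    return [item for item in items if item.get("status") == "found"]
```

With a real client, the call would look like `fetch_found_leads(client, run['defaultDatasetId'])`.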

API usage: cURL

curl -X POST 'https://api.apify.com/v2/acts/automation-lab~german-imprint-leads-scraper/runs?token=YOUR_APIFY_TOKEN' \
  -H 'Content-Type: application/json' \
  -d '{"startUrls":[{"url":"https://www.rewe.de"}],"maxPagesPerDomain":8}'

MCP usage

Connect Apify MCP with this actor enabled:

https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper

Claude Code setup:

$ claude mcp add apify-german-imprint 'https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper'

Claude Desktop JSON config:

{
  "mcpServers": {
    "apify-german-imprint": {
      "url": "https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper"
    }
  }
}

Example prompts:

"Extract Impressum contacts for these 20 German domains."
"Find VAT IDs and managing directors for this German prospect list."
"Check which domains have no public legal contact details."

Legality

This actor extracts publicly available business information from websites you provide. You are responsible for using the data lawfully, respecting website terms, and complying with GDPR, ePrivacy, and other applicable rules.

FAQ

Why did one domain return `not_found`?

The site may use a non-standard legal page URL, block automated HTTP clients, or render its legal data only via JavaScript. Try submitting the exact Impressum URL or increasing `maxPagesPerDomain`.

Does this actor validate email deliverability?

No. It extracts public emails from pages. Use a dedicated email validation service if you need deliverability checks.

Troubleshooting

If a site returns no data, try raising maxPagesPerDomain or submitting the exact Impressum URL.

If many requests fail, enable Apify Proxy or retry later. Some sites block automated traffic intermittently.
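
Both suggestions can be combined into a second run that covers only the domains that failed. A minimal sketch under stated assumptions: `build_retry_input` is a hypothetical helper, the page limit of 15 is an arbitrary illustration, and `useApifyProxy: True` is assumed to be the way to enable Apify Proxy, mirroring the `false` value in the example input.

```python
def build_retry_input(records, max_pages_per_domain=15):
    """Build an actor input covering only not_found and error records."""
    retry_urls = [
        {"url": record["inputUrl"]}
        for record in records
        if record.get("status") in ("not_found", "error")
    ]
    return {
        "startUrls": retry_urls,
        "maxPagesPerDomain": max_pages_per_domain,  # raised from the example's 8
        "includeSubpages": True,
        "proxyConfiguration": {"useApifyProxy": True},  # assumption, see lead-in
    }


first_run = [
    {"inputUrl": "https://www.rewe.de", "status": "found"},
    {"inputUrl": "https://www.dm.de", "status": "not_found"},
]
print(build_retry_input(first_run))
```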

Limitations

The actor uses HTTP and Cheerio for speed and low cost. Some JavaScript-only pages may expose fewer fields than a browser-based scraper.

Privacy notes

The actor does not log in, bypass paywalls, or access private systems. It only reads public pages reachable from submitted domains.

Changelog

Initial version extracts German Impressum legal-contact fields from submitted domains and URLs.

Support

If you need fields tuned for a specific German industry or CMS pattern, open an Apify issue with sample URLs and expected output.

Field reference

pagesChecked lists every URL requested for the domain. sourceSnippets contains nearby text around key legal labels so users can audit extraction quality.

Performance

HTTP-only crawling keeps runs lightweight. The default platform memory is 512 MB, and the number of pages requested per domain is capped by `maxPagesPerDomain`.

Data quality workflow

Use confidenceFlags to route complete leads into your CRM and send lower-confidence rows to manual review.
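
The routing rule can be sketched as a small function. The required-flag set below is a hypothetical threshold chosen for illustration, as are the `route` helper and the `"crm"` and `"review"` labels; adjust them to what your CRM actually needs.

```python
# Hypothetical minimum for an automatic CRM import.
REQUIRED_FLAGS = frozenset({"company_name_found", "email_found"})


def route(record: dict, required=REQUIRED_FLAGS) -> str:
    """Return "crm" for complete leads and "review" for everything else."""
    if record.get("status") != "found":
        return "review"
    flags = set(record.get("confidenceFlags") or [])
    # Every required flag must be present for automatic routing.
    return "crm" if required <= flags else "review"


lead = {
    "status": "found",
    "confidenceFlags": ["company_name_found", "email_found"],
}
print(route(lead))
```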

URL: https://apify.com/automation-lab/german-imprint-leads-scraper