Pricing
from $2.00 / 1,000 SERP listings scraped
Herold.at Scraper - Austrian Yellow Pages
Scrape business listings from herold.at, Austria's Yellow Pages with 360K+ businesses. Extract names, phone numbers, emails, and addresses directly from search results. HTTP-only, low memory, competitive pricing.
Features
- Phone + email on SERP -- Direct tel: and mailto: links, no detail page needed
- ~30 results per page with automatic pagination
- Austrian address parsing -- 4-digit postal codes + city names
- Optional detail pages -- Website URLs, opening hours, geo coordinates, company details
- Qwik JSON extraction -- Structured data from herold.at's Qwik framework (rating, industries, founding year, keywords)
- Social media links -- Facebook, Instagram, LinkedIn, Xing, YouTube via vCard + HTML parsing
- HTTP-only -- Low memory (~128MB), fast execution
- Pay-per-result pricing via Apify billing events
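To illustrate the address parsing mentioned in the feature list, here is a minimal sketch of splitting an Austrian address into street, 4-digit postal code, and city. It is not the actor's actual implementation; the function name `parse_austrian_address` and the assumed input shape ("street, PLZ city") are hypothetical.

```python
import re
from typing import Optional, TypedDict


class ParsedAddress(TypedDict):
    street: str
    postal_code: str
    city: str


# Austrian postal codes have four digits and do not start with 0.
# Assumed input shape (hypothetical): "<street>, <PLZ> <city>" or "<PLZ> <city>".
_ADDRESS_RE = re.compile(
    r"^(?:(?P<street>.+?),\s*)?(?P<plz>[1-9]\d{3})\s+(?P<city>.+)$"
)


def parse_austrian_address(raw: str) -> Optional[ParsedAddress]:
    """Return street, postal code and city, or None if no postal code is found."""
    match = _ADDRESS_RE.match(raw.strip())
    if match is None:
        return None
    return {
        "street": (match.group("street") or "").strip(),
        "postal_code": match.group("plz"),
        "city": match.group("city").strip(),
    }


if __name__ == "__main__":
    print(parse_austrian_address("Stephansplatz 1, 1010 Wien"))
```

Returning `None` instead of raising keeps a single malformed listing from aborting a batch.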
Use with AI Agents (MCP)
Connect this actor to any MCP-compatible AI client -- Claude Desktop, Claude.ai, Cursor, VS Code, LangChain, LlamaIndex, or custom agents.
Apify MCP server URL:
https://mcp.apify.com?tools=santamaria-automations/herold-at-scraper
Example prompt once connected:
"Use herold-at-scraper to scrape company data from herold.at. Return results as a table."
Clients that support dynamic tool discovery (Claude.ai, VS Code) will receive the full input schema automatically via add-actor.
Input
| Field | Type | Default | Description |
|---|---|---|---|
| keyword | string | required | Business type (e.g., "Elektriker", "Restaurant") |
| location | string | optional | City/region (e.g., "Wien", "Salzburg"). Empty = all Austria |
| maxResults | number | 100 | Maximum results to scrape |
| includeDetails | boolean | false | Fetch detail pages for website/hours/geo |
| proxyConfiguration | object | Apify proxy | Proxy settings |
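As a rough sketch of how the input fields above fit together, the following builds and validates a run input, then (optionally) starts the actor with the `apify-client` Python package. The helper names `build_run_input` and `run_actor` are hypothetical, and the actor ID is assumed to match the identifier in the MCP URL. `proxyConfiguration` is left out on the assumption that the default applies when it is omitted.

```python
from typing import Any, Dict, List, Optional

# Assumption: the actor ID equals the identifier used in the MCP URL.
ACTOR_ID = "santamaria-automations/herold-at-scraper"


def build_run_input(
    keyword: str,
    location: Optional[str] = None,
    max_results: int = 100,
    include_details: bool = False,
) -> Dict[str, Any]:
    """Build the actor input from the documented fields (hypothetical helper)."""
    if not keyword.strip():
        raise ValueError("keyword is required")
    if max_results < 1:
        raise ValueError("maxResults must be at least 1")
    run_input: Dict[str, Any] = {
        "keyword": keyword.strip(),
        "maxResults": max_results,
        "includeDetails": include_details,
    }
    if location:  # empty location means all of Austria
        run_input["location"] = location
    return run_input


def run_actor(token: str, run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Start the actor and return its dataset items (needs `pip install apify-client`)."""
    from apify_client import ApifyClient  # imported lazily so the sketch loads without it

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

For example, `run_actor(token, build_run_input("Elektriker", "Wien", max_results=50))` would request up to 50 electricians in Vienna.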
Output Fields
From SERP (always available)
- name -- Business name
- phone -- Phone number (directly from SERP!)
- email -- Email address (directly from SERP!)
- address -- Full address
- city, postal_code -- Address components
- category -- Business category
- detail_url -- Link to detail page
- logo_url -- Business logo image URL (from SERP listing)
From Detail Page (when includeDetails=true)
- website -- Business website URL
- opening_hours -- Business hours
- description -- Business description (from Qwik JSON or meta tags)
- latitude, longitude -- Geo coordinates
- logo_url -- Business logo image URL (higher quality from Qwik JSON)
- social_links -- Social media profiles as { facebook: "url", instagram: "url", ... }
- branches -- Comma-separated industry names (primary + secondary industries)
- rating_score -- Average rating (0-5 scale)
- rating_count -- Number of reviews
- founding_year -- Company founding year
- company_register_id -- Austrian company register number (Firmenbuchnummer)
- ksv_url -- KSV1870 credit check link
- keywords -- Comma-separated business keywords
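Because branches and keywords arrive as comma-separated strings, downstream code will often want them as lists. The sketch below shows one way to normalize a dataset item; the helper names and the sample record are hypothetical, and only field names listed above are used.

```python
from typing import Any, Dict, List, Optional


def split_csv(value: Optional[str]) -> List[str]:
    """Split a comma-separated field into trimmed, non-empty parts."""
    if not value:
        return []
    return [part.strip() for part in value.split(",") if part.strip()]


def normalize_listing(item: Dict[str, Any]) -> Dict[str, Any]:
    """Copy a dataset item, turning comma-separated fields into lists."""
    normalized = dict(item)
    normalized["branches"] = split_csv(item.get("branches"))
    normalized["keywords"] = split_csv(item.get("keywords"))
    # social_links is only present when includeDetails=true; default to {}.
    normalized["social_links"] = dict(item.get("social_links") or {})
    return normalized


if __name__ == "__main__":
    # Hypothetical sample record, for illustration only.
    sample = {
        "name": "Beispiel Elektro GmbH",
        "postal_code": "1010",
        "city": "Wien",
        "branches": "Elektriker, Elektroinstallationen",
        "keywords": "",
    }
    print(normalize_listing(sample))
```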
Pricing Events
| Event | Price |
|---|---|
| directory-start | $0.05 |
| directory-serp-result | $0.003 |
| directory-detail-result | $0.005 |
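A quick way to budget a run is to add up the events in the table. The sketch below assumes that `directory-start` is charged once per run and that a result enriched with a detail page is charged both the SERP and the detail event; neither assumption is confirmed by this page, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices copied from the pricing events table (USD).
PRICES = {
    "directory-start": Decimal("0.05"),
    "directory-serp-result": Decimal("0.003"),
    "directory-detail-result": Decimal("0.005"),
}


def estimate_cost(serp_results: int, detail_results: int = 0) -> Decimal:
    """Estimate the cost of one run in USD.

    Assumptions (not confirmed by the source): the start event is billed once,
    and detail results are billed in addition to their SERP result.
    """
    if serp_results < 0 or detail_results < 0:
        raise ValueError("result counts must be non-negative")
    return (
        PRICES["directory-start"]
        + PRICES["directory-serp-result"] * serp_results
        + PRICES["directory-detail-result"] * detail_results
    )


if __name__ == "__main__":
    print(estimate_cost(100))        # 100 SERP results, no detail pages
    print(estimate_cost(100, 100))   # 100 SERP results, all with detail pages
```

`Decimal` is used instead of `float` so that sub-cent prices add up without rounding noise.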
Enrichment add-ons
After the scrape completes, this actor can automatically trigger AI-powered extraction on every website found in the results. Each add-on runs as a separate actor and produces its own dataset.
Contact extraction
Extracts team member names, email addresses, phone numbers, positions, and departments from company websites. Powered by the Website Contact Extractor.
Enable it by setting enableContactExtraction: true and providing at least one LLM API key. The sub-actor run ID is stored in the key-value store as CONTACT_EXTRACTOR_RUN_ID.
Job listing extraction
Extracts open positions, job titles, locations, departments, and career page URLs from company websites. Powered by the Website Job Extractor.
Enable it by setting enableJobExtraction: true and providing at least one LLM API key. The sub-actor run ID is stored in the key-value store as JOB_EXTRACTOR_RUN_ID.
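Both add-ons record their sub-actor run ID under a fixed key, so a caller can look the IDs up after the main run finishes. The sketch below assumes the keys live in the run's default key-value store and that `client` behaves like `apify_client.ApifyClient` (its `key_value_store(...).get_record(...)` returns a dict with a `value` entry, or `None`). The helper name is hypothetical.

```python
from typing import Any, Dict, Optional

# Key names as documented for the two add-ons.
ENRICHMENT_KEYS = {
    "contacts": "CONTACT_EXTRACTOR_RUN_ID",
    "jobs": "JOB_EXTRACTOR_RUN_ID",
}


def get_sub_actor_run_ids(client: Any, run: Dict[str, Any]) -> Dict[str, Optional[str]]:
    """Look up the enrichment run IDs for a finished run.

    Assumption: the IDs are written to the run's *default* key-value store.
    """
    store = client.key_value_store(run["defaultKeyValueStoreId"])
    run_ids: Dict[str, Optional[str]] = {}
    for label, key in ENRICHMENT_KEYS.items():
        record = store.get_record(key)  # None when the add-on was not enabled
        run_ids[label] = record["value"] if record else None
    return run_ids
```

An add-on that was not enabled simply yields `None`, so callers can check which enrichment datasets to fetch.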
Browser fallback
Some company websites are built with JavaScript frameworks (React, Vue, Angular) that require a full browser to render. When enableBrowserFallback is set to true, the contact/job extractors will automatically re-scrape these sites with Playwright. This catches ~25% more sites but increases cost and runtime. Only applies when contact or job extraction is enabled.
LLM API keys
Both add-ons use LLMs to extract structured data. Provide one or more API keys. When multiple keys are provided, the system uses them in priority order with automatic fallback:
- Gemini (recommended) -- Best quality-to-cost ratio. Free tier includes 1M tokens/min. Get a key at Google AI Studio.
- Groq (optional) -- Ultra-fast inference. Get a key at Groq Console.
- OpenRouter (optional) -- Access to 100+ models. Get a key at OpenRouter.
One key is sufficient. With multiple keys, if the primary provider hits a rate limit, the system falls back to the next available provider automatically.
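The fallback behaviour described above can be pictured as a loop over providers in priority order. This is an illustrative sketch only, not the add-ons' actual code: `RateLimitError`, `extract_with_fallback`, and the callable-based provider interface are all hypothetical.

```python
from typing import Callable, Optional, Sequence, Tuple


class RateLimitError(Exception):
    """Hypothetical error a provider call raises when it is rate limited."""


Provider = Tuple[str, Callable[[str], str]]


def extract_with_fallback(providers: Sequence[Provider], prompt: str) -> Tuple[str, str]:
    """Try each provider in priority order; return (provider_name, response).

    Only rate-limit errors trigger a fallback; any other error propagates.
    """
    last_error: Optional[Exception] = None
    for name, call in providers:
        try:
            return name, call(prompt)
        except RateLimitError as exc:
            last_error = exc  # remember the failure and try the next provider
    raise RuntimeError("no provider available") from last_error


if __name__ == "__main__":
    def limited(prompt: str) -> str:
        raise RateLimitError("quota exceeded")

    def working(prompt: str) -> str:
        return f"extracted from: {prompt}"

    # Provider names follow the list above; the order here is an assumption.
    print(extract_with_fallback([("gemini", limited), ("groq", working)], "example.at"))
```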
Related Actors
DACH Business Directories
- FirmenABC.at Scraper -- Austrian business directory
- Gelbe Seiten Scraper -- German Yellow Pages
- wlw.de Scraper -- German B2B supplier directory
- Das Oertliche Scraper -- German phone directory
- search.ch Scraper -- Swiss business directory
- Europages Scraper -- 30+ European countries
Swiss Company Data
- Zefix.ch Scraper -- Official Swiss commercial register
- Moneyhouse.ch Scraper -- Swiss company registry
Enrich your leads
- Website Email & Phone Scraper -- Extract emails and phones from company websites
- Website Contact Extractor -- Extract team members and decision-makers
- Google Maps Scraper -- Find businesses by location
- Trustpilot Reviews Scraper -- Get company reviews and ratings
Support
Found a bug or have a feature request? Open an issue.
