Pricing
from $4.99 / 1,000 results
Bundesagentur fΓΌr Arbeit Scraper
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
10 days ago
Last modified
Categories
Share
π©πͺ Bundesagentur fΓΌr Arbeit Job Scraper
π The fastest, most complete Arbeitsagentur.de job scraper on Apify. Pull thousands of fresh German-language job postings β title, employer, salary notes, contract type, contact emails, full description, application URL and 30+ other fields β straight into a clean dataset.
π‘ Why Choose This Scraper?
- β‘ Blazing fast β async, concurrent fetching of search pages and job details simultaneously.
- π― Bulk input β feed any number of Arbeitsagentur.de search URLs and get a single deduped dataset back.
- π°οΈ Smart proxy fallback β starts direct, auto-escalates to Apify datacenter, then residential proxies on rate limits, then sticks with the upgraded tier.
- πΎ Live saving β every job is pushed to the dataset the moment it's ready. A crash mid-run never costs you data.
- π‘οΈ Production-grade error handling β retries with exponential backoff on every external call.
- π§ Contact email extraction β automatically pulls application emails out of job descriptions.
- πͺπΊ DACH coverage β postings from Germany, Austria, and Switzerland whenever Arbeitsagentur.de surfaces them.
π― Key Features
| Feature | Description |
|---|---|
| π Bulk URL input | Drop any number of Arbeitsagentur.de search URLs |
| π Keyword search | Optional position + location fallback inputs |
| π°οΈ Adaptive proxy | none β datacenter β residential (3 retries) escalation |
| π Rich output | 38 normalized fields per job |
| πΎ Live persistence | push_data after every record β no data loss on crash |
| π Retry policy | 3 attempts with exponential backoff + jitter |
| π§ Email extraction | Regex-pulled application emails from descriptions |
| π Deduplication | By Arbeitsagentur reference number across queries |
π₯ Input
{"startUrls":[{"url":"https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Softwareentwickler%2Fin"}],"position":"Softwareentwickler/in","location":"Berlin","maxItems":100,"proxyConfiguration":{"useApifyProxy":false}}
| Field | Type | Required | Description |
|---|---|---|---|
π startUrls | array | β | List of Arbeitsagentur.de job search URLs |
πΌ position | string | β | Keyword fallback when no URL is supplied |
π location | string | β | German city, region, or postal code |
π― maxItems | integer | β | Total result cap across all queries (default 100) |
π°οΈ proxyConfiguration | object | β | Defaults to no proxy; auto-falls back on blocks |
π€ Output
Each dataset row looks like:
{"title":"Softwareentwickler/in","mainJob":"Softwareentwickler/in","descriptionText":"LOCATION: Villach, AT, 9500 REQ ID: 198751 ...","jobType":"ARBEIT","referenceNumber":"11949-17224369-S","publishingStartDate":"2026-05-15","firstPublicationDate":"2026-05-15","lastUpdatedAt":"2026-05-15T17:04:43.169","contractDuration":"UNBEFRISTET","startDate":"2026-05-16","salaryNote":"KEINE_ANGABEN","employerName":"Lam Research AG","employerWebsite":null,"employerIndustry":null,"location":"9500 Villach-Lind, KAERNTEN, OESTERREICH","isFullTime":true,"isWorkFromHome":false,"externalURL":"https://jobs.ams.at/public/emps/jobs/...","isExternal":true,"applicationEmail":null,"url":"https://www.arbeitsagentur.de/jobsuche/jobdetail/11949-17224369-S"}
Full list of returned fields:
title, mainJob, descriptionText, jobType, referenceNumber, publishingStartDate, firstPublicationDate, lastUpdatedAt, contractDuration, contractDurationLimitedTo, contractDurationMonths, startDate, entryPeriod, salaryNote, employerName, employerCustomerNumberHash, employerWebsite, employerIndustry, employerFirmSize, location, employerAddress, isFullTime, isMarginalEmployment, isPartTimeMorning, isPartTimeAfternoon, isPartTimeEvening, isPartTimeFlexible, isWorkFromHome, nightOrWeekendShift, isDisabilityRequired, isPrivateAgency, isTemporaryEmployment, externalURL, isExternal, applicationEmail, allianzpartnerName, allianzpartnerUrl, url.
π How to Use (Apify Console)
- π Log in at https://console.apify.com β Actors.
- π Open this actor (Bundesagentur fΓΌr Arbeit Job Scraper).
- βοΈ Configure inputs:
- Paste one or more Arbeitsagentur.de search URLs, or
- Provide a position keyword + optional location.
- Set
maxItems(default: 100).
- βΆοΈ Click Start.
- π Watch the live log β every saved job appears in real time.
- π When the run finishes, open the Output tab.
- π₯ Export as JSON, CSV, XLSX, HTML, RSS or XML.
π€ Use via API / curl
Run synchronously and stream the dataset back:
curl-X POST "https://api.apify.com/v2/acts/<USER>~bundesagentur-fur-arbeit-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN"\-H"Content-Type: application/json"\-d'{"startUrls": [{ "url": "https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Softwareentwickler%2Fin" }],"maxItems": 50}'
Run asynchronously and poll:
curl-X POST "https://api.apify.com/v2/acts/<USER>~bundesagentur-fur-arbeit-scraper/runs?token=$APIFY_TOKEN"\-H"Content-Type: application/json"\-d'{"startUrls":[{"url":"https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Pflegekraft"}],"maxItems":200}'
πΌ Best Use Cases
- π§βπΌ Recruiters β track open vacancies in your sector and city
- π’ HR Tech / ATS vendors β enrich your job feed with German listings
- π Market research β analyse the German labor market by region, industry, contract type
- π€ Job aggregators β build vertical job boards on top of the official German source
- π° Journalists / analysts β quote real numbers on supply and demand
π° Pricing
This actor is on pay-per-event (PPE).
| Event | Description | Why |
|---|---|---|
π apify-actor-start | One charge per run start | Covers warm-up compute |
π apify-default-dataset-item | One charge per saved job posting | Pay only for results you actually receive |
Synthetic events are configured in the Apify Console. You only pay for successful, saved results β no data, no charge.
β Frequently Asked Questions
Q: Where does the data come from? A: The official Arbeitsagentur.de public job search API (the same one powering the website's search and detail pages).
Q: Do I need a proxy? A: No. The actor defaults to a direct connection. If the upstream rate-limits us, the proxy strategy auto-escalates to Apify datacenter, then residential proxies β no input needed.
Q: How fresh is the data? A: It's a live API call on every run β what you get is what Arbeitsagentur.de is serving right now.
Q: How many results can I scrape?
A: As many as Arbeitsagentur.de exposes β set maxItems to the upper bound you want.
Q: Are application emails always available? A: No β only when the employer included a contact email inside the public description text. We regex-extract whenever present.
Q: Why are some jobs from Austria?
A: Arbeitsagentur.de partners with AMS (Austria) and shows DACH-region postings. We expose the allianzpartnerName field so you can filter.
π‘οΈ Cautions / Legal
- π All data comes from publicly available endpoints.
- βοΈ End users are responsible for compliance with GDPR, the target site's ToS, and any local regulations.
- π€ The actor honors reasonable rate limits and uses exponential backoff to be a good API citizen.
π¬ Support & Feedback
Spotted a bug? Need a new field? Open an issue from the Apify Console β we read every report. π
Built with β€οΈ on Apify Β· 2026
