VOOZH about

URL: https://apify.com/memo23/doinggoodleeds-scraper

⇱ Doing Good Leeds Scraper Β· Apify


Pricing

from $2.00 / 1,000 results

Go to Apify Store

Doing Good Leeds Scraper

Scrape paid jobs, volunteering, events, and training from doinggoodleeds.org.uk via WP-JSON. Pick any subset of 4 entity types. ~178 entities total. Title, employer, location, salary, apply email/URL, full description HTML inline per row. JSON or CSV out, billed per result.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Muhamed Didovic

Muhamed Didovic

Maintained by Community

Actor stats

0

Bookmarked

20

Total users

19

Monthly active users

5.5 hours

Issues response

18 days ago

Last modified

Share

Scrape paid jobs, volunteering opportunities, events, and training courses from doinggoodleeds.org.uk β€” Leeds' volunteer hub. Four custom-post-type collections behind one WP-JSON REST API: pick one or many via the entityTypes input. Each row carries full structured data β€” title, employer, location, salary (when present), apply email/URL, and the complete description HTML. JSON or CSV out, no compute charge per run, just per result.

How it works

πŸ‘ How Doing Good Leeds Scraper works

✨ Why use this scraper?

Tracking Leeds' voluntary-sector hiring? Mapping volunteer opportunities for placement programmes? Building a community events calendar? Cataloguing the training courses charities are funding?

  • 🎯 Four entity types in one actor. job-listings (paid jobs, ~31), volunteers (volunteering opportunities, ~23), event (events, ~34), training-course (training courses, ~90) β€” pick any subset.
  • ⚑ WP-JSON REST API as the data source. Each entity is a WordPress custom post type with its own /wp-json/wp/v2/<cpt> endpoint.
  • 🏷️ Custom location taxonomy. WP Job Manager's _job_location meta is usually empty on Doing Good Leeds β€” we fall back to their custom location taxonomy (Leeds, regional Yorkshire, etc.).
  • πŸ“§ Apply email / URL captured. _application meta is split into applyEmail vs externalApplyUrl automatically.
  • 🌟 Cloudflare-friendly. Only the passive __cf_bm cookie is enforced β€” any sane UA passes without proxy.
  • πŸ“€ Clean exports. One row per entity, all fields inline. JSON + CSV exported automatically.

🎯 Use cases

TeamWhat they build
Voluntary sector recruitersDaily Leeds nonprofit hiring feeds
Volunteer co-ordinatorsOpportunity mapping for student placement programmes
Community events platformsPull events into a unified Leeds calendar
Training providersTrack what courses other charities are running / funding
Workforce strategyLeeds third-sector pay benchmarks
ResearchersLeeds civil-society datasets (jobs + volunteers + events + training)

πŸ“₯ Supported inputs

URL patternBehaviour
https://doinggoodleeds.org.uk/jobs/ etc.Listings for any CPT (the actor picks based on entityTypes)
https://doinggoodleeds.org.uk/job/<slug>/Single paid job
https://doinggoodleeds.org.uk/volunteer/<slug>/Single volunteer opportunity
https://doinggoodleeds.org.uk/event/<slug>/Single event
https://doinggoodleeds.org.uk/training-course/<slug>/Single training course
https://doinggoodleeds.org.uk/wp-json/wp/v2/{job-listings|volunteers|event|training_course}WP-JSON endpoint

Leave startUrls empty + pick entityTypes to scrape every entity of those types.

Not supported: mixing entity types in a single dataset row (each row is one CPT); hosts outside doinggoodleeds.org.uk.

πŸ”„ How it works

  1. Resolve start URLs β€” either from explicit startUrls, or from entityTypes (default ["job-listings"]).
  2. Classify + translate each URL into the canonical /wp-json/wp/v2/<cpt> shape β€” tagging it with which CPT it represents.
  3. Walk pagination via X-WP-TotalPages from the response header.
  4. Parse each WP-JSON item β€” title, content HTML, WP Job Manager meta (where present), _embed taxonomies (categories, types, location).
  5. Push one normalised row per entity to the dataset, tagged with the source CPT via the cpt field.

βš™οΈ Input parameters

ParameterTypeDefaultDescription
startUrlsarray["https://doinggoodleeds.org.uk/wp-json/wp/v2/job-listings"]Browser URLs, single-detail URLs, or WP-JSON endpoints.
entityTypesarray["job-listings"]Used when startUrls is empty. Allowed values: job-listings, volunteers, event, training-course.
enrichTaxonomiesbooleantrueWhen true, embeds taxonomy term names + featured image via WP-JSON _embed.
postedWithinHoursinteger(none)Only return rows posted in the last N hours (24 = last day, 72 = last 3 days). Empty/0 = all. Ideal for daily monitoring runs that only want fresh postings.
maxItemsinteger1000Hard cap on rows pushed (~178 total across all CPTs).
maxConcurrency / minConcurrencyinteger5 / 1Parallel WP-JSON page-fetch limits.
maxRequestRetriesinteger5Retries before a failed request is given up.
proxyobjectNo proxyCloudflare lets us through without a proxy.

πŸ“Š Output overview

Each entity is one single dataset row. The type field tells you what entity it is (job, volunteer, event, training), and the cpt field carries the raw CPT slug.

πŸ“¦ Output sample

{
"type":"job",
"cpt":"job-listings",
"source":"doinggoodleeds.org.uk",
"jobId":"111130",
"slug":"young-adults-worker-3",
"jobUrl":"https://doinggoodleeds.org.uk/job/young-adults-worker-3/",
"wpJsonUrl":"https://doinggoodleeds.org.uk/wp-json/wp/v2/job-listings/111130",
"title":"Young Adults Worker",
"description":"<p>Young Adults Worker role at Waythrough…</p>",
"descriptionText":"Young Adults Worker role at Waythrough…",
"companyName":null,
"companyWebsite":null,
"companyDomain":null,
"location":"Leeds",
"locations":["Leeds"],
"remote":false,
"salary":null,
"categories":[],
"employmentTypes":["Full Time"],
"contractType":"Full Time",
"featured":false,
"filled":false,
"status":"publish",
"postedDate":"2026-04-23T10:01:46Z",
"modifiedDate":"2026-04-23T10:01:46Z",
"applyType":"email",
"applyUrl":"https://doinggoodleeds.org.uk/job/young-adults-worker-3/",
"applyEmail":"olivia.hodgson@waythrough.org.uk",
"externalApplyUrl":null,
"featuredImageUrl":null,
"authorId":1,
"authorName":null,
"scrapedAt":"2026-05-20T00:13:00.000Z"
}

πŸ—‚ Key output fields

GroupFields
Identifierstype, cpt (job-listings / volunteers / event / training-course), source, jobId, slug, jobUrl, wpJsonUrl, scrapedAt
Contenttitle, description (HTML), descriptionText (plain)
DatespostedDate (ISO), modifiedDate (ISO)
EmployercompanyName (often null), companyWebsite, companyDomain, companyTagline, featuredImageUrl
Locationlocation (primary), locations[] (all taxonomy terms), remote
Compensationsalary.{currency, min, max, unit, raw} (when present in WP Job Manager meta)
Taxonomiescategories[], employmentTypes[], contractType
Flagsfeatured, filled, status
Apply flowapplyType, applyUrl, applyEmail, externalApplyUrl

❓ FAQ

Can I scrape all four entity types in one run? Yes. Set entityTypes to ["job-listings", "volunteers", "event", "training-course"]. Each row will have a cpt field indicating which collection it came from.

Why are some salaries empty? WP Job Manager's _job_salary meta isn't always populated for voluntary-sector roles. Look at the description HTML for compensation info when salary is null.

Why is companyName often null? Doing Good Leeds doesn't enforce the _company_name meta field. Org name is usually in the description or apply email domain β€” pull it from there if needed.

Can I scrape private pages or applicant data? No. Only the public WP-JSON REST API.

How do I limit results? Set maxItems. Run with ["training-course"] alone if you only want the 90 courses.

πŸ’¬ Support

πŸ›  Additional services

πŸ”Ž Explore more scrapers

See other scrapers at memo23's Apify profile β€” covering job boards, real estate, social media, and more.


⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Doing Good Leeds, Voluntary Action Leeds (VAL), or any of their subsidiaries or affiliates. All trademarks mentioned are the property of their respective owners.

The scraper accesses only the publicly available WP-JSON REST endpoints and public detail pages on doinggoodleeds.org.uk β€” no authenticated endpoints, recruiter-only features, or content behind a login. Users are responsible for ensuring their use complies with doinggoodleeds.org.uk's Terms of Service, applicable data-protection law (GDPR, CCPA, etc.), and any contractual obligations of their own organisation.


SEO Keywords

doing good leeds scraper, scrape doinggoodleeds.org.uk, doing good leeds api, leeds volunteer scraper, leeds charity jobs scraper, leeds nonprofit jobs api, leeds volunteer opportunities api, leeds community events scraper, leeds training courses scraper, voluntary action leeds scraper, Apify doing good leeds, leeds third sector jobs, leeds civil society data, yorkshire volunteer hub api, wp-json scraper, wp job manager scraper, charityjob alternative scraper, vassheffield alternative scraper, barnsleycvs alternative scraper, uk cvs jobs scraper, leeds nonprofit recruitment data

You might also like

Doing Good Leeds Scraper

parsebird/doinggoodleeds-scraper

Scrape paid jobs, volunteering, events, and training courses from doinggoodleeds.org.uk via WP-JSON. Four entity types, ~178 items total. Title, employer, location, salary, apply email/URL, full description HTML. Export as JSON, CSV, Excel.

Doing Good Leeds Scraper

maximedupre/doing-good-leeds-scraper

Scrape Doing Good Leeds jobs, volunteer opportunities, events, and training courses. Export titles, descriptions, dates, locations, action links, source URLs, and scrape times to Apify datasets, API, schedules, and integrations.

πŸ‘ User avatar

Maxime DuprΓ©

2

Community First Yorkshire Jobs Scraper

memo23/cfy-scraper

Scrape jobs and other portfolio content from communityfirstyorkshire.org.uk via WP-JSON portfolio CPT. Filter by taxonomy (default jobs β‰ˆ 6 vacancies). Title, full HTML, location, apply email/URL, best-effort closing date + salary regex. JSON or CSV out.

πŸ‘ User avatar

Muhamed Didovic

2

VAS Sheffield Jobs Scraper

memo23/vassheffield-scraper

Scrape voluntary-sector vacancies from jobs.vas.org.uk (Voluntary Action Sheffield) via WP-JSON. Title, sector taxonomy, posted date, external apply URL extracted from listing body, full description. ~25 live vacancies in one request. JSON or CSV out, billed per result

πŸ‘ User avatar

Muhamed Didovic

3

Highland Jobs (.scot) Scraper (/w EMAILS)

memo23/highlandjobs-scraper

crape every Scottish Highlands & Islands vacancy from highlandjobs.scot via the public WP-JSON API. Title, employer, location, salary (parsed), categories, posted date, full description HTML, apply email/URL. 204 jobs in ~3 requests. JSON or CSV out.

πŸ‘ User avatar

Muhamed Didovic

4

s1jobs.com Scraper (/w EMAILS)

memo23/s1jobs-scraper

Scrape Scottish job postings (all sectors) from s1jobs.com β€” title, salary, employer, location, real lat/lng coordinates, posted/closing dates, full description, structured skills, and the actual recruiter apply URL or apply email. Works with any listing or vacancy URL. JSON or CSV out.

πŸ‘ User avatar

Muhamed Didovic

22

Goodmoves.org Jobs Scraper

memo23/goodmoves-scraper

Scrape Scottish charity-sector job postings from Goodmoves.org β€” title, salary, workplace type, employer, posted/closing dates with time, full description, real lat/lng coordinates, OSCR charity number and apply email or apply URL. Works with any listing or vacancy URL. JSON or CSV out.

πŸ‘ User avatar

Muhamed Didovic

20

Google Jobs Scraper - Listings, Salary, Employer & Apply Links

santhej/google-jobs-scraper

Scrape Google Jobs at scale: job title, employer, location, salary, contract type, posting date & source/apply links. Any role or city, 190+ countries. Clean JSON/CSV for job boards, recruiters & market research. No API keys.

πŸ‘ User avatar

Santhej Kallada

1

VA Rotherham Jobs Scraper

memo23/varotherham-scraper

Scrape the varotherham.org.uk South Yorkshire voluntary-sector job board (Wix CMS). One HTTP request, every job inline: title, employer, location, closing date. Rotherham / Barnsley / Doncaster / Sheffield charities. JSON or CSV out, billed per result.

πŸ‘ User avatar

Muhamed Didovic

2

Snicket Jobs Scraper

memo23/snicket-scraper

Scrape snicket.org β€” Bradford and West Yorkshire community-sector vacancies. RSS + labelled detail-page extraction: title, organisation, salary, hours, closing date, payment schedule, contact name/email/phone, full HTML description. JSON or CSV out, billed per result

πŸ‘ User avatar

Muhamed Didovic

2