Pricing
from $10.00 / 1,000 results
Ofsted Reports Data Scraper
Scrape Ofsted full inspection reports for children's homes. Extracts 18 structured fields from PDFs โ judgement ratings, provider details, inspector info, home capacity and type โ filtered by date. Exports to MySQL and/or Apify dataset.
Pricing
from $10.00 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
7
Total users
3
Monthly active users
a month ago
Last modified
Categories
Share
Extract structured data from Ofsted full inspection PDF reports for children's homes โ at scale. Judgement ratings, provider details, inspectors, home capacity, specialism, dates โ 18 fields per report, parsed directly from the source PDFs. Export to your MySQL database, your Apify dataset, or both.
Built and maintained by Alkausari M.
โฆ Highlights
- ๐ Full PDF parsing โ 18 structured fields extracted from each report
- ๐ Date-filtered crawling โ target only reports in your inspection date range
- ๐ MySQL export โ direct insert/update with
ON DUPLICATE KEY UPDATE, no duplicates on re-runs - โป๏ธ Smart deduplication โ startup checks your existing records and skips already-processed PDFs
- ๐ Direct PDF URL support โ pass a
files.ofsted.gov.ukURL to process a single report - ๐ก Resilient โ auto-retry with exponential backoff; unparseable PDFs logged separately
โ How it works
- Paste a search URL โ from the Ofsted reports portal with your filters applied. Or pass a direct PDF URL.
- Set a date range โ
latest_report_date_startandlatest_report_date_end(YYYY-MM-DD). - Click Start โ the Actor finds matching providers โ Full Inspection reports โ downloads and parses each PDF.
// Example input{"start_urls": [{ "url": "https://reports.ofsted.gov.uk/search?q=&level_1_types=3&level_2_types%5B0%5D=11&status%5B0%5D=1&start=0&rows=10" }],"latest_report_date_start": "2026-02-15","latest_report_date_end": "2026-02-28","max_depth": 3,"skip_db_export": false,"db_host": "your-db-host","db_database": "your-database-name","db_user": "your-db-user","db_password": "your-db-password"}
Set skip_db_export: true to use the Actor without any database โ all data still lands in your Apify dataset (JSON, CSV, Excel, API).
MySQL tables
When MySQL export is enabled, two tables are used:
ofsted_reportsโ primary output, keyed onpdf_url. Records are inserted on first run, updated on re-runs.ofsted_unsupported_reportsโ PDFs that don't match the expected Ofsted format (e.g. older layouts) are logged here for review rather than silently dropped.
๐ฆ What you get back
Each record represents one parsed inspection report:
{"PDF URL":"https://files.ofsted.gov.uk/v1/file/50298941","Unique reference number":"2587763","Registered provider":"Mercia Children Services Ltd","Registered provider address":"Windsor House, Bayshill Road, Cheltenham, Gloucestershire GL50 3AT","Provision sub-type":"Children's home","Responsible individual":"Michael Lloyd","Registered manager":"David Griffiths","Inspection dates":"3 and 4 March 2026","Inspection type":"Full inspection","Overall experiences and progress":"good","Help and protection":"good","Leadership and management":"good","Date of last inspection":"25 February 2025","Overall judgement at last inspection":"good","Enforcement action since last inspection":"None","Inspectors":[{"name":"Helen Fee","role":"Social Care Inspector"}],"Home Capacity":"4","Home Type":"social and emotional difficulties"}
๐ Input
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
start_urls | Array | Yes | โ | Ofsted search URL(s) or a direct files.ofsted.gov.uk PDF URL |
latest_report_date_start | String | Yes | Today | Start of inspection date range (YYYY-MM-DD) |
latest_report_date_end | String | Yes | Today | End of inspection date range (YYYY-MM-DD) |
max_depth | Integer | No | 3 | 1 = listing only, 2 = provider pages, 3 = full PDF extraction |
skip_db_export | Boolean | No | false | true = skip MySQL, save to Apify dataset only |
db_host | String | If DB export | โ | MySQL host |
db_database | String | If DB export | โ | MySQL database name |
db_user | String | If DB export | โ | MySQL username |
db_password | String | If DB export | โ | MySQL password |
Direct PDF โ single-report mode
{"start_urls": [{ "url": "https://files.ofsted.gov.uk/v1/file/50287454" }],"max_depth": 1,"skip_db_export": true}
๐ก Use cases
- Research โ academic and policy analysis of inspection trends across providers and regions.
- Compliance monitoring โ track ratings and enforcement actions across the providers you work with.
- Sector consultancy โ build a structured dataset of children's-home judgements for client reporting.
- Scheduled syncs โ set a rolling 7-day date window and schedule daily/weekly runs; dedup ensures no rework.
- Data products โ power dashboards and BI on top of clean, parsed Ofsted data via the Apify API.
๐ฎ Support
Bugs, feature requests, or custom scraping work โ open an issue on Apify or email alkausarimujahid@gmail.com.
