VOOZH about

URL: https://apify.com/automationagents/web-json-ld

โ‡ฑ JSON-LD Extractor ยท Apify


Pricing

from $20.00 / 1,000 successful api calls

Go to Apify Store

Extract structured JSON-LD (Schema.org) data from any web page.

Pricing

from $20.00 / 1,000 successful api calls

Rating

0.0

(0)

Developer

๐Ÿ‘ Alex Jordan

Alex Jordan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

10 days ago

Last modified

Categories

Share

What does JSON-LD Extractor do?

JSON-LD Extractor pulls all structured Schema.org data embedded in any web page's <script type="application/ld+json"> blocks โ€” products, articles, organizations, breadcrumbs, events, reviews, and more โ€” and returns it as clean, parsed JSON. No parsing required on your end.

Built on the Apify platform, results are returned in seconds and integrate with Apify's scheduling, webhooks, and 1,500+ tools.

Why use JSON-LD Extractor?

  • E-commerce intelligence โ€” Extract structured product data (price, availability, rating) from competitor pages
  • News & content monitoring โ€” Pull article metadata (author, publish date, headline) at scale
  • SEO research โ€” Audit which Schema.org types competitors use to win rich snippets in Google
  • Business data โ€” Extract organization info (address, phone, social profiles) from company websites
  • Event tracking โ€” Collect structured event data from ticketing or conference sites

How to use JSON-LD Extractor

  1. Click Try for free on this Actor's page
  2. Enter the URL you want to extract JSON-LD from (e.g. https://www.bbc.com/news)
  3. Click Start and wait a few seconds
  4. Download your results from the Output tab in JSON, CSV, or Excel

Input

FieldTypeRequiredDescription
urlstringโœ…Page URL to extract JSON-LD from
cachebooleanโŒUse cached result if available (default true)

Example input:

{
"url":"https://www.bbc.com/news"
}

Output

Example output:

{
"items":[
{
"@type":"WebSite",
"@context":"https://schema.org",
"name":"BBC News",
"url":"https://www.bbc.com/news"
},
{
"@type":"Organization",
"@context":"https://schema.org",
"name":"BBC",
"logo":{"url":"https://www.bbc.com/logo.png"}
}
],
"meta":{"cache_hit":false,"execution_time_ms":380}
}

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data fields

FieldTypeDescription
itemsarrayAll JSON-LD objects found on the page
items[][@type]stringSchema.org type (e.g. Article, Product, Organization)
items[][@context]stringSchema context (usually https://schema.org)

Pricing / Cost estimation

$0.02 per successful API call on Apify.

  • 1,000 successful Apify runs = $20.00

FAQ & Support

Is this legal? JSON-LD data is intentionally embedded by website owners for search engines and bots to read โ€” extracting it is fully legitimate.

Known limitations: JSON-LD injected dynamically via JavaScript after page load may not be captured. Static HTML JSON-LD is fully supported.

Need help? Open an issue in the Issues tab or contact the support team for custom solutions.

You might also like

LD+JSON Schema scraper

pocesar/json-ld-schema

Extract all LD+JSON tags from the given URLs.

457

5.0

JSON-LD Schema & Meta Tag Extractor

logiover/json-ld-schema-meta-tag-extractor

Bulk JSON-LD structured data scraper and meta tag extractor for any URL. Export Schema.org, OpenGraph and Twitter Cards to CSV/JSON. No API.

JSON-LD Validator

automation-lab/jsonld-validator

This actor validates JSON-LD structured data on web pages. It extracts all `<script type="application/ld+json">` blocks, validates JSON syntax, checks for required properties (@context, @type), and verifies recommended fields for known Schema.org types like Product, Article, Organization,...

๐Ÿ‘ User avatar

Stas Persiianenko

9

Structured Data Extractor

automation-lab/structured-data-extractor

This actor extracts structured data markup from web pages. It parses all three major formats: JSON-LD (`<script type="application/ld+json">`), Microdata (`itemscope`/`itemprop`), and RDFa (`typeof`/`property`). For each page, it returns the full structured data objects, detected Schema.org...

๐Ÿ‘ User avatar

Stas Persiianenko

15