VOOZH about

URL: https://apify.com/pink_comic/schema-markup-extractor

โ‡ฑ Structured Data Scraper - Schema Markup & JSON-LD API ยท Apify


๐Ÿ‘ Schema Markup & JSON-LD Scraper - Structured Data API avatar

Schema Markup & JSON-LD Scraper - Structured Data API

Pricing

from $2.00 / 1,000 urls

Go to Apify Store

Schema Markup & JSON-LD Scraper - Structured Data API

Extract schema markup, JSON-LD, Open Graph, Twitter Cards, and meta tags from any URL. Structured data scraper/API for SEO audits, rich result checks, schema validation, and competitor research.

Pricing

from $2.00 / 1,000 urls

Rating

0.0

(0)

Developer

๐Ÿ‘ Ava Torres

Ava Torres

Maintained by Community

Actor stats

0

Bookmarked

10

Total users

4

Monthly active users

12 days ago

Last modified

Share

Schema Markup & SEO Data Extractor

Extract JSON-LD structured data, Open Graph tags, Twitter Card metadata, and meta tags from any URL. Built for SEO auditors, developers, and data engineers who need structured page metadata at scale.

Pricing: $0.002 per URL (~$2 per 1,000 URLs)


What It Extracts

Data TypeExamples
JSON-LDProduct, Article, BreadcrumbList, FAQPage, LocalBusiness, WebSite, Person, Organization
Open Graphog:title, og:description, og:image, og:url, og:type, og:site_name
Twitter Cardtwitter:card, twitter:title, twitter:description, twitter:image, twitter:site
Meta Tagsdescription, keywords, author, robots, viewport, canonical
Schema TypesDeduplicated list of all @type values found on the page

Input

FieldTypeDefaultDescription
urlsarrayrequiredURLs to extract from
includeJsonLdbooleantrueParse JSON-LD script blocks
includeOpenGraphbooleantrueParse og: meta properties
includeTwitterCardbooleantrueParse twitter: meta tags
includeMetaTagsbooleantrueParse all <meta name=...> tags
concurrencyinteger5Parallel requests (1-20)
timeoutinteger30Per-URL timeout in seconds
maxResultsinteger50Cap on URLs processed

Output

Each URL produces one dataset record:

{
"url":"https://example.com/product/widget",
"jsonLd":[
{
"@context":"https://schema.org",
"@type":"Product",
"name":"Widget Pro",
"description":"A professional widget",
"offers":{
"@type":"Offer",
"price":"29.99",
"priceCurrency":"USD"
}
}
],
"openGraph":{
"title":"Widget Pro - Best Widgets",
"description":"A professional widget for professionals",
"image":"https://example.com/widget.jpg",
"type":"product"
},
"twitterCard":{
"card":"summary_large_image",
"title":"Widget Pro",
"image":"https://example.com/widget-twitter.jpg"
},
"metaTags":[
{"name":"description","content":"A professional widget for professionals"},
{"name":"keywords","content":"widget, pro, professional"}
],
"schemaTypes":["Product","Offer"]
}

If a URL fails to fetch or parse, the record includes an error field and empty arrays/objects for the structured data fields.


Use Cases

  • SEO audits โ€” verify JSON-LD is present and correct across hundreds of pages
  • Competitor research โ€” see what schema types competitors implement
  • Rich result eligibility โ€” check if pages qualify for Google rich results (Product, FAQ, Article, etc.)
  • Content aggregation โ€” extract og:image and og:title for link previews
  • Schema validation โ€” identify missing or malformed structured data before a site launch
  • Crawl pipelines โ€” feed output into downstream validators or dashboards

Notes

  • Uses a pure HTTP client โ€” no browser required, fast and cost-efficient
  • Handles @graph arrays in JSON-LD (common on WordPress/Yoast sites)
  • Handles both property="twitter:..." and name="twitter:..." meta tag formats
  • Follows up to 10 redirects per URL
  • Response body capped at 10 MB per page
  • No API key required

You might also like

JSON-LD Schema & Meta Tag Extractor

logiover/json-ld-schema-meta-tag-extractor

Bulk JSON-LD structured data scraper and meta tag extractor for any URL. Export Schema.org, OpenGraph and Twitter Cards to CSV/JSON. No API.

Structured Data Validator (JSON-LD / OG)

jungle_synthesizer/structured-data-validator-pro

Extract and validate structured data from any URL: JSON-LD, Open Graph, Twitter Cards, microdata, RDFa, meta tags. Local schema.org validation. Flags Google rich-result eligibility and AI-discovery readiness. Pure HTTP. Built for SEO audits and structured-data debugging at scale.

๐Ÿ‘ User avatar

BowTiedRaccoon

3

SEO/GEO - Schema Markup Scraper

wisteria_banjo/schema-markup-scraper

This actor to fetches JSON-LD/Schema Markup from Multiple URLs & checks whether the page contains markups for the following types: AggregateRating, Article, Event, FAQPage, LocalBusiness, Organization, Person, Product, & Review. Schema Markup helps search and generative engines find & read webpages.

64

LD+JSON Schema scraper

pocesar/json-ld-schema

Extract all LD+JSON tags from the given URLs.

457

5.0

Schema Markup Scraper - PPE

wisteria_banjo/schema-markup-scraper---ppe

This actor to fetches JSON-LD/Schema Markup from Multiple URLs & checks whether the page contains markups for the following types: AggregateRating, Article, Event, FAQPage, LocalBusiness, Organization, Person, Product, & Review. Schema Markup helps search and generative engines find & read webpages.