Patents to Markdown for RAG

Pricing

from $40.00 / 1,000 document/chunks

Patents to Markdown for RAG

Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents — abstract, claims, description.

Pricing

from $40.00 / 1,000 document/chunks

Rating

0.0

(0)

Developer

👁 NexGenData

NexGenData

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

5 hours ago

Last modified

📑 Patents to Markdown for RAG

Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents — abstract, claims, description.

🌐 The NexGenData Global IP & Trademark Suite

One office is never enough for IP due diligence — search trademarks and patents across the US, EU, China, Japan, Korea, Hong Kong, and WIPO. Every actor below is part of the NexGenData Global IP & Trademark Suite, so a clearance, freedom-to-operate, or brand-protection workflow that starts in one office can extend to every office, worldwide.

Trademarks

USPTO Trademark Search (US) — word-mark, owner & status lookup across the full USPTO TESS dataset.
EUIPO Trademark Search (EU) — EUIPO + TMview network across 35+ EU/EEA registries.
Hong Kong Trademark Search (APAC) — the Hong Kong IPD register by mark, class, owner or status.
Korea KIPO / KIPRIS Plus (KR) — Korean patents, trademarks & designs via the official KIPRIS Plus API.
Japan JPO / J-PlatPat (JP) — Japanese trademarks, patents, utility models & designs from J-PlatPat.

Patents

CNIPA China Patent Search (CN) — Chinese patents from the CNIPA database for IP & innovation tracking.
Japan JPO / J-PlatPat (JP) — Japanese patents, utility models & designs from J-PlatPat.
Korea KIPO / KIPRIS Plus (KR) — Korean patents & utility models via the official KIPRIS Plus API.
WIPO PATENTSCOPE (Global) — worldwide PCT & national patents across WIPO's global collection.
USPTO Patent Search (US) — US patents with full claims text for prior-art & freedom-to-operate work.
Patents → Markdown for RAG (AI) — clean Markdown export of patents, ready to embed in a RAG / LLM pipeline. ← you are here

Search every IP office, worldwide — one suite, pay-per-result, structured JSON.

⚡ What you get

One row per chunk: source, url, title, chunkIndex, totalChunks, markdown (LLM-ready, source URL = citation).

🎯 Use cases

RAG over this content 2. Vector-store ingestion 3. Searchable knowledge bases 4. Citation-tagged LLM data

🚀 Sample inputs

{"items":["US10000000B2","US9876543B2"],"chunkWords":800}

📦 Sample output

{"source":"US10000000B2","title":"...","chunkIndex":0,"totalChunks":8,"markdown":"# ...\n..."}

📊 Sample Output

👁 Sample output

🛠 How it works

Fetch each source. 2. Isolate the main document. 3. HTML → ATX Markdown. 4. Chunk ~chunkWords. 5. One row/chunk + citation.

🔗 Related Actors

USPTO Patent Search\n- Trademark Search\n- SEC Filings to RAG\n- RAG Web Browser

💰 Pricing Example

Pay-per-event: $0.005 per run + $0.04 per document/chunk (document-record).

Chunks	Cost
100	~$4.00
500	~$20.00
2,000	~$80.00
Apify's $5 free credit covers ~124 chunks. Start free →

⚖️ Legal & data sources

Fetches publicly-accessible documents with an identified User-Agent; output includes source URLs for attribution.

❓ FAQ

Citations? Yes. Chunk size? chunkWords. Fresh? Live. Key? No. Inputs? Public HTML. Dedup? Per run.

🆘 Troubleshooting

Empty markdown → JS-rendered/restricted page. - Boilerplate → use the canonical URL. - Huge → lower inputs/chunkWords. - 404 → check the URL/ID.

🏷️ About NexGenData

Public-data tools for analysts, developers, and operators. thenextgennexus.com

Google Patents Scraper

scrapium/google-patents-scraper

👁 User avatar

Scrapium

👁 Google Patents Scraper avatar

Google Patents Scraper

api-empire/google-patents-scraper

🔎 Google Patents Scraper (google-patents-scraper) extracts structured patent data from Google Patents—titles, abstracts, inventors, assignees, CPC, claims, citations, priority dates & PDF links. ⚙️ Ideal for IP research, competitive intel & R&D. Export to CSV/JSON for analysis. 🚀

👁 User avatar

API Empire

👁 Google Patents Scraper avatar

Google Patents Scraper

scrapio/google-patents-scraper

🔎 Google Patents Scraper (google-patents-scraper) extracts titles, abstracts, claims, inventors, assignees, citations, IPC/CPC, dates, legal status & PDFs. 📦 Export CSV/JSON, API & batch ready. 🚀 Ideal for IP research, prior art search, patent analytics & competitive intelligence.

👁 User avatar

Scrapio

Google Patents Scraper - Patent Data, Claims & Citations

lulzasaur/google-patents-scraper

Scrape Google Patents for patent details, abstracts, claims, inventors, assignees, classifications, citations, similar patents, and PDF links. Search or provide patent URLs.

👁 User avatar

lulz bot

👁 Google Patents Scraper avatar

Google Patents Scraper

scrapier/google-patents-scraper

🔎 Google Patents Scraper extracts structured patent data from Google Patents — titles, abstracts, inventors, assignees, CPC/IPC, citations, claims, dates & PDFs. ⚡ Fast, reliable, and bulk-ready for IP research, competitive intel & R&D landscaping. 📊 CSV/JSON/API.

👁 User avatar

Scrapier

👁 Web-to-Markdown Generator for AI & RAG Pipelines avatar

Web-to-Markdown Generator for AI & RAG Pipelines

profitstack/web-to-markdown-generator-for-ai-rag-pipelines

Convert any website into clean, heading-based chunking, LLM-ready Markdown for RAG and AI agents.

👁 User avatar

Manas Mantri

👁 Google Patents Scraper avatar

Google Patents Scraper

scraper-engine/google-patents-scraper

🔎 Google Patents Scraper extracts rich patent data from Google Patents—titles, abstracts, claims, inventors, assignees, CPC/IPC, citations, legal status, dates & PDFs. ⚙️ Export CSV/JSON. 🚀 Ideal for prior art, IP due diligence, competitive intel & tech scouting.

👁 User avatar

Scraper Engine

👁 Website To Markdown avatar

Website To Markdown

smart_api/website-to-markdown

Convert any webpage into clean, LLM-ready Markdown in seconds — perfect for AI training data, RAG pipelines, and content archiving.

👁 User avatar

SmartApi

5.0

News & Announcements to Markdown for RAG

nexgendata/news-announcements-rag-markdown

Convert press releases, corporate announcements & news articles into clean, chunked Markdown for RAG and LLM pipelines. Article URLs or RSS feeds. No login.

👁 User avatar

NexGenData

Website to Markdown for LLM and RAG

jeweled_jockstrap/my-actor-3

Convert any URL to clean Markdown text for AI applications. Strips HTML extracts content. For LLM training RAG pipelines and vector databases. Free Firecrawl alternative.

👁 User avatar

Juan Triviño

URL: https://apify.com/nexgendata/patent-trademark-rag