VOOZH about

URL: https://apify.com/nexgendata/patent-trademark-rag?fpr=2ayu9b

⇱ Patents to Markdown for RAG β€” Google Patents Β· Apify


Pricing

from $40.00 / 1,000 document/chunks

Go to Apify Store

Patents to Markdown for RAG

Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents β€” abstract, claims, description.

Pricing

from $40.00 / 1,000 document/chunks

Rating

0.0

(0)

Developer

πŸ‘ NexGenData

NexGenData

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

9 hours ago

Last modified

Share

πŸ“‘ Patents to Markdown for RAG

Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents β€” abstract, claims, description.

🌐 The NexGenData Global IP & Trademark Suite

One office is never enough for IP due diligence β€” search trademarks and patents across the US, EU, China, Japan, Korea, Hong Kong, and WIPO. Every actor below is part of the NexGenData Global IP & Trademark Suite, so a clearance, freedom-to-operate, or brand-protection workflow that starts in one office can extend to every office, worldwide.

Trademarks

Patents

Search every IP office, worldwide β€” one suite, pay-per-result, structured JSON.

⚑ What you get

One row per chunk: source, url, title, chunkIndex, totalChunks, markdown (LLM-ready, source URL = citation).

🎯 Use cases

  1. RAG over this content 2. Vector-store ingestion 3. Searchable knowledge bases 4. Citation-tagged LLM data

πŸš€ Sample inputs

{"items":["US10000000B2","US9876543B2"],"chunkWords":800}

πŸ“¦ Sample output

{"source":"US10000000B2","title":"...","chunkIndex":0,"totalChunks":8,"markdown":"# ...\n..."}

πŸ“Š Sample Output

πŸ‘ Sample output

πŸ›  How it works

  1. Fetch each source. 2. Isolate the main document. 3. HTML β†’ ATX Markdown. 4. Chunk ~chunkWords. 5. One row/chunk + citation.

πŸ”— Related Actors

πŸ’° Pricing Example

Pay-per-event: $0.005 per run + $0.04 per document/chunk (document-record).

ChunksCost
100~$4.00
500~$20.00
2,000~$80.00
Apify's $5 free credit covers ~124 chunks. Start free β†’

βš–οΈ Legal & data sources

Fetches publicly-accessible documents with an identified User-Agent; output includes source URLs for attribution.

❓ FAQ

Citations? Yes. Chunk size? chunkWords. Fresh? Live. Key? No. Inputs? Public HTML. Dedup? Per run.

πŸ†˜ Troubleshooting

  • Empty markdown β†’ JS-rendered/restricted page. - Boilerplate β†’ use the canonical URL. - Huge β†’ lower inputs/chunkWords. - 404 β†’ check the URL/ID.

🏷️ About NexGenData

Public-data tools for analysts, developers, and operators. thenextgennexus.com

You might also like

Google Patents Scraper

api-empire/google-patents-scraper

πŸ”Ž Google Patents Scraper (google-patents-scraper) extracts structured patent data from Google Patentsβ€”titles, abstracts, inventors, assignees, CPC, claims, citations, priority dates & PDF links. βš™οΈ Ideal for IP research, competitive intel & R&D. Export to CSV/JSON for analysis. πŸš€

Google Patents Scraper

scrapio/google-patents-scraper

πŸ”Ž Google Patents Scraper (google-patents-scraper) extracts titles, abstracts, claims, inventors, assignees, citations, IPC/CPC, dates, legal status & PDFs. πŸ“¦ Export CSV/JSON, API & batch ready. πŸš€ Ideal for IP research, prior art search, patent analytics & competitive intelligence.

Google Patents Scraper

scrapier/google-patents-scraper

πŸ”Ž Google Patents Scraper extracts structured patent data from Google Patents β€” titles, abstracts, inventors, assignees, CPC/IPC, citations, claims, dates & PDFs. ⚑ Fast, reliable, and bulk-ready for IP research, competitive intel & R&D landscaping. πŸ“Š CSV/JSON/API.

Web-to-Markdown Generator for AI & RAG Pipelines

profitstack/web-to-markdown-generator-for-ai-rag-pipelines

Convert any website into clean, heading-based chunking, LLM-ready Markdown for RAG and AI agents.

Google Patents Scraper

scraper-engine/google-patents-scraper

πŸ”Ž Google Patents Scraper extracts rich patent data from Google Patentsβ€”titles, abstracts, claims, inventors, assignees, CPC/IPC, citations, legal status, dates & PDFs. βš™οΈ Export CSV/JSON. πŸš€ Ideal for prior art, IP due diligence, competitive intel & tech scouting.

πŸ‘ User avatar

Scraper Engine

2

Website To Markdown

smart_api/website-to-markdown

Convert any webpage into clean, LLM-ready Markdown in seconds β€” perfect for AI training data, RAG pipelines, and content archiving.