Pricing
from $40.00 / 1,000 document/chunks
Patents to Markdown for RAG
Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents β abstract, claims, description.
Pricing
from $40.00 / 1,000 document/chunks
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
5 hours ago
Last modified
Categories
Share
π Patents to Markdown for RAG
Convert patents (US/EP/WO) into clean, chunked Markdown for RAG and LLM pipelines via Google Patents β abstract, claims, description.
π The NexGenData Global IP & Trademark Suite
One office is never enough for IP due diligence β search trademarks and patents across the US, EU, China, Japan, Korea, Hong Kong, and WIPO. Every actor below is part of the NexGenData Global IP & Trademark Suite, so a clearance, freedom-to-operate, or brand-protection workflow that starts in one office can extend to every office, worldwide.
Trademarks
- USPTO Trademark Search (US) β word-mark, owner & status lookup across the full USPTO TESS dataset.
- EUIPO Trademark Search (EU) β EUIPO + TMview network across 35+ EU/EEA registries.
- Hong Kong Trademark Search (APAC) β the Hong Kong IPD register by mark, class, owner or status.
- Korea KIPO / KIPRIS Plus (KR) β Korean patents, trademarks & designs via the official KIPRIS Plus API.
- Japan JPO / J-PlatPat (JP) β Japanese trademarks, patents, utility models & designs from J-PlatPat.
Patents
- CNIPA China Patent Search (CN) β Chinese patents from the CNIPA database for IP & innovation tracking.
- Japan JPO / J-PlatPat (JP) β Japanese patents, utility models & designs from J-PlatPat.
- Korea KIPO / KIPRIS Plus (KR) β Korean patents & utility models via the official KIPRIS Plus API.
- WIPO PATENTSCOPE (Global) β worldwide PCT & national patents across WIPO's global collection.
- USPTO Patent Search (US) β US patents with full claims text for prior-art & freedom-to-operate work.
- Patents β Markdown for RAG (AI) β clean Markdown export of patents, ready to embed in a RAG / LLM pipeline. β you are here
Search every IP office, worldwide β one suite, pay-per-result, structured JSON.
β‘ What you get
One row per chunk: source, url, title, chunkIndex, totalChunks, markdown (LLM-ready, source URL = citation).
π― Use cases
- RAG over this content 2. Vector-store ingestion 3. Searchable knowledge bases 4. Citation-tagged LLM data
π Sample inputs
{"items":["US10000000B2","US9876543B2"],"chunkWords":800}
π¦ Sample output
{"source":"US10000000B2","title":"...","chunkIndex":0,"totalChunks":8,"markdown":"# ...\n..."}
π Sample Output
π How it works
- Fetch each source. 2. Isolate the main document. 3. HTML β ATX Markdown. 4. Chunk ~chunkWords. 5. One row/chunk + citation.
π Related Actors
π° Pricing Example
Pay-per-event: $0.005 per run + $0.04 per document/chunk (document-record).
| Chunks | Cost |
|---|---|
| 100 | ~$4.00 |
| 500 | ~$20.00 |
| 2,000 | ~$80.00 |
| Apify's $5 free credit covers ~124 chunks. Start free β |
βοΈ Legal & data sources
Fetches publicly-accessible documents with an identified User-Agent; output includes source URLs for attribution.
β FAQ
Citations? Yes. Chunk size? chunkWords. Fresh? Live. Key? No. Inputs? Public HTML. Dedup? Per run.
π Troubleshooting
- Empty markdown β JS-rendered/restricted page. - Boilerplate β use the canonical URL. - Huge β lower inputs/chunkWords. - 404 β check the URL/ID.
π·οΈ About NexGenData
Public-data tools for analysts, developers, and operators. thenextgennexus.com
