PDF Toolkit — Extract Text, Metadata & Page Count

Pricing

$4.00 / 1,000 PDFs processed

Extract text from PDFs, read metadata (title, author, dates), count pages. Bulk processing from URLs. $0.003 per PDF.

Developer

Manchitt Sanan

Maintained by Community

Last modified: a month ago

Operations

Operation	What it returns
`extract-text`	Full text content + page count
`get-metadata`	Title, author, subject, creator, producer, creation/modification dates + page count
`page-count`	Number of pages only

Quick start

{
  "items": [
    {
      "url": "https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf",
      "operation": "extract-text"
    }
  ]
}

Input

Each item in the items array:

Field	Type	Required	Description
`url`	string	Yes	URL to a PDF file
`operation`	enum	Yes	`extract-text`, `get-metadata`, or `page-count`
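
For bulk runs, the input can be generated rather than written by hand. The following is a minimal Python sketch, not part of the actor itself: `build_input` is a hypothetical helper, and the check that every URL starts with `http://` or `https://` is an assumption about what the actor accepts. The operation names come from the Operations table.

```python
import json

# Operation names taken from the Operations table.
VALID_OPERATIONS = {"extract-text", "get-metadata", "page-count"}


def build_input(urls, operation="extract-text"):
    """Build an input object with one item per URL (hypothetical helper)."""
    if operation not in VALID_OPERATIONS:
        raise ValueError(f"unknown operation: {operation!r}")
    items = []
    for url in urls:
        # Assumption: only HTTP(S) URLs are usable as PDF sources.
        if not url.startswith(("http://", "https://")):
            raise ValueError(f"not an HTTP(S) URL: {url!r}")
        items.append({"url": url, "operation": operation})
    return {"items": items}


payload = build_input(
    ["https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf"],
    operation="page-count",
)
print(json.dumps(payload, indent=2))
```

The resulting object has the same shape as the Quick start input and can be pasted into the actor's input editor or passed as the run input.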

Output

{
  "url": "https://example.com/document.pdf",
  "operation": "extract-text",
  "text": "Full extracted text content...",
  "pageCount": 12,
  "fileSize": 245760,
  "status": "success",
  "error": null
}
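
Because every record carries `status` and `error`, a batch can be split into successes and failures after the run. The sketch below is illustrative only: `split_results` is a hypothetical helper, and the failure record (including the `"error"` status value and its message) is an assumed shape, since only a success record is documented here.

```python
def split_results(records):
    """Separate successful records from failed ones (hypothetical helper)."""
    ok, failed = [], []
    for record in records:
        if record.get("status") == "success" and record.get("error") is None:
            ok.append(record)
        else:
            failed.append(record)
    return ok, failed


records = [
    {
        "url": "https://example.com/document.pdf",
        "operation": "extract-text",
        "text": "Full extracted text content...",
        "pageCount": 12,
        "fileSize": 245760,
        "status": "success",
        "error": None,
    },
    # Assumed failure shape; the real status value and message may differ.
    {
        "url": "https://example.com/missing.pdf",
        "operation": "extract-text",
        "status": "error",
        "error": "download failed",
    },
]

ok, failed = split_results(records)
print(len(ok), "succeeded;", len(failed), "failed")
```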

Pricing

$0.003 per PDF processed (pay-per-event pricing).

Errors and dry runs are never charged.
100 PDFs = $0.30
1,000 PDFs = $3.00
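
The arithmetic above can be reproduced with a short Python sketch. It assumes the $0.003 per-PDF rate stated in this section and that only successful PDFs are billed; `estimate_cost` is a hypothetical helper, not part of the actor.

```python
from decimal import Decimal

# Rate as stated in this section; Decimal avoids float rounding noise.
PRICE_PER_PDF = Decimal("0.003")


def estimate_cost(successful_pdfs):
    """Estimate the charge in USD for a number of successfully processed PDFs."""
    if successful_pdfs < 0:
        raise ValueError("successful_pdfs must be non-negative")
    return PRICE_PER_PDF * successful_pdfs


for count in (100, 1000):
    print(f"{count} PDFs = ${estimate_cost(count):.2f}")
```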

Limitations

Text extraction only — no OCR. Scanned PDFs (images of text) will return empty or minimal text.
The maximum file size depends on the Apify memory allocation. The default of 256 MB handles most PDFs.
No PDF generation — this actor reads PDFs, doesn't create them. Use Apify's official HTML-to-PDF actor for generation.
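
Since scanned PDFs come back with empty or minimal text, one way to flag them is to compare the extracted text length with the page count. This is a rough heuristic sketch, not a feature of the actor: `looks_scanned` is a hypothetical helper and the threshold of 20 characters per page is an arbitrary assumption that should be tuned on real documents.

```python
def looks_scanned(record, min_chars_per_page=20):
    """Guess whether an extract-text record comes from a scanned PDF.

    Hypothetical heuristic: very little text per page suggests the pages
    are images of text that would need OCR.
    """
    text = (record.get("text") or "").strip()
    pages = record.get("pageCount") or 0
    if pages <= 0:
        return False  # no page count, nothing to judge
    return len(text) / pages < min_chars_per_page


scanned = {"text": "", "pageCount": 12}
digital = {"text": "x" * 1000, "pageCount": 2}
print(looks_scanned(scanned), looks_scanned(digital))
```

Records flagged this way can be routed to a separate OCR step instead of being treated as empty documents.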

Related actors in this suite

Other tools by accurate_pouch for content + asset processing:

QR Code Toolkit — Generate + decode, custom colors, logos, SVG/PNG/base64. $0.004/QR.
TheCrawler — Web scraper + LLM-powered structured extraction, includes PDF + DOCX. AGPL-3.0, also on npm (thecrawler@0.1.1). $0.005/page.
Google Sheets R/W — Read, append, replace, modify, backup. $0.004/op.
Broken Link Checker — Recursive crawl, sitemap + robots.txt parsing, webhook, Sheets export. $0.005/page.

Run on Apify

No setup needed. Click above to run in the cloud. $0.003 per operation.

URL: https://apify.com/accurate_pouch/pdf-toolkit