VOOZH about

URL: https://apify.com/accurate_pouch/pdf-toolkit

โ‡ฑ PDF Toolkit โ€” Extract Text, Metadata & Page Count ยท Apify


๐Ÿ‘ PDF Toolkit โ€” Extract Text, Metadata & Page Count avatar

PDF Toolkit โ€” Extract Text, Metadata & Page Count

Pricing

$4.00 / 1,000 pdf processeds

Go to Apify Store

PDF Toolkit โ€” Extract Text, Metadata & Page Count

Extract text from PDFs, read metadata (title, author, dates), count pages. Bulk processing from URLs. $0.003 per PDF.

Pricing

$4.00 / 1,000 pdf processeds

Rating

0.0

(0)

Developer

๐Ÿ‘ Manchitt Sanan

Manchitt Sanan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Categories

Share

Process PDFs from URLs in bulk. Extract full text content, read document metadata (title, author, creation date), and count pages. $0.003 per PDF processed.


Operations

OperationWhat it returns
extract-textFull text content + page count
get-metadataTitle, author, subject, creator, producer, creation/modification dates + page count
page-countNumber of pages only

Quick start

{
"items":[
{
"url":"https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf",
"operation":"extract-text"
}
]
}

Input

Each item in the items array:

FieldTypeRequiredDescription
urlstringYesURL to a PDF file
operationenumYesextract-text, get-metadata, or page-count

Output

{
"url":"https://example.com/document.pdf",
"operation":"extract-text",
"text":"Full extracted text content...",
"pageCount":12,
"fileSize":245760,
"status":"success",
"error":null
}

Pricing

$0.003 per PDF processed (pay-per-event pricing).

  • Errors and dry runs are never charged.
  • 100 PDFs = $0.30
  • 1,000 PDFs = $3.00

Limitations

  • Text extraction only โ€” no OCR. Scanned PDFs (images of text) will return empty or minimal text.
  • Max file size depends on Apify memory allocation. Default 256MB handles most PDFs.
  • No PDF generation โ€” this actor reads PDFs, doesn't create them. Use Apify's official HTML-to-PDF actor for generation.

Related actors in this suite

Other tools by accurate_pouch for content + asset processing:

  • QR Code Toolkit โ€” Generate + decode, custom colors, logos, SVG/PNG/base64. $0.004/QR.
  • TheCrawler โ€” Web scraper + LLM-powered structured extraction, includes PDF + DOCX. AGPL-3.0, also on npm (thecrawler@0.1.1). $0.005/page.
  • Google Sheets R/W โ€” Read, append, replace, modify, backup. $0.004/op.
  • Broken Link Checker โ€” Recursive crawl, sitemap + robots.txt parsing, webhook, Sheets export. $0.005/page.

Run on Apify

๐Ÿ‘ Run on Apify

No setup needed. Click above to run in the cloud. $0.003 per operation.

You might also like

PDF Text Extractor - Bulk PDF to Text & Metadata

santamaria-automations/pdf-extractor

Extract text and metadata from any PDF URL in bulk. Get page content, author, title, creation date, and more. Detects scanned PDFs that need OCR. Perfect for document analysis, research, and compliance.

PDF Parser API

george.the.developer/pdf-parser-api

Instant API that parses any PDF from a URL โ€” extracts full text, page count, metadata (title, author, dates), and PDF version. Returns structured JSON. Perfect for document processing pipelines and AI agents.

PDF Scraper

onidivo/pdf-scraper

Scrape and extract text from PDF links.

๐Ÿ‘ User avatar

Onidivo Technologies

512

Pdf To Text Scraper

getdataforme/pdf-to-text-scraper

The Pdf To Text Scraper is an Apify Actor that efficiently extracts text from PDFs, preserving structure and supporting batch processing....

Extract text from PDF

akash9078/pdf-text-extractor

Efficiently extract text content from PDF files, ideal for data processing, content analysis, and automation workflows. Supports various PDF structures and outputs clean, readable text.

๐Ÿ‘ User avatar

Akash Kumar Naik

107

Fast Pdf Processor

contemporary_fruit/pdf-processor-actor

This API is a PDF Processing Service allowing users to upload a PDF to: Extract Text: Reads all text from the PDF and returns it as structured JSON data per page. Merge Pages: Creates a new PDF containing only the specific pages selected by the user. (260 characters)

AI Data Extraction from PDF

actor4you/ai-data-extraction-from-pdf

Extract text data from PDF files using AI. Upload PDFs directly or provide URLs. Supports text chunking for LLM workflows.