VOOZH about

URL: https://apify.com/agenscrape/pdf-power-tools

⇱ Pdf Power Tools · Apify


Pricing

Pay per event

Go to Apify Store

Pdf Power Tools

Split, merge, compress, convert & OCR PDFs via API. Extract text from scanned documents in 14 languages. Compress files for email, convert pages to PNG/JPEG/WebP, split by pages or ranges, merge multiple PDFs. Perfect for document automation & data extraction workflows.

Pricing

Pay per event

Rating

0.0

(0)

Developer

👁 Agenscrape

Agenscrape

Maintained by Community

Actor stats

1

Bookmarked

21

Total users

3

Monthly active users

7 months ago

Last modified

Share

Facing an issue, unexpected error, edge case, or have a feature suggestion? Post it here and we'll address it within 24 hours.

What is PDF Power Tools?

PDF Power Tools is a comprehensive PDF processing API that handles all your PDF manipulation needs in the cloud. Whether you need to split large documents, merge multiple PDFs, compress files for email, extract text from scanned documents using OCR, or convert PDF pages to images - this actor does it all.

Perfect for:

  • Document automation workflows - Process PDFs at scale without local software
  • Data extraction pipelines - Extract text from scanned invoices, receipts, contracts
  • Content management systems - Generate thumbnails, compress uploads, split documents
  • Archival and digitization - OCR historical documents, enhance scanned pages
  • Web applications - Server-side PDF processing via API

Features

Split PDF

Break down large PDF documents into smaller, manageable files. Split options include:

  • Each page separate - Create individual PDFs for every page
  • By page ranges - Split into custom ranges (e.g., pages 1-10, 11-20, 21-30)
  • Split in half - Divide document into two equal parts
  • Extract specific pages - Pull out only the pages you need
  • By file size - Automatically split when file exceeds size limit

Merge PDF

Combine multiple PDF files into a single document:

  • Merge unlimited PDFs in sequence
  • Custom merge order
  • Interleave pages from multiple documents
  • Insert pages from one PDF into another at specific positions

Compress PDF

Reduce PDF file size for email attachments, web uploads, or storage optimization:

  • Low compression - Minimal size reduction, highest quality
  • Medium compression - Balanced quality and file size (default)
  • High compression - Maximum size reduction
  • Screen preset - Optimized for on-screen viewing
  • Print preset - Optimized for printing quality

Convert PDF to Images

Transform PDF pages into high-quality images:

  • Output formats: PNG, JPEG, WebP, TIFF
  • Customizable DPI (72-600)
  • Convert all pages or specific page selection
  • Combine all pages into single tall image
  • Generate thumbnails

OCR - Text Extraction from Scanned PDFs

Extract text from scanned documents, images, and non-searchable PDFs using Tesseract OCR:

  • 14 supported languages: English, French, German, Spanish, Italian, Portuguese, Dutch, Polish, Russian, Chinese (Simplified & Traditional), Japanese, Korean, Arabic
  • Image preprocessing for improved accuracy
  • Confidence scores per page
  • Word and line count statistics

Enhance Scanned PDFs

Improve readability of scanned documents:

  • Sharpen blurry text and images
  • Reduce noise and artifacts
  • Adjust contrast and brightness
  • Configurable DPI settings

Page Manipulation

Fine-grained control over PDF pages:

  • Reorder pages within a document
  • Remove unwanted pages
  • Insert pages at specific positions

PDF Information

Analyze PDF files before processing:

  • Page count and dimensions
  • File size breakdown
  • Detect if PDF is scanned or native text
  • Compression estimate

Input Options

Basic Input

{
"operation":"split",
"pdfUrl":"https://example.com/document.pdf"
}

Using Base64 Input

{
"operation":"compress",
"pdfBase64":"JVBERi0xLjcKCjEgMCBvYmoK..."
}

Operation Examples

Get PDF Information

{
"operation":"info",
"pdfUrl":"https://example.com/document.pdf"
}

Split Into Individual Pages

{
"operation":"split",
"pdfUrl":"https://example.com/large-document.pdf",
"splitMode":"each_page"
}

Split By Page Ranges

{
"operation":"split",
"pdfUrl":"https://example.com/document.pdf",
"splitMode":"ranges",
"ranges":["1-10","11-20","21-30"]
}

Extract Specific Pages

{
"operation":"split",
"pdfUrl":"https://example.com/document.pdf",
"splitMode":"extract",
"pages":[1,5,10,15]
}

Merge Multiple PDFs

{
"operation":"merge",
"pdfUrls":[
"https://example.com/part1.pdf",
"https://example.com/part2.pdf",
"https://example.com/part3.pdf"
]
}

Merge With Custom Order

{
"operation":"merge",
"pdfUrls":["doc1.pdf","doc2.pdf","doc3.pdf"],
"order":[2,0,1]
}

Compress PDF

{
"operation":"compress",
"pdfUrl":"https://example.com/large-file.pdf",
"compressionPreset":"high"
}

Convert PDF to PNG Images

{
"operation":"convert",
"pdfUrl":"https://example.com/document.pdf",
"outputFormat":"png",
"dpi":200,
"quality":95
}

Convert Specific Pages to JPEG

{
"operation":"convert",
"pdfUrl":"https://example.com/document.pdf",
"outputFormat":"jpeg",
"pages":[1,3,5],
"dpi":150
}

OCR - Extract Text from Scanned PDF

{
"operation":"ocr",
"pdfUrl":"https://example.com/scanned-document.pdf",
"language":"eng",
"preprocess":true
}

OCR in French

{
"operation":"ocr",
"pdfUrl":"https://example.com/french-scan.pdf",
"language":"fra"
}

Enhance Scanned Document

{
"operation":"enhance",
"pdfUrl":"https://example.com/old-scan.pdf",
"sharpen":true,
"denoise":true,
"contrast":1.3,
"brightness":1.1
}

Generate Thumbnail

{
"operation":"thumbnail",
"pdfUrl":"https://example.com/document.pdf",
"thumbnailWidth":300,
"outputFormat":"png"
}

Remove Pages

{
"operation":"merge",
"pdfUrl":"https://example.com/document.pdf",
"pagesToRemove":[2,5,8]
}

Reorder Pages

{
"operation":"merge",
"pdfUrl":"https://example.com/document.pdf",
"newPageOrder":[4,3,2,1,5,6]
}

Output

Results are saved to the run's Key-Value Store for easy download:

OperationOutput Files
Splitpage_001.pdf, page_002.pdf, ... or pages_1-10.pdf, etc.
Mergemerged.pdf
Compresscompressed.pdf
Convertpage_001.png, page_002.png, ...
OCRextracted_text.txt + Dataset with per-page results
Enhanceenhanced.pdf
Thumbnailthumbnail.png

Sample Output

{
"operation":"compress",
"preset":"high",
"pageCount":25,
"originalSize":"4.5 MB",
"compressedSize":"1.2 MB",
"compressionRatio":"73.3%",
"outputKey":"compressed.pdf"
}

Supported Languages for OCR

CodeLanguage
engEnglish
fraFrench
deuGerman
spaSpanish
itaItalian
porPortuguese
nldDutch
polPolish
rusRussian
chi_simChinese (Simplified)
chi_traChinese (Traditional)
jpnJapanese
korKorean
araArabic

Compression Presets

PresetImage QualityBest For
low90%Archives, legal documents
medium75%General use, email
high50%Web uploads, storage saving
screen60%On-screen viewing
print85%Print-quality output

Pricing

EventPriceDescription
pdf-loaded$0.005Each PDF loaded from URL or base64
page-enhanced$0.01Each page enhanced (sharpen, denoise)
page-processed$0.002Each page processed (split, merge, compress)
ocr-page$0.02Each page with OCR text extraction
pdf-compressed$0.01PDF compression completed
page-converted$0.005Each page converted to image
pdf-merged$0.01PDF merge operation completed
metadata-extracted$0.005PDF info/metadata extraction
text-extracted$0.005Text extraction completed

Use Cases

  • Invoice Processing - Extract data from scanned invoices using OCR
  • Document Splitting - Break down large reports into chapters
  • PDF Compression - Reduce file size for email attachments
  • Image Generation - Create thumbnails for document previews
  • Document Merging - Combine multiple contracts into one file
  • Archival - Enhance and OCR historical scanned documents
  • Web Publishing - Convert PDF pages to web-friendly images
  • Data Extraction - Pull text from non-searchable PDFs

You might also like

Image to Text (OCR) — Extract Text from Screenshots & Photos

junipr/image-to-text

Extract text from images using Tesseract.js OCR engine. Supports 100+ languages, PDFs, and bulk image processing.

Парсер Wildberries: товары, цены и продавцы

tugelbay/wb-seller-intel

Парсер Wildberries для анализа товаров, цен, скидок, остатков и продавцов. Поиск по русским запросам или nmId, supplier ID, рейтинг, отзывы, цены в рублях. Guide: https://konabayev.com/tools/wb-seller-intel/?utm_source=apify_info&utm_medium=referral&utm_campaign=wb-seller-intel

👁 User avatar

Tugelbay Konabayev

45

Парсер карточки товара Вайлдберриз (Wildberries)

akoinc/wb-card-parser

Полчайте данные из карточек товаров Вайлдберриз(Wildberries) за мгновение.

177

5.0

Wildberries Product Search Scraper

stealth_mode/wildberries-product-search-scraper

Scrape product listings from Wildberries.ru, Russia's largest online marketplace. Extract prices, ratings, supplier info, stock levels, and product specifications from search and category pages. Ideal for market research, price monitoring, and competitive analysis in the Russian e-commerce market.

👁 User avatar

Stealth mode

56

Wildberries Product Scraper 🔍🛒 - Cheap дешёвый  Вайлдберриз

scrapestorm/wildberries-product-scraper---cheap-deshyovyy-vayldberriz

Easily collect product listings from Wildberries  ягодки, one of Russia’s largest e-commerce platforms. Simply enter a keyword to extract key details like name, brand, price, discount, availability & images 🔍 Seamlessly integrate with ur analytics tools for price tracking or competitor analysis!⚡📊

👁 User avatar

Storm_Scraper

59

5.0

Wildberries Products Search Scraper

powerai/wildberries-products-search-scraper

Scrape products from Wildberries.ru with automatic pagination and comprehensive product data extraction including prices, ratings, reviews, and promotions.

181

1.0

Image To Text Ai

welcoming_fireplace/image-to-text-ai

A powerful OCR tool that goes beyond standard text extraction. Powered by a Premium Vision AI model, it accurately reads handwriting, preserves table structures, and converts messy receipts or documents into structured JSON or Markdown. Supports batch processing for high-volume workflows.

👁 User avatar

Richmond Nkrumah

41

LinkedIn Ads Scraper

simpleapi/linkedin-ads-scraper

LinkedIn Ads Scraper extracts LinkedIn ad data, including creatives, headlines, descriptions, CTAs, advertiser info, formats, and impression insights. Ideal for ad research, competitive analysis, campaign tracking, and automating structured LinkedIn advertising data collection.

PDF Tools (Merge / Split / Compress / OCR / Watermark)

mrkrokko/pdf-tools

All-in-one PDF processor: merge multiple PDFs, split by page ranges, compress file size, extract text, OCR scanned documents (Tesseract), add text watermarks, rotate pages, and read metadata. Accepts PDF URLs or Key-Value Store keys.

Image OCR Scraper

seemuapps/image-ocr-scraper

Extract text from any image. Bulk OCR for screenshots, scanned documents, receipts, signs, and photos. Supports 109 languages and outputs clean Markdown or structured JSON with bounding boxes.