VOOZH about

URL: https://products.groupdocs.cloud/parser/curl

⇱ Parse Documents and Extract Data with cURL Commands


👁 GroupDocs.Parser Cloud for cURL
GroupDocs.Parser
Cloud for cURL

GitHub Learn Buy

GroupDocs.Parser Cloud for cURL allows parsing and data extraction for almost all of the common business document formats. Using simple cURL commands; text and images can be extracted from the whole document or from the specific pages range. REST API also supports archives like ZIP to be parsed and extract images, document information and the details of the container items.

  • At a
    Glance
  • Supported
    File Formats
  • Platform
    Independence

An overview of the features supported by the document parser Cloud API.‎

  • Parse documents
  • Parse document by template
  • Extract text
  • Extract images
  • Extract metadata
  • Emails
  • eBooks
  • PDF files
  • PDF Portfolio
  • Files within ZIP archives
  • Text and Markup documents
  • Microsoft & OpenDocument

Document parser Cloud API supported formats.

  • Word Processing: DOC, DOCX, DOCM, DOT, DOTX, DOTM
  • Spreadsheets: XLS, XLSX, XLSM, XLSB, XLT, XLTX, XLTM, XLA, XLAM
  • Presentations: PPT, PPTX, PPTM, PPS, PPSX, PPSM, POT, POTX, POTM
  • OneNote: ONE
  • OpenDocument Formats: ODT, OTT, ODS, OTS, ODP, OTP
  • Text: TXT, RTF
  • Markup: HTML, XHTML, MHTML, MD, XML
  • Portable Formats: PDF
  • Email: PST, OST, EML, EMLX, MSG
  • eBook Formats: EPUB, FB2, CHM
  • Other: NUMBERS, CSV, ZIP

Supported Operating Systems and Frameworks

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Text from a Document

Using GroupDocs.Parser Cloud API, you can quickly extract images, text, and metadata from the document placed in your Cloud storage. Following simple cURL command shows how to extract text from a document.

Parse MS Word Document to Extract Text - cURL

curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
-H "accept: application/json" \
-H "authorization: Bearer TOKEN-HERE" \
-H "Content-Type: application/json" \
-d "{ \"FileInfo\": { \"FilePath\": \"foldername/document.docx\", }}"

Extract images from Documents using GroupDocs.Parser Free App

ODTPSTPPSXPPSMPOTMHTMLXLTMPPTMDOTMFB2DOCTIFFXLSOTPDOTEMLXONECHMXHTMLCSVTXTODPGZTARXLAOTSBZ2OTTOSTXLSBPPTXTIFBMPRTFEPUBMHTMLXLSMPDFPOTXEMLDOCMXMLXLTXXLSXPPSDOCXJPGXLTRARMDXLAMZIPPPTGIFPNGODSMSGDOTXJPEGJP2POT

Extract images from Documents using GroupDocs.Parser Free App

CHMEPUBDOCXHTMLDOCMRARRTFDOCXOSTMHTMLPDFOTPXLSMODTOTTEMLXPPSXPOTXJPGXLAMXMLJPEGJP2PPTMTARFB2XLSXPPSPPTDOTMMDPNGDOTCSVODPXLTMPSTEMLOTSDOTXXLAPPTXONEPOTMMSGBZ2XLTXGZGIFXLSTXTHTMLXLSBTIFFBMPTIFODSPPSMXLTZIPPOT

Support and Learning Resources

Start Free Trial
Download GitHub Repository

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

👁 GroupDocs.Parser Cloud SDK for .NET
GroupDocs.Parser
Cloud SDK for .NET

👁 GroupDocs.Parser Cloud SDK for Java
GroupDocs.Parser
Cloud SDK for Java

👁 GroupDocs.Parser Cloud SDK for PHP
GroupDocs.Parser
Cloud SDK for PHP

👁 GroupDocs.Parser Cloud SDK for Python
GroupDocs.Parser
Cloud SDK for Python

👁 GroupDocs.Parser Cloud SDK for Ruby
GroupDocs.Parser
Cloud SDK for Ruby

👁 GroupDocs.Parser Cloud SDK for Node.js
GroupDocs.Parser
Cloud SDK for Node.js

7ba92c6