VOOZH about

URL: https://apify.com/andok/html-table-extractor

⇱ HTML Table to JSON/CSV Extractor Β· Apify


πŸ‘ HTML Table to JSON/CSV Extractor avatar

HTML Table to JSON/CSV Extractor

Pricing

from $1.00 / 1,000 table extracteds

Go to Apify Store

HTML Table to JSON/CSV Extractor

Convert complex web tables into clean, structured JSON or CSV data. Automate data entry and reporting without writing custom parsers.

Pricing

from $1.00 / 1,000 table extracteds

Rating

0.0

(0)

Developer

πŸ‘ Andok

Andok

Maintained by Community

Actor stats

0

Bookmarked

8

Total users

0

Monthly active users

3 months ago

Last modified

Share

HTML Table Extractor

Pull structured table data from any web page and export it as clean JSON or CSV. No custom scraper needed β€” just provide URLs containing HTML tables and get rows and columns as structured data. Process multiple pages in a single run.

Features

  • Automatic table detection β€” finds all <table> elements on each page
  • Header recognition β€” detects <th> headers or uses the first row as column names
  • Bulk processing β€” extract tables from multiple URLs in one run
  • Clean JSON output β€” each table row becomes a structured object with named fields
  • Configurable concurrency β€” process 1 to 50 URLs in parallel

Input

FieldTypeRequiredDefaultDescription
urlsarrayNoβ€”List of webpage URLs to extract tables from
urlstringNoβ€”Single URL for backwards compatibility (use urls for bulk)
timeoutSecondsintegerNo15Maximum seconds to wait for each URL response
concurrencyintegerNo10Number of URLs to process in parallel (1-50)

Input Example

{
"urls":[
"https://en.wikipedia.org/wiki/List_of_countries_by_population_(United_Nations)"
]
}

Output

Each URL produces one dataset item containing all tables found on the page.

Key output fields:

  • inputUrl (string) β€” the original URL provided
  • finalUrl (string) β€” the URL after following redirects
  • status (number) β€” HTTP status code
  • tableCount (number) β€” number of tables found on the page
  • tables (array) β€” array of table objects, each containing headers and rows
  • error (string) β€” error message if extraction failed, otherwise null
  • checkedAt (string) β€” ISO 8601 timestamp

Output Example

{
"inputUrl":"https://en.wikipedia.org/wiki/List_of_countries_by_population_(United_Nations)",
"finalUrl":"https://en.wikipedia.org/wiki/List_of_countries_by_population_(United_Nations)",
"status":200,
"tableCount":2,
"tables":[
{
"headers":["Rank","Country","Population"],
"rows":[
["1","India","1,450,935,791"],
["2","China","1,419,321,278"]
]
}
],
"error":null,
"checkedAt":"2025-01-15T10:30:00.000Z"
}

Pricing

EventCost
Table ExtractedPay-per-event (see actor pricing page)

Use Cases

  • Data collection β€” grab financial data, sports stats, or product specs from web pages without writing a scraper
  • Spreadsheet import β€” convert HTML tables to CSV or JSON for import into Excel or Google Sheets
  • Research automation β€” extract tabular data from Wikipedia, government sites, or academic pages
  • Price monitoring β€” pull pricing tables from competitor websites

Related Actors

ActorWhat it adds
Web Page to Markdown Converter for LLMsConvert full page content to Markdown including tables
JSON-LD Schema ExtractorExtract structured data from Schema.org markup

You might also like

HTML Table Extractor

automation-lab/html-table-extractor

Extract HTML tables from any webpage into structured JSON. Supports multiple URLs, filtering by CSS selector or table index, auto-header detection, and nested tables. Pure HTTP β€” no proxy needed.

πŸ‘ User avatar

Stas Persiianenko

19

HTML to JSON Smart Parser

parseforge/html-to-json-smart-parser

Convert HTML to structured JSON using AI! Uses OpenAI to extract and structure data from HTML into clean JSON format. Perfect for developers and data analysts who need to transform HTML into structured data without manual parsing.

40

5.0

Json To Excel

zuzka/json-to-excel

Convert your json into a tabular form, such as CSV, Excel or HTML table fast and easy.

πŸ‘ User avatar

Zuzka PelechovΓ‘

55

JSON to CSV Converter

eloquent_mountain/json-to-csv-converter

JSON to CSV Converter. Effortlessly transform JSON data into CSV with our Apify actor. Handle nested structures, expand lists into rows, and customize separators and delimiters. Input via URL or paste JSON text. Ideal for data analysis and reporting!

HTML to Markdown

web.harvester/html-to-markdown

Convert HTML to clean Markdown. Supports GFM tables, code blocks, and custom rules. Perfect for content migration and documentation.

3

Data Converter β€” JSON, CSV & XML

accurate_pouch/data-converter

Convert between JSON, CSV, and XML formats in bulk. JSON to CSV, CSV to JSON, JSON to XML, XML to JSON. Handles quoted fields, nested objects. $0.003/conversion.

πŸ‘ User avatar

Manchitt Sanan

2

Universal Web Scraper & Data Extractor – Fast No-Code Tool

motivational_nickel/my-actor

Universal web scraper that extracts structured data from almost any website. Detect and scrape webpage content into clean datasets (CSV, Excel, JSON) without coding. Ideal for web scraping, research, lead generation, automation pipelines, and large-scale data extraction.

πŸ‘ User avatar

Leoncio Jr Coronado

54

5.0

XML to JSON Converter

eloquent_mountain/xml-to-json-converter

XML to JSON Converter Convert any XML file to JSON effortlessly with this Apify actor. Handle complex XML structures and transform them into structured JSON data. Supports input via URL or direct text paste, making it easy to integrate into your data processing workflows.

PDF To JSON Parser

parseforge/pdf-to-json-parser

Convert PDF documents into structured JSON using AI-powered OCR and smart data extraction. The Actor processes every page to ensure complete coverage, then identifies text, fields, tables, and key details, delivering clean, organized JSON ready for automation or analysis.

56

5.0

Related articles

How to parse JSON with Python
Read more