VOOZH about

URL: https://apify.com/eliai/webpage-tables-extractor

⇱ Webpage Tables Extractor Β· Apify


Pricing

$20.00 / 1,000 tables extracteds

Go to Apify Store

Webpage Tables Extractor

Extract every HTML <table> from a page into clean JSON arrays (headers + rows) β€” feed spreadsheets straight to an agent or pipeline.

Pricing

$20.00 / 1,000 tables extracteds

Rating

0.0

(0)

Developer

πŸ‘ Anthony Snider

Anthony Snider

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Share

Turn any webpage's HTML <table>s into clean, structured JSON β€” headers and rows ready for a spreadsheet, an LLM, or a data pipeline.

Live on the Apify Store β€” run it instantly or call it as an agent tool via Apify MCP.

What you get

  • Every real data <table> on the page, parsed to JSON.
  • Each table: index, headers, rowCount, and rows (objects keyed by header, falling back to column index).
  • Layout/spacer tables (single column or fewer than 2 rows) are automatically skipped.
  • Loose colspan handling so cells stay aligned with headers.
  • Single URL or bulk URLs in one run.

Input

{
"url":"https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
"maxUrls":25
}

Or bulk:

{
"urls":[
"https://example.com/report-a",
"https://example.com/report-b"
]
}

Output

One dataset item per page:

{
"url":"https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
"tableCount":1,
"tables":[
{
"index":0,
"headers":["Rank","Name","Industry","Revenue (USD millions)"],
"rowCount":50,
"rows":[
{
"Rank":"1",
"Name":"Walmart",
"Industry":"Retail",
"Revenue (USD millions)":"648,125"
}
]
}
]
}

Pricing: pay-per-event β€” charged once per page processed.

You might also like

HTML Table Extractor

automation-lab/html-table-extractor

Extract HTML tables from any webpage into structured JSON. Supports multiple URLs, filtering by CSS selector or table index, auto-header detection, and nested tables. Pure HTTP β€” no proxy needed.

πŸ‘ User avatar

Stas Persiianenko

21

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Json To Excel

zuzka/json-to-excel

Convert your json into a tabular form, such as CSV, Excel or HTML table fast and easy.

πŸ‘ User avatar

Zuzka PelechovΓ‘

55

PDF URL to Markdown, Tables & RAG Extractor

thescrapelab/Apify-PDF-url-scraper

Extract clean Markdown, page text, tables, metadata, summaries, and AI-ready RAG chunks from PDF URLs.