Webpage Tables Extractor

Pricing

$20.00 / 1,000 tables extracted

Extract every HTML <table> from a page into clean JSON (headers plus rows), ready to feed straight into an agent or pipeline.

Developer

Anthony Snider

Maintained by Community

Last modified

5 days ago

What you get

Every real data <table> on the page, parsed to JSON.
Each table: index, headers, rowCount, and rows (objects keyed by header, falling back to column index).
Layout/spacer tables (single column or fewer than 2 rows) are automatically skipped.
Lenient colspan handling, so cells stay aligned with their headers.
Single URL or bulk URLs in one run.
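The rules above can be sketched in a few lines of standard-library Python. This is an illustration, not the Actor's actual code. The names `TableSketch` and `extract_tables` are hypothetical. The sketch assumes the first row holds the headers, that `index` counts only the tables that are kept, and that a spanned cell's text is repeated across the columns it covers. It also assumes flat tables with explicit closing tags; nested tables are not handled.

```python
from html.parser import HTMLParser


class TableSketch(HTMLParser):
    """Collects each <table> as a list of rows; each row is a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.tables = []    # finished tables
        self._rows = None   # rows of the table being read
        self._row = None    # cells of the row being read
        self._cell = None   # text fragments of the cell being read
        self._span = 1      # colspan of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._rows = []
        elif tag == "tr" and self._rows is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []
            span = dict(attrs).get("colspan") or "1"
            self._span = int(span) if span.isdigit() else 1

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            text = " ".join("".join(self._cell).split())
            # Assumption: repeat the text across spanned columns to keep alignment.
            self._row.extend([text] * max(self._span, 1))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._rows.append(self._row)
            self._row = None
        elif tag == "table" and self._rows is not None:
            self.tables.append(self._rows)
            self._rows = None


def extract_tables(html):
    """Return kept tables as dicts with index, headers, rowCount and rows."""
    parser = TableSketch()
    parser.feed(html)
    parser.close()
    result = []
    for rows in parser.tables:
        rows = [r for r in rows if r]
        width = max((len(r) for r in rows), default=0)
        # Skip layout/spacer tables: single column or fewer than 2 rows.
        if len(rows) < 2 or width < 2:
            continue
        header_cells, body = rows[0], rows[1:]
        # Key each column by its header, falling back to the column index.
        keys = [
            (header_cells[i] if i < len(header_cells) else "") or str(i)
            for i in range(width)
        ]
        result.append({
            "index": len(result),
            "headers": keys,
            "rowCount": len(body),
            "rows": [dict(zip(keys, r)) for r in body],
        })
    return result
```

Duplicate header names would overwrite each other in this sketch; a production parser would need to disambiguate them.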

Input

{
  "url": "https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
  "maxUrls": 25
}

Or bulk:

{
  "urls": [
    "https://example.com/report-a",
    "https://example.com/report-b"
  ]
}
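For callers who start runs from code, the two input shapes can be built by one small helper. The sketch below is an assumption-laden illustration: `build_input` and `run_actor` are hypothetical names, the Actor ID is inferred from the listing URL, and `maxUrls` is assumed to be optional and accepted alongside `urls`. The run call uses the `apify-client` package (1.x style `actor(...).call(run_input=...)` and `dataset(...).iterate_items()`); check the client's documentation for the version you install.

```python
ACTOR_ID = "eliai/webpage-tables-extractor"  # assumption: inferred from the listing URL


def build_input(urls, max_urls=None):
    """Return the run input: "url" for one page, "urls" for several."""
    urls = list(urls)
    if not urls:
        raise ValueError("at least one URL is required")
    run_input = {"url": urls[0]} if len(urls) == 1 else {"urls": urls}
    if max_urls is not None:
        # Assumption: maxUrls is optional and valid with either key.
        run_input["maxUrls"] = max_urls
    return run_input


def run_actor(token, run_input):
    """Start a run and return its dataset items (requires `pip install apify-client`)."""
    from apify_client import ApifyClient  # third-party; imported lazily

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call such as `run_actor(token, build_input(["https://example.com/report-a"]))` would then return one item per page, in the shape shown under Output.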

Output

One dataset item per page:

{
  "url": "https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
  "tableCount": 1,
  "tables": [
    {
      "index": 0,
      "headers": ["Rank", "Name", "Industry", "Revenue (USD millions)"],
      "rowCount": 50,
      "rows": [
        {
          "Rank": "1",
          "Name": "Walmart",
          "Industry": "Retail",
          "Revenue (USD millions)": "648,125"
        }
      ]
    }
  ]
}
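Because every cell arrives as a string, downstream code usually has to serialize or convert the values itself. The following is a minimal sketch of both steps for one entry of `tables`; the helper names `table_to_csv` and `to_int` are hypothetical, and the sketch assumes `headers` lists the same keys used in `rows`.

```python
import csv
import io


def table_to_csv(table):
    """Serialize one entry of "tables" to CSV text, using "headers" as column order."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=table["headers"],
        restval="",              # fill cells missing from short rows
        extrasaction="ignore",   # drop keys that are not in "headers"
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(table["rows"])
    return buf.getvalue()


def to_int(cell):
    """Parse a grouped numeric string such as "648,125"; return None if not an integer."""
    try:
        return int(cell.replace(",", "").strip())
    except ValueError:
        return None
```

Values that contain commas, such as the revenue figure above, are quoted by the `csv` module so the column layout survives a round trip.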

Pricing: pay-per-event — charged once per page processed.
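As a rough budgeting aid, the listed rate of $20.00 per 1,000 events works out to $0.02 per event. The sketch below only does that arithmetic; `estimate_cost` is a hypothetical helper, and what counts as one billable event (a page processed, as stated here, or a table extracted, as the price label suggests) should be confirmed on the Actor's pricing tab.

```python
from decimal import Decimal

PRICE_PER_1000_EVENTS = Decimal("20.00")  # USD, from the listing


def estimate_cost(events):
    """Estimated USD charge for a number of billable events, rounded to cents."""
    cost = PRICE_PER_1000_EVENTS * events / 1000
    return cost.quantize(Decimal("0.01"))
```

Under these assumptions, a run that triggers 25 events would cost about $0.50.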

HTML Table Extractor

automation-lab/html-table-extractor

Extract HTML tables from any webpage into structured JSON. Supports multiple URLs, filtering by CSS selector or table index, auto-header detection, and nested tables. Pure HTTP — no proxy needed.

Stas Persiianenko

HTML Table to JSON/CSV Extractor

andok/html-table-extractor

Convert complex web tables into clean, structured JSON or CSV data. Automate data entry and reporting without writing custom parsers.

Andok

Webpage To Clean Markdown

technicaldost/webpage-to-clean-markdown

Technical Dost Solutions

PDF Table Extractor

zentrafoundry/pdf-table-extractor

Transform PDF table extractor inputs into structured rows, clear errors, confidence signals, and automation-ready output.

Zentra

HTML Tables to Markdown (GFM) for RAG & LLMs

awesome_highboy/tableforge

Extract every HTML table from any URL into clean, deterministic GitHub-Flavored Markdown (GFM). Auto-detects headers (or synthesizes col1..N), escapes pipes, collapses whitespace, and stamps each table with an sha256 hash for dedup & idempotency. RAG / embeddings / LLM ready. Same HTML, same output.

Adam

Excel Agent

queueing_jump/excel-agent

Lets you control spreadsheets with plain English. Provide an instruction and an optional .xlsx or .csv file, and the AI agent reads, analyses, transforms, and generates spreadsheets.

Matt Russell

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Scrape Hub

Json To Excel

zuzka/json-to-excel

Convert your JSON into a tabular format such as CSV, Excel, or an HTML table, quickly and easily.

Zuzka Pelechová

Financial Table Extractor for PDFs

dainty_dogfish/okra-financial-table-extractor

Extract annual-report and 10-K table rows from PDF URLs into typed JSON with page, quote, and cell bbox evidence. Runs self-contained on Apify; no Okra API key required.

Steven

PDF URL to Markdown, Tables & RAG Extractor

thescrapelab/Apify-PDF-url-scraper

Extract clean Markdown, page text, tables, metadata, summaries, and AI-ready RAG chunks from PDF URLs.

Inus Grobler

URL: https://apify.com/eliai/webpage-tables-extractor