Link Extractor

Pricing

Pay per event

Link Extractor

This actor extracts all hyperlinks from web pages. For each link, it captures the anchor text, href, rel attributes (nofollow, ugc, sponsored), target attribute, and classifies links as internal or external. It also detects the link's location in the page (nav, header, footer, main content,...

Pricing

Pay per event

Rating

0.0

(0)

Developer

👁 Stas Persiianenko

Stas Persiianenko

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

What does Link Extractor do?

This actor extracts all hyperlinks from web pages. For each link, it captures the anchor text, href, rel attributes (nofollow, ugc, sponsored), target attribute, and classifies links as internal or external. It also detects the link's location in the page (nav, header, footer, main content, sidebar). Process hundreds of pages in a single run to build a complete link profile for any website.

Who is it for?

🔍 SEO specialists — extracting all outbound and internal links from web pages for auditing
💻 Web developers — mapping site link structures for migration planning
📊 Market researchers — discovering linked resources and partner networks from competitor sites
🛡️ Security analysts — identifying external link destinations for phishing or malware detection
📝 Content strategists — analyzing link patterns and resource references across web pages

Use cases

SEO specialists -- audit internal and external linking patterns to optimize site architecture and link equity flow
Link builders -- identify nofollow and sponsored link usage on target websites for outreach planning
Content strategists -- understand link distribution across page sections to improve content structure
Migration teams -- extract all links before and after URL changes to verify nothing is broken
Competitive analysts -- discover who competitors link to and how they structure their outbound links

Why use Link Extractor?

Batch processing -- extract links from hundreds of pages in a single run
Detailed link attributes -- captures rel, target, anchor text, and link type for each link
Internal/external classification -- automatically sorts links by domain relationship
Page location detection -- identifies whether links sit in nav, header, footer, sidebar, or main content
Structured JSON output -- machine-readable results ready for analysis or import into SEO tools
Pay-per-event pricing -- cost-effective at scale, starting at fractions of a cent per URL
Fast and lightweight -- HTTP-only requests with no browser overhead, so runs complete quickly

Input parameters

Parameter	Type	Required	Default	Description
`urls`	string[]	Yes	--	List of web page URLs to extract all links from

Example input

{
"urls":[
"https://www.google.com",
"https://en.wikipedia.org/wiki/Web_scraping",
"https://example.com"
]
}

Output example

{
"url":"https://example.com",
"title":"Example Domain",
"totalLinks":1,
"internalLinks":0,
"externalLinks":1,
"nofollowLinks":0,
"uniqueInternalDomains":0,
"uniqueExternalDomains":1,
"links":[
{
"sourceUrl":"https://example.com",
"href":"https://www.iana.org/domains/example",
"anchorText":"More information...",
"isInternal":false,
"isExternal":true,
"isNofollow":false,
"isUgc":false,
"isSponsored":false,
"rel":null,
"target":null,
"linkType":"page",
"location":"body"
}
],
"error":null,
"extractedAt":"2026-03-01T12:00:00.000Z"
}

How to extract links from web pages

Go to Link Extractor on Apify Store.
Enter the URLs you want to extract links from in the urls field.
Click Start and wait for the run to finish.
Download your results as JSON, CSV, or Excel from the Dataset tab.

How much does it cost to extract links?

Event	Price	Description
Start	$0.035	One-time per run
URL extracted	$0.001	Per page processed

Example costs:

10 URLs: $0.035 + 10 x $0.001 = $0.045
100 URLs: $0.035 + 100 x $0.001 = $0.135
1,000 URLs: $0.035 + 1,000 x $0.001 = $1.035

Using the Apify API

Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'YOUR_TOKEN'});
const run =await client.actor('automation-lab/link-extractor').call({
urls:['https://example.com'],
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient
client = ApifyClient('YOUR_TOKEN')
run = client.actor('automation-lab/link-extractor').call(run_input={
'urls':['https://example.com'],
})
items = client.dataset(run['defaultDatasetId']).list_items().items
print(f'Found {items[0]["totalLinks"]} links')

cURL

curl"https://api.apify.com/v2/acts/automation-lab~link-extractor/runs"\
-X POST \
-H"Content-Type: application/json"\
-H"Authorization: Bearer YOUR_TOKEN"\
-d'{"urls": ["https://example.com"]}'

Use with AI agents via MCP

Link Extractor is available as a tool for AI assistants via the Model Context Protocol (MCP).

Setup for Claude Code

$claude mcp add--transport http apify "https://mcp.apify.com?tools=automation-lab/link-extractor"

Setup for Claude Desktop, Cursor, or VS Code

{
"mcpServers":{
"apify":{
"url":"https://mcp.apify.com?tools=automation-lab/link-extractor"
}
}
}

Example prompts

"Extract all links from this webpage"
"Get all outbound links from our homepage"

Learn more in the Apify MCP documentation.

Integrations

Link Extractor integrates with your existing workflow tools through the Apify platform. Connect it to Make (formerly Integromat), Zapier, or n8n to automate link extraction and feed results into SEO tools or spreadsheets. Export link data to Google Sheets for collaborative analysis, send alerts to Slack when new external links appear, or use webhooks to trigger downstream processing whenever a run completes.

Common integration patterns include:

SEO dashboard -- schedule weekly runs and push results to Google Sheets to track link profile changes over time
Link monitoring -- use webhooks to compare new results against previous runs and alert on unexpected external link additions

Tips and best practices

Combine with a sitemap -- feed your sitemap URLs into the actor to get a complete link profile for your entire site.
Filter by rel attribute -- use the isNofollow, isUgc, and isSponsored fields to segment links by their SEO significance.
Check link location -- links in nav and footer carry different SEO weight than links in the main content body.
Run regularly to track link changes over time, especially after content updates or site redesigns.
Export to CSV -- download the dataset as CSV from the Apify Console for easy import into Excel or Google Sheets.

Legality

This tool analyzes publicly accessible web content. Automated analysis of public web resources is standard practice in SEO and web development. Always respect robots.txt directives and rate limits when analyzing third-party websites. For personal data processing, ensure compliance with applicable privacy regulations.

FAQ

What types of links does it extract? It extracts all <a> tag hyperlinks including page links, anchor links, mailto links, and tel links. It captures the href, anchor text, rel attributes, and target attribute for each.

Does it follow links to other pages? No. The actor extracts links from the pages you provide, but does not crawl or follow those links to additional pages. Each URL in the input is processed independently.

Can it detect broken links? No. Link Extractor captures link URLs and attributes but does not check whether the destination URLs return valid responses. Pair it with a dedicated link checker for that purpose.

What does the location field mean? The location field indicates where on the page the link was found -- for example, nav, header, footer, sidebar, or body. This helps you understand the context and SEO weight of each link.

Some links are missing from the results. Why? The actor extracts links from the initial HTML response without running JavaScript. Links injected dynamically by JavaScript frameworks (React, Vue, Angular) after page load will not be captured. Most server-rendered pages and traditional CMS platforms include all links in the HTML source. For JavaScript-heavy single-page applications, consider using a browser-based scraper.

How many URLs can I process in one run? There is no hard limit. The actor processes URLs concurrently, so runs with hundreds or even thousands of URLs complete efficiently.

Other SEO tools

Broken Link Checker -- find broken links and 404 errors across your website
Internal Link Analyzer -- analyze internal link structure and anchor text distribution
Mixed Content Checker -- detect HTTP resources on HTTPS pages
Mobile Friendly Checker -- check mobile-friendliness with viewport and responsive design analysis
SEO Title Checker -- audit page titles for SEO best practices
Website Health Report -- comprehensive website health and performance audit

👁 FindLaw Scraper avatar

FindLaw Scraper

jungle_synthesizer/findlaw-scraper

Scrape attorney and law firm data from FindLaw Lawyer Directory to generate high-quality, targeted legal industry leads

👁 User avatar

BowTiedRaccoon

180

👁 Website URL Extractor - Get All Site URLs avatar

Website URL Extractor - Get All Site URLs

lofomachines/urls-extractor

Extract every URL from any website automatically — no code needed. This URL extractor crawls pages and parses XML sitemaps into one structured list with metadata (lastmod, priority, changefreq). Filter by keyword, cap results, and export to JSON, CSV, or Excel. Built for SEO audits & migrations.

👁 User avatar

Lofomachines

149

5.0

👁 Sitemap Extractor avatar

Sitemap Extractor

apify/sitemap-extractor

This Apify Actor extracts all URLs from a website's sitemaps and checks their status codes via lightweight HTTP requests. It provides a clean list of valid links, acting as an ideal pre-processor to ensure your larger crawling projects target only active URLs.

👁 User avatar

Apify

166

1.7

Actor Builder

handleco-app/actor-builder

👁 User avatar

handleco-app

👁 Nofollow Link Audit avatar

Nofollow Link Audit

zerobreak/nofollow-link-audit

Nofollow link audit tool that scans any webpage and reports every link's rel attribute, whether dofollow, nofollow, sponsored, or UGC, so SEO teams can check link types without reading raw page source.

👁 User avatar

ZeroBreak

👁 Website Tech Profiler avatar

Website Tech Profiler

lofomachines/website-tech-profiler

Advanced technology stack scraper and Wappalyzer alternative. Detect frontend frameworks (React, Vue, Angular), backend technologies, CDN, hosting providers, analytics, advertising scripts, API endpoints, and more. Complete techstack analysis for competitive research and lead generation.

👁 User avatar

Lofomachines

5.0

👁 Website Screenshot Generator avatar

Website Screenshot Generator

crawlerbros/screenshot-url

Capture full-page screenshots of any website as PNG images or PDF documents. Supports custom viewport, scroll-to-bottom, element hiding, and configurable wait conditions.

👁 User avatar

Crawler Bros

151

👁 /llms.txt Generator avatar

/llms.txt Generator

jakub.kopecky/llmstxt-generator

The /llms.txt Generator 🕸️📄 extracts website content to create an llms.txt file for AI apps 🤖✨ like LLM fine-tuning and indexing. Output is available 📥 in the Key-Value Store for easy download and integration into workflows. 🚀

👁 User avatar

Jakub Kopecký

789

3.1

👁 Twitter followers scraper avatar

Twitter followers scraper

curious_coder/twitter-scraper

Scrape followers, following and subscriptions list of any twitter profile

👁 User avatar

Curious Coder

2.7K

1.2

👁 Website Link Graph & Outbound Links Crawler avatar

Website Link Graph & Outbound Links Crawler

logiover/website-link-graph-crawler

Extract all links from a website to CSV/JSON. Maps internal & outbound link graph with anchor text + nofollow/rel flags. No API, no login.

👁 User avatar

Logiover

URL: https://apify.com/automation-lab/link-extractor

⇱ Link Extractor — Extract All Links from Any Web Page · Apify

Link Extractor

What does Link Extractor do?

Who is it for?

Use cases

Why use Link Extractor?

Input parameters

Example input

Output example

How to extract links from web pages

How much does it cost to extract links?

Using the Apify API

Node.js

Python

cURL

Use with AI agents via MCP

Setup for Claude Code

Setup for Claude Desktop, Cursor, or VS Code

Example prompts

Integrations

Tips and best practices

Legality

FAQ

Other SEO tools

You might also like

FindLaw Scraper

Website URL Extractor - Get All Site URLs

Sitemap Extractor

Actor Builder

Nofollow Link Audit

Website Tech Profiler

Website Screenshot Generator

/llms.txt Generator

Twitter followers scraper

Website Link Graph & Outbound Links Crawler