VOOZH about

URL: https://apify.com/logiover/website-link-graph-crawler

⇱ Website Link Graph Crawler β€” Export Links to CSV/JSON Β· Apify


πŸ‘ Website Link Graph & Outbound Links Crawler avatar

Website Link Graph & Outbound Links Crawler

Pricing

from $3.50 / 1,000 results

Go to Apify Store

Website Link Graph & Outbound Links Crawler

Extract all links from a website to CSV/JSON. Maps internal & outbound link graph with anchor text + nofollow/rel flags. No API, no login.

Pricing

from $3.50 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

5

Total users

4

Monthly active users

4 days ago

Last modified

Share

Website Link Graph & Outbound Links Crawler β€” Link Extractor πŸ•ΈοΈ

Extract every link from a website and map its link graph. This link extractor crawls an entire site and exports every internal and outbound link as a graph edge β€” the source page, the target URL, the anchor text, the link type (internal / subdomain / external) and the rel flags (nofollow, sponsored, ugc). Map a whole site's internal linking and outbound links from a single start URL. No login, no headless browser.

Need to extract all links from a website, run an internal linking SEO audit, analyze outbound / external links, or check nofollow attributes at scale? This actor builds the complete link graph.


✨ Key features

  • πŸ•·οΈ Full-site crawl β€” start from one URL and follow internal links across the whole domain.
  • πŸ”— Every link as an edge β€” source β†’ target with anchor text, for internal and external links.
  • 🏷️ Rel & attribute parsing β€” nofollow, sponsored, ugc flags and target="_blank".
  • 🎯 Scope control β€” export all links, only internal, or only outbound/external links.
  • ⚑ Faster & higher-volume than a link checker β€” it maps the graph without HTTP status checks, so it's ideal for big sites.

πŸ’‘ Use cases

  • Internal-linking SEO audits β€” see which pages link where and with what anchor text to optimize link equity.
  • Outbound / external link analysis β€” list every external site you link to and which links are nofollow.
  • Site-structure mapping β€” build a link graph for visualization, analysis or crawl-budget optimization.
  • Link cleanup & compliance β€” find sponsored/UGC links that should carry the right rel attributes.
  • Anchor-text analysis β€” audit the anchor text distribution across an entire site.

πŸ“¦ What you get

One row per link (edge):

FieldDescription
sourceUrlThe page the link is on
targetUrlThe link target (absolute URL)
targetDomainHostname of the target
anchorTextThe clickable text
linkTypeinternal, subdomain or external
relRaw rel attribute
isNofollow / isUgc / isSponsoredParsed rel flags
opensNewTabtrue if target="_blank"
crawledAtISO 8601 timestamp

Example output

{
"sourceUrl":"https://example.com/blog/seo-guide",
"targetUrl":"https://partner.com/tool",
"targetDomain":"partner.com",
"anchorText":"our favorite SEO tool",
"linkType":"external",
"rel":"nofollow sponsored",
"isNofollow":true,
"isSponsored":true,
"isUgc":false,
"opensNewTab":true,
"crawledAt":"2026-05-25T14:17:17.531Z"
}

πŸš€ How to use it

  1. Click Try for free / Start.
  2. Paste one or more website URLs into Start URLs.
  3. (Optional) Set Max pages to crawl β€” 0 for the whole site.
  4. (Optional) Choose Which links to export: all / internal / external.
  5. Click Save & Start.
  6. Export the link graph as JSON, CSV, Excel or via API.

βš™οΈ Input

OptionDescriptionDefault
startUrlsWebsites to crawl– (required)
maxPagesToCrawlMax pages per run (0 = whole site)1000
linkScopeall, internal or externalall
maxConcurrencyParallel requests10

Example input

{
"startUrls":[{"url":"https://example.com"}],
"maxPagesToCrawl":5000,
"linkScope":"all"
}

πŸ” How it works

The crawler follows internal links within the same domain as your Start URLs, and on each page records every <a href> as an edge with its anchor text and rel attributes. URLs are resolved to absolute and de-duplicated per page. Use linkScope to keep only internal or only outbound links. Pure HTTP β€” fast and cheap.

🧰 Tips & best practices

  • Use linkScope: "external" for a pure outbound-link/backlink-target report.
  • Use linkScope: "internal" to analyze internal link distribution and orphan-page risk.
  • Pivot on targetDomain to see which external domains you link to most.
  • Set maxPagesToCrawl to 0 for complete coverage of large sites.

❓ FAQ

What's the difference vs the Broken Link Checker? This actor maps the link graph (anchors, rel, internal/external) without checking HTTP status, so it's faster and produces higher volume. Use the Broken Link Checker when you specifically need to find dead links and status codes.

Does it detect nofollow / sponsored / ugc? Yes β€” the rel attribute is parsed into boolean flags.

Does it render JavaScript? No β€” it parses server-rendered HTML for speed and low cost.

How do I crawl the whole site? Set maxPagesToCrawl to 0.

What formats can I export? JSON, CSV, Excel, HTML and a full REST API.

How do I extract all links from a website to CSV or JSON?

Paste your start URL, run the crawl, then export the link graph as CSV, JSON, Excel or via API β€” one row per link with source page, target URL, anchor text and rel flags.

Can I scrape website links without an API or browser?

Yes. There is no third-party API or login required, and it uses pure HTTP (no headless browser), so it extracts internal and outbound links fast and cheap.

Is this an internal linking audit and outbound link analysis tool?

Yes. Use linkScope: "internal" for an internal-linking SEO audit, or linkScope: "external" for an outbound/external link analysis report grouped by targetDomain.

Does it extract anchor text and nofollow links?

Yes. Every edge records its anchor text and parses the rel attribute into nofollow, sponsored and ugc boolean flags.

πŸ”— Related actors by the same author

  • Broken Link Checker β€” crawl a site and find dead links with HTTP status codes.
  • Website SEO Audit Crawler β€” full on-page SEO audit for every page.
  • Sitemap to URL Crawler β€” extract all URLs from any sitemap.xml.
  • Website to Markdown & Text Crawler β€” clean text + Markdown for AI / RAG.

πŸ“ Changelog

2026-06-15

  • Reliability pass: re-verified end-to-end on live data with real-world inputs. Routine maintenance build.

2026-06-07

  • Docs: added coverage for exporting website links to CSV/JSON, scraping links without an API or browser, and internal-linking / outbound link audits.

2026-06-05

  • πŸ›‘οΈ Reliability fix: results are no longer dropped by strict output validation β€” runs now complete cleanly even at high volume (thousands of results).
  • ⚑ Stability & performance hardening; fresh rebuild.

2026-06-04

  • Verified live & refreshed build β€” reliability/maintenance pass.

You might also like

Link Extractor

automation-lab/link-extractor

This actor extracts all hyperlinks from web pages. For each link, it captures the anchor text, href, rel attributes (nofollow, ugc, sponsored), target attribute, and classifies links as internal or external. It also detects the link's location in the page (nav, header, footer, main content,...

πŸ‘ User avatar

Stas Persiianenko

67

Nofollow Link Audit

zerobreak/nofollow-link-audit

Nofollow link audit tool that scans any webpage and reports every link's rel attribute, whether dofollow, nofollow, sponsored, or UGC, so SEO teams can check link types without reading raw page source.

Website Link Extractor β€” List All URLs from Any Page

maged120/get-urls

Extract all links from any web page. Returns every URL found with anchor text and link type β€” useful for quick link audits, competitor research, or sitemap building.

Broken Link Checker

taroyamada/broken-link-checker

Crawl supplied websites to find dead internal and outbound links with status codes, anchor context, redirect hints, and source pages.

Website URL Crawler & Link Extractor

maximedupre/website-url-crawler

Crawl JavaScript-rendered websites and export a URL link map. Get source pages, depth, anchor text, link type, HTTP metadata, and crawl status.

πŸ‘ User avatar

Maxime DuprΓ©

4

Website Links Graph Generator

crawlerbros/web-link-graph-visualizer

Creates an oriented graph visualizing links between webpages. Outputs: graph.png (visual network diagram) and graph.json (structured data) saved to Key-Value Store, plus detailed dataset of all crawled pages. Configure depth, boundaries, and layout.

45

5.0

Link Relationship Analyzer

zerobreak/link-relationship-analyzer

Link relationship analyzer that crawls any website and maps all internal and external links, so SEO teams can audit anchor text, find orphaned pages, and see how a site's pages connect.

Website URL Crawler & Link Extractor

maged120/get-urls-pro

Crawl any website and extract all URLs with full hierarchy β€” depth, parent URL, and anchor text. Supports static and JavaScript-rendered sites. Configurable depth and domain filtering.

Website Content Crawler

rupom888/website-content-crawler

Backlinks Checker

s-r/backlinks-checker

Get the full backlink profile for any domain β€” domain score, referring-domain count, total-link count, and per-link records (origin URL, target URL, anchor text, follow/nofollow, dates).