VOOZH about

URL: https://apify.com/ayeeyee/website-content-crawler

⇱ Website Content Crawler Β· Apify


Pricing

from $0.50 / 1,000 results

Go to Apify Store

Website Content Crawler

Full website crawling

Pricing

from $0.50 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Virtual Footprint LLC

Virtual Footprint LLC

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

3 days ago

Last modified

Share

Extract web content and document-derived structured data with Website Content Crawler for knowledge workflows.

Features

  • Collect structured content and metadata fields
  • Support direct URL and query-driven extraction modes
  • Return normalized records suitable for RAG and analytics
  • Batch-friendly processing for multiple sources
  • Designed for automated content pipelines
  • Output is optimized for Website Content Crawler buyer workflows on Apify

Common Use Cases

  • Content intelligence
  • Article monitoring
  • Knowledge base ingestion
  • Research workflows
  • Data enrichment
  • Internal reporting

Example Input

{
"query":"market research",
"queries":[
"market research"
],
"urls":[
"https://www.example.org"
],
"maxResults":25,
"includeRaw":false,
"maxCostPerRun":5
}

Example Output

{
"query":"market research",
"url":"https://www.example.org",
"actorSlug":"record value",
"source":"record value",
"title":"record value",
"description":"record value",
"scrapedAt":"record value"
}

Input Parameters

FieldTypeRequiredDescription
querystringNoPrimary keyword, URL, profile, company, product, or identifier to collect
queriesarrayNoOptional batch list of query strings. Used when query is empty or when batching is…
urlsarrayNoOptional direct URLs to process. These take priority over discovery when provided
maxResultsintegerNoMaximum number of dataset items to emit
includeRawbooleanNoInclude collection diagnostics and raw source metadata where available
maxCostPerRunnumberNoOptional guardrail in USD. The actor caps output before exceeding this amount
proxyConfigurationobjectNoApify proxy settings for production runs

Output Fields

FieldTypeDescription
querystringNormalized query value
urlstringNormalized url value
actorSlugstringNormalized actorSlug value
sourcestringNormalized source value
titlestringNormalized title value
descriptionstringNormalized description value
scrapedAtstringNormalized scrapedAt value
runIdstringNormalized runId value
rankstringNormalized rank value
contentstringNormalized content value
summarystringNormalized summary value
authorstringNormalized author value

Export Formats

  • JSON
  • CSV
  • Excel
  • XML
  • RSS

Pricing

Pricing Model: PAY_PER_EVENT

$3.00 per 1,000 dataset items.

FAQ

Does this actor support batch processing?

Yes.

Can I export results to CSV?

Yes.

Can I schedule runs?

Yes, through Apify schedules.

Can I run this actor via API?

Yes, via the Apify API.

Does it support direct URLs?

Yes.

Can I integrate this actor with n8n or Make?

Yes.

You might also like

Website Content Crawler

rupom888/website-content-crawler

AI Website Content Crawler

ilborso/ai-website-content-crawler

A super fast website crawler for Agentic AI integration

πŸ‘ User avatar

Fabio Borsotti

6

5.0

Website Content Crawler API - Markdown for RAG

tugelbay/website-content-crawler

Crawl public websites and extract clean Markdown, text, or HTML for RAG pipelines, AI agents, documentation indexing, and content monitoring. Guide: https://konabayev.com/tools/website-content-crawler/?utm_source=apify_info&utm_medium=referral&utm_campaign=website-content-crawler

πŸ‘ User avatar

Tugelbay Konabayev

26

Website Content Crawler Fast

timelody/website-content-crawler-fast

Scraping data from every single web page.

Website Contacts Crawler

quaking_pail/contact-crawler

Scrap website searching for contact details, emails and phone numbers

Related articles

What is a vector database?
Read more