VOOZH about

URL: https://apify.com/caring_dizi/blog-content-scraper-fixed

⇱ πŸ§ͺHigh-Volume Website Content & Media Scraper Β· Apify


πŸ‘ πŸ§ͺHigh-Volume Website Content & Media Scraper avatar

πŸ§ͺHigh-Volume Website Content & Media Scraper

Pricing

$4.50 / 1,000 results

Go to Apify Store

πŸ§ͺHigh-Volume Website Content & Media Scraper

πŸ§ͺCrawling Done Right! Let me now what you think, what or where or how i can improve my actor, and i am all for constructive criticism. So please message if you have any questions. Enjoy and have a good day.

Pricing

$4.50 / 1,000 results

Rating

5.0

(2)

Developer

πŸ‘ Jeff Halverson

Jeff Halverson

Maintained by Community

Actor stats

6

Bookmarked

148

Total users

6

Monthly active users

18 days ago

Last modified

Share

ALL Social Media/WebScraper

Extract structured content from public social profile pages, article pages, landing pages, and other JavaScript-heavy websites. This actor focuses on turning a page into a clean record of text blocks, metadata, images, video references, and outgoing links.

What it does

  • Opens each public URL in a browser session
  • Extracts the page title and basic metadata
  • Captures article-like text blocks from the page
  • Collects image URLs, embedded video URLs, direct video source URLs, and outbound links
  • Optionally filters Facebook links out of the outbound link list
  • Stores diagnostic screenshots for failed pages

Good fit

  • Public Instagram profile pages
  • Blog articles and news pages
  • Marketing sites and landing pages
  • Content research and competitor monitoring
  • Collecting media/link inventories from public pages

Not a good fit

  • Logged-in or private content
  • Full API-style social scraping for each platform
  • Comments, followers, or hidden profile data
  • Sites that require persistent authenticated sessions

Input example

{
"startUrls":[
{"url":"https://instagram.com/muddlemix_"},
{"url":"https://example.com/blog/example-article"}
],
"includeFacebookLinks":true,
"headless":true,
"maxConcurrency":3,
"requestHandlerTimeoutSecs":90,
"navigationTimeoutSecs":90,
"waitAfterLoadSecs":0.5,
"saveErrorScreenshots":true
}

Output fields

Each dataset item can include:

  • url
  • title
  • meta
  • articles
  • images
  • videos
  • links
  • scraped
  • scrapeTime
  • processingTimeMs
  • contentType
  • error
  • diag
  • status

Notes

  • The default dataset is the main output.
  • Failed pages are still pushed into the dataset with status, error, and optional diagnostic screenshot URL so runs stay debuggable.
  • This actor is best positioned as a public-page media and content extractor, not a full per-platform private-data scraper.

You might also like

Grant & Foundation Opportunities Scraper

scrapepilot/grant-foundation-opportunities-scraper

Scrape grant and funding opportunities from grants.gov, fundsforngos.org, and any grant portal. Extracts 6 fields: grant_id, funder, amount, eligibility, deadline, and link. Exports to JSON, CSV, and Excel. Enable Demo Mode to preview 10 sample records instantly β€” no scraping needed.

Company Employees Scraper

build_matrix/company-employees-scraper

Fetch all employees from a company.

805

4.3

Company Detail Scraper for LinkedIn (No Cookies)

apimaestro/linkedin-company-detail

Extract detailed LinkedIn company data instantly. Get company overview, employee count, locations, funding info, and more. Perfect for market research, lead generation, and competitor analysis. Clean, structured data ready for your business needs.

4.4K

3.2

Company Employees Scraper for LinkedIn | No Cookies

apimaestro/linkedin-company-employees-scraper-no-cookies

Extract LinkedIn company employees without sharing your cookies or account. Get structured data including profile details, job titles, and current positions. No login required.

4.8K

3.8

Find Linkedin Company Page Urls

sbzh/domain-names-or-website-urls-to-linkedin-company-page-urls

Use this tool to retrieve the LinkedIn URLs from websites. Simply enter a list of domain names or website URLs and, when available, retrieve the LinkedIn URL of the company page in the format https://www.linkedin.com/company/...

SAM.gov Scraper - Contracts, Exclusions & Grants

jungle_synthesizer/samgov-scraper

Scrape SAM.gov for federal contract opportunities, exclusion records (debarment list), wage determinations, and assistance listings. No API key required. Filter by keyword, NAICS code, opportunity type, set-aside category, agency, and state.

πŸ‘ User avatar

BowTiedRaccoon

142

1.0

LinkedIn Company Data & Insights Scraper [NO COOKIE]

riceman/linkedin-company-data-insights-scraper

Scrape comprehensive LinkedIn company data & business insights without your LinkedIn account. Extract basic company data, headcount growth, job openings, hiring patterns & employee analytics. Automatic retries, pay-per-use.

Linkedin Company Employees

simpleapi/linkedin-company-employees

LinkedIn Company Employees Scraper extracts employee lists from LinkedIn company pages, including names, roles, locations, experience, and profile URLs. Ideal for recruiting, lead generation, market research, and automating structured employee data collection at scale

Related articles

Your Apify Actor's input schema is its UI. Here's how I design mine after 20+ Actors.
Read more
How I priced an inherited household using eBay data
Read more
The definitive guide to text scraping
Read more