VOOZH about

URL: https://apify.com/quotidian_plum/my-actor

⇱ virtualoplossing scrapper [DEPRECATED] Β· Apify


πŸ‘ virtualoplossing scrapper avatar

virtualoplossing scrapper

Deprecated

Pricing

Pay per usage

Go to Apify Store

virtualoplossing scrapper

Deprecated

virtualoplossing scrapper

Pricing

Pay per usage

Rating

0.0

(0)

Developer

πŸ‘ karanbir

karanbir

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 months ago

Last modified

Share

JavaScript PuppeteerCrawler Actor template

This template is a production ready boilerplate for developing with PuppeteerCrawler. The PuppeteerCrawler provides a simple framework for parallel crawling of web pages using headless Chrome with Puppeteer. Since PuppeteerCrawler uses headless Chrome to download web pages and extract data, it is useful for crawling of websites that require to execute JavaScript.

If you're looking for examples or want to learn more visit:

Included features

  • Puppeteer Crawler - simple framework for parallel crawling of web pages using headless Chrome with Puppeteer
  • Configurable Proxy - tool for working around IP blocking
  • Input schema - define and easily validate a schema for your Actor's input
  • Dataset - store structured data where each object stored has the same attributes
  • Apify SDK - toolkit for building Actors

How it works

  1. Actor.getInput() gets the input from INPUT.json where the start urls are defined

  2. Create a configuration for proxy servers to be used during the crawling with Actor.createProxyConfiguration() to work around IP blocking. Use Apify Proxy or your own Proxy URLs provided and rotated according to the configuration. You can read more about proxy configuration here.

  3. Create an instance of Crawlee's Puppeteer Crawler with new PuppeteerCrawler(). You can pass options to the crawler constructor as:

    • proxyConfiguration - provide the proxy configuration to the crawler
    • requestHandler - handle each request with custom router defined in the routes.js file.
  4. Handle requests with the custom router from routes.js file. Read more about custom routing for the Cheerio Crawler here

    • Create a new router instance with new createPuppeteerRouter()

    • Define default handler that will be called for all URLs that are not handled by other handlers by adding router.addDefaultHandler(() => { ... })

    • Define additional handlers - here you can add your own handling of the page

      router.addHandler('detail',async({ request, page, log })=>{
      const title =await page.title();
      // You can add your own page handling here
      await Dataset.pushData({
      url: request.loadedUrl,
      title,
      });
      });
  5. crawler.run(startUrls); start the crawler and wait for its finish

Resources

If you're looking for examples or want to learn more visit:

Getting started

For complete information see this article. In short, you will:

  1. Build the Actor
  2. Run the Actor

Pull the Actor for local development

If you would like to develop locally, you can pull the existing Actor from Apify console using Apify CLI:

  1. Install apify-cli

    Using Homebrew

    $brew install apify-cli

    Using NPM

    $npm-ginstall apify-cli
  2. Pull the Actor by its unique <ActorId>, which is one of the following:

    • unique name of the Actor to pull (e.g. "apify/hello-world")
    • or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")

    You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both Actor unique name and Actor ID.

    This command will copy the Actor into the current directory on your local machine.

    $apify pull <ActorId>

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:

You might also like

TheFork Restaurant Intelligence Scrapper

jdtpnjtp/thefork-restaurant-scraper-advanced

Comprehensive restaurant intelligence from TheFork platform. Get detailed venue data, pricing, reviews, ratings, and availability across 11 European countries. Enterprise-grade reliability with 99.9 % success rate.

88

4.1

(2)

Meta (Facebook) Ad Scrapper (Using Ad Library URL) (Premium)

scrapeio/meta-facebook-ad-scrapper-using-ad-library-url-premium

Scrape every Meta ad (Facebook, Instagram, Messenger, Audience Network) from the Ad Library by keyword, Facebook Page ID, or Ad Library URL. Sort by total impressions or most recent. Extract creatives, copy, CTAs, impressions, and spend ranges. 100+ languages. No Meta Graph API or developer app.

905

4.8

(7)

πŸš€ Instagram Followers and Following Scrapper

figue/instagram-followers-and-following-scrapper

Extract Instagram followers and following data from any Instagram profile! High-speed bot extracts ~500 followers/min, supports proxies, simulates human behavior, and provides configurable pagination. Collect detailed user data efficientlyβ€”perfect for market research and audience insights!

2.1K

2.4

(16)

Linkedin Jobs Scrapper πŸ”πŸ”₯

scrapeverse/linkedin-jobs-scrapper

Get detailed and insightful data just by entering few information, No need to worry about Cookies and Authentication.

651

1.5

(2)

LinkedIn Profile Scrapper

zerobreak/linkedin-profile-scrapper

Fast LinkedIn Profile Scraper that extracts detailed profile data including work experience, education, skills, connections, and more. Ideal for recruitment, sales prospecting, market research, and bulk LinkedIn profile analysis.

205

5.0

(1)

LinkedIn People Search Scrapper

zerobreak/linkedin-people-search-scrapper

A powerful LinkedIn People Search Scraper that finds accurate profiles by name and company. Ideal for recruitment, sales, and research, it delivers clean, structured data with a fast two-step search system and easy JSON input. Perfect for bulk LinkedIn people lookup and profile extraction.

102

5.0

(1)

Youtube Channel Scrapper

deeper/youtube-channel-scrapper

Scrape YouTube Channel pages from page URL. There is not much configuration as it uses initial data available on page load. You'll get video Id, title, descriptions, #likes, #comments, #views and similar information for channels. Feel free to try it out with our default settings by hitting β–· Start.

πŸ‘ User avatar

Deeper Scrapper

745

3.1

(3)

βœ… CHEAP GOOGLE NEWS SCRAPPER βœ…

shoya/cheap-google-news-scrapper

Extract news articles from Google News with unlimited keywords, custom location, language, and time period filters. Supports advanced search operators, topic-based scraping, and automatic deduplication. One of the most affordable Google News scrapers on Apify optimized for speed and cost efficiency.

Instagram Followers And Following Scrapper

scrapier/instagram-followers-and-following-scrapper

πŸ” Instagram Followers & Following Scrapper (Scraper) pulls public profiles, usernames, bios & counts. πŸ“Š Export CSV/JSON for audience insights, competitor research & growth. ⚑ Fast, reliable, rate-limit aware. πŸš€ Perfect for marketers, analysts & researchers.

Reddit Post Scrapper

dead00/reddit-post-scrapper

A Reddit post scraper is a tool or script that automatically collects data from Reddit postsβ€”such as titles, content, comments.