VOOZH about

URL: https://apify.com/bleffoo/economics-calendar-scraper

⇱ Economics Calendar Scraper Β· Apify


Pricing

from $1.00 / 1,000 results

Go to Apify Store

Economics Calendar Scraper

This scraper uses Crawlee with Playwright to extract upcoming weekday economic events.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ bleffoo

bleffoo

Maintained by Community

Actor stats

1

Bookmarked

10

Total users

0

Monthly active users

8 months ago

Last modified

Categories

Share

JavaScript PuppeteerCrawler Actor template

This template is a production ready boilerplate for developing with PuppeteerCrawler. The PuppeteerCrawler provides a simple framework for parallel crawling of web pages using headless Chrome with Puppeteer. Since PuppeteerCrawler uses headless Chrome to download web pages and extract data, it is useful for crawling of websites that require to execute JavaScript.

If you're looking for examples or want to learn more visit:

Included features

  • Puppeteer Crawler - simple framework for parallel crawling of web pages using headless Chrome with Puppeteer
  • Configurable Proxy - tool for working around IP blocking
  • Input schema - define and easily validate a schema for your Actor's input
  • Dataset - store structured data where each object stored has the same attributes
  • Apify SDK - toolkit for building Actors

How it works

  1. Actor.getInput() gets the input from INPUT.json where the start urls are defined

  2. Create a configuration for proxy servers to be used during the crawling with Actor.createProxyConfiguration() to work around IP blocking. Use Apify Proxy or your own Proxy URLs provided and rotated according to the configuration. You can read more about proxy configuration here.

  3. Create an instance of Crawlee's Puppeteer Crawler with new PuppeteerCrawler(). You can pass options to the crawler constructor as:

    • proxyConfiguration - provide the proxy configuration to the crawler
    • requestHandler - handle each request with custom router defined in the routes.js file.
  4. Handle requests with the custom router from routes.js file. Read more about custom routing for the Cheerio Crawler here

    • Create a new router instance with new createPuppeteerRouter()

    • Define default handler that will be called for all URLs that are not handled by other handlers by adding router.addDefaultHandler(() => { ... })

    • Define additional handlers - here you can add your own handling of the page

      router.addHandler('detail',async({ request, page, log })=>{
      const title =await page.title();
      // You can add your own page handling here
      await Dataset.pushData({
      url: request.loadedUrl,
      title,
      });
      });
  5. crawler.run(startUrls); start the crawler and wait for its finish

Resources

If you're looking for examples or want to learn more visit:

Getting started

For complete information see this article. In short, you will:

  1. Build the Actor
  2. Run the Actor

Pull the Actor for local development

If you would like to develop locally, you can pull the existing Actor from Apify console using Apify CLI:

  1. Install apify-cli

    Using Homebrew

    $brew install apify-cli

    Using NPM

    $npm-ginstall apify-cli
  2. Pull the Actor by its unique <ActorId>, which is one of the following:

    • unique name of the Actor to pull (e.g. "apify/hello-world")
    • or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")

    You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both Actor unique name and Actor ID.

    This command will copy the Actor into the current directory on your local machine.

    $apify pull <ActorId>

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:

You might also like

Weekday Product Scraper πŸ›οΈ

easyapi/weekday-product-scraper

A powerful scraper for extracting product data from Weekday's online store. Fetch detailed product information including prices, variants, images, and availability across different categories with advanced pagination handling and proxy support.

Economic Calendar Data (Investing.com)

pintostudio/economic-calendar-data-investing-com

This Apify Actor is designed to extract economic calendar data from Investing.com based on specified filters such as time zone, countries, importance, categories, and date ranges. It's a powerful tool for financial analysts, traders, and anyone needing to stay updated on global economic events.

302

5.0

The Economic Calendar

medh/Economic-Calendar

The economic calendar to explore key global events on the horizon that could subtly shift or substantially shake up the financial markets.

TikTok Scraper

akash9078/tiktok-scraper

Scrape video metadata from TikTok user profile pages using Crawlee and Playwright.

πŸ‘ User avatar

Akash Kumar Naik

5

Economic Calendar Data Scraper

solidcode/economic-calendar-data-scraper

[πŸ’° $8.0 / 1K] Extract economic calendar events from Investing.com β€” event name, country, date, time, importance, and actual/forecast/previous values. Filter by countries, importance, categories, and date range.

FRED Economic Data

dash_authority/fred-economic-data

Search and retrieve economic data series from the Federal Reserve Economic Data (FRED) API. Fetch time series observations for thousands of economic indicators. Categories: economics, financial data, FRED, economic indicators.

πŸ‘ User avatar

Dash Authority

1

My Actor Playwright+Crawlee Template 4 NOVA OK

mpelas/my-actor-playwright-crawlee-template-4-nova-ok

My Actor Playwright+Crawlee Template 4 NOVA

πŸ‘ User avatar

Michalis Paignigiannis

8

Crawlee Scraper

ellustar/my-actor-62

Crawlee Scraper** is a lightweight JavaScript actor for fast and reliable web scraping using Crawlee and Cheerio. It efficiently crawls pages, extracts structured data, and supports scalable, customizable scraping workflows.

Related articles

Crawlee for Python tutorial (ultimate beginner’s guide)
Read more