URL: https://apify.com/lukaskrivka/open-source-actors-scraper

Open Source Actors Scraper · Apify


Pricing

Pay per usage

Open Source Actors Scraper

Get all open-source Actors from Apify Store.

Rating

0.0

(0)

Developer

πŸ‘ LukΓ‘Ε‘ KΕ™ivka

LukΓ‘Ε‘ KΕ™ivka

Maintained by Community

Actor stats

1

Bookmarked

24

Total users

0

Monthly active users

2 months ago

Last modified

TypeScript Crawlee & CheerioCrawler template

A template example built with Crawlee to scrape data from a website using Cheerio wrapped into CheerioCrawler.

Included features

  • Apify SDK - toolkit for building Actors
  • Crawlee - web scraping and browser automation library
  • Input schema - define and easily validate a schema for your Actor's input
  • Dataset - store structured data where each object stored has the same attributes
  • Cheerio - a fast, flexible & elegant library for parsing and manipulating HTML and XML
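
As an illustration of the input handling listed above, the sketch below mirrors the two input fields this README names under How it works (startUrls and maxPagesPerCrawl) as a TypeScript type and applies defaults. The `resolveInput` helper, the default URL, and the default limit of 100 are assumptions made for this example and are not taken from the template; on the platform, validation is done against the Actor's input schema.

```typescript
// Shape of the Actor input, mirroring the two fields named in this README.
interface Input {
    startUrls: { url: string }[];
    maxPagesPerCrawl: number;
}

// Hypothetical defaults, chosen only for this sketch.
const DEFAULT_INPUT: Input = {
    startUrls: [{ url: 'https://crawlee.dev' }],
    maxPagesPerCrawl: 100,
};

// Hypothetical helper: fill in missing fields and reject obviously bad values.
function resolveInput(raw: Partial<Input> | null): Input {
    const input: Input = { ...DEFAULT_INPUT, ...(raw ?? {}) };
    if (!Array.isArray(input.startUrls) || input.startUrls.length === 0) {
        throw new Error('startUrls must contain at least one URL');
    }
    if (!Number.isInteger(input.maxPagesPerCrawl) || input.maxPagesPerCrawl < 1) {
        throw new Error('maxPagesPerCrawl must be a positive integer');
    }
    return input;
}

// In an Actor, the raw value would typically come from `await Actor.getInput<Input>()`.
const resolved = resolveInput({ maxPagesPerCrawl: 10 });
console.log(resolved.startUrls.length, resolved.maxPagesPerCrawl);
```

Keeping the defaults and checks in one small function means the rest of the code can rely on a fully populated `Input` value.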

How it works

This TypeScript script uses Crawlee's CheerioCrawler to crawl a website and extract data from the crawled pages with Cheerio. It then stores the page titles in a dataset.

  • The crawler starts with the URLs provided in the startUrls input field defined by the input schema. The number of scraped pages is limited by the maxPagesPerCrawl field from the input schema.
  • The crawler runs requestHandler for each URL to extract data from the page with the Cheerio library and to save the title and URL of each page to the dataset. It also logs each result as it is saved.
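
The handler step described above can be sketched as a plain function. To keep the example runnable without installing Crawlee, `PageContext` is a hand-written subset of the context object that CheerioCrawler passes to requestHandler; that type, `extractResult`, and `handlePage` are names introduced here for illustration, and the actual template code may differ.

```typescript
// Assumed subset of the context Crawlee passes to requestHandler.
interface PageContext {
    request: { url: string; loadedUrl?: string };
    $: (selector: string) => { text(): string };
    log: { info(message: string, data?: Record<string, unknown>): void };
    pushData(data: Record<string, unknown>): Promise<void>;
}

// One dataset record per page: the same attributes for every item.
type PageResult = { url: string; title: string };

// Pure step: read the <title> text and pair it with the page URL.
function extractResult($: PageContext['$'], request: PageContext['request']): PageResult {
    return { url: request.loadedUrl ?? request.url, title: $('title').text().trim() };
}

// Handler: extract, log, and store one record per page.
async function handlePage({ request, $, log, pushData }: PageContext): Promise<void> {
    const result = extractResult($, request);
    log.info(result.title, { url: result.url });
    await pushData(result);
}

// Wiring in a real Actor (needs the `crawlee` package, so shown as a comment;
// `input` stands for the parsed Actor input and is hypothetical here):
//   const crawler = new CheerioCrawler({
//       maxRequestsPerCrawl: input.maxPagesPerCrawl,
//       requestHandler: handlePage,
//   });
//   await crawler.run(input.startUrls);
```

Separating the pure extraction from the handler makes the title logic easy to check with a stubbed `$` function, without any network access.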

Resources

Getting started

For complete information, see this article. To run the Actor, use the following command:

$ apify run

Deploy to Apify

Connect Git repository to Apify

If you've created a Git repository for the project, you can connect it to Apify:

  1. Go to the Actor creation page
  2. Click the Link Git Repository button

Push the project from your local machine to Apify

You can also deploy the project from your local machine to Apify without a Git repository.

  1. Log in to Apify. You will need to provide your Apify API Token to complete this action.

    $ apify login
  2. Deploy your Actor. This command will deploy and build the Actor on the Apify Platform. You can find your newly created Actor under Actors -> My Actors.

    $ apify push

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:
