URL: https://apify.com/styleindexamerica/kr-lottemart-scraper

KR Lottemart Scraper · Apify


Pricing

$9.99/month + usage

KR Lottemart Scraper

This actor is intended to extract data from lotteon.com

Rating

0.0

(0)

Developer

PopinBorder Castnet

Maintained by Community

Actor stats

1

Bookmarked

3

Total users

0

Monthly active users

11 days ago

Last modified

TypeScript PuppeteerCrawler Actor template

This template is a production-ready boilerplate for developing with PuppeteerCrawler. PuppeteerCrawler provides a simple framework for crawling web pages in parallel using headless Chrome with Puppeteer. Because it loads pages in headless Chrome before extracting data, it is useful for crawling websites that require JavaScript execution.

Included features

  • Puppeteer Crawler - simple framework for parallel crawling of web pages using headless Chrome with Puppeteer
  • Configurable Proxy - tool for working around IP blocking
  • Input schema - define and easily validate a schema for your Actor's input
  • Dataset - store structured data where each object stored has the same attributes
  • Apify SDK - toolkit for building Actors
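
As a rough illustration of the input validation mentioned above, the following sketch checks a raw input object at runtime before crawling starts. It is a minimal example under stated assumptions: the field name `startUrls` comes from the template text, while the `ActorInput` and `StartUrl` types and the `parseInput` helper are hypothetical names introduced here, not part of the Apify SDK. On the platform, the Actor's input schema is the primary validation mechanism; a check like this would only be an extra safeguard in code.

```typescript
// Hypothetical input shape: only the `startUrls` field name comes from the
// template description; the types and helper below are illustrative.
interface StartUrl {
    url: string;
}

interface ActorInput {
    startUrls: StartUrl[];
}

// Validates an untyped value and returns a typed input, or throws with a
// message that says which part of the input is wrong.
function parseInput(raw: unknown): ActorInput {
    if (typeof raw !== 'object' || raw === null) {
        throw new Error('Input must be an object');
    }
    const { startUrls } = raw as { startUrls?: unknown };
    if (!Array.isArray(startUrls) || startUrls.length === 0) {
        throw new Error('Input must contain a non-empty "startUrls" array');
    }
    const parsed = startUrls.map((item: unknown, index: number): StartUrl => {
        const url = (item as { url?: unknown } | null | undefined)?.url;
        if (typeof url !== 'string' || !/^https?:\/\/\S+$/i.test(url)) {
            throw new Error(`startUrls[${index}].url must be an http(s) URL`);
        }
        return { url };
    });
    return { startUrls: parsed };
}
```

A caller could pass the result of `Actor.getInput()` through `parseInput` and fail fast with a readable message instead of crashing later inside a request handler.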

How it works

  1. Actor.getInput() reads the input from INPUT.json, where the start URLs are defined.

  2. Create a configuration for the proxy servers used during crawling with Actor.createProxyConfiguration() to work around IP blocking. You can use Apify Proxy or your own proxy URLs, which are rotated according to the configuration. You can read more about proxy configuration here.

  3. Create an instance of Crawlee's Puppeteer Crawler with new PuppeteerCrawler(). You can pass options to the crawler constructor, such as:

    • proxyConfiguration - provide the proxy configuration to the crawler
    • requestHandler - handle each request with the custom router defined in the routes.ts file.
  4. Handle requests with the custom router from the routes.ts file. Read more about custom routing for the Cheerio Crawler here.

    • Create a new router instance with createPuppeteerRouter()

    • Define default handler that will be called for all URLs that are not handled by other handlers by adding router.addDefaultHandler(() => { ... })

    • Define additional handlers - here you can add your own handling of the page

      router.addHandler('detail', async ({ request, page, log }) => {
          const title = await page.title();
          // You can add your own page handling here
          await Dataset.pushData({
              url: request.loadedUrl,
              title,
          });
      });
  5. crawler.run(startUrls) starts the crawler and waits for it to finish.
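
To make the routing step above more concrete, here is a toy model of label-based dispatch written in plain TypeScript. It is a sketch for illustration only and is not Crawlee's implementation: `MiniRouter`, `Context`, and `demo` are hypothetical names, and the handlers record which route ran instead of opening a browser page. The assumption it encodes is that a request carrying a label with a registered handler goes to that handler, and every other request goes to the default handler.

```typescript
// Toy model of label-based routing. All names here are hypothetical and
// exist only to illustrate the dispatch idea described in step 4.
type Context = { url: string; label?: string };
type Handler = (ctx: Context) => Promise<void>;

class MiniRouter {
    private handlers = new Map<string, Handler>();
    private defaultHandler: Handler | undefined;

    // Register a handler for requests carrying a specific label.
    addHandler(label: string, handler: Handler): void {
        this.handlers.set(label, handler);
    }

    // Register the fallback used when no labeled handler matches.
    addDefaultHandler(handler: Handler): void {
        this.defaultHandler = handler;
    }

    // Pick the labeled handler if one exists, otherwise the default one.
    async handle(ctx: Context): Promise<void> {
        const labeled = ctx.label !== undefined ? this.handlers.get(ctx.label) : undefined;
        const handler = labeled ?? this.defaultHandler;
        if (!handler) {
            throw new Error(`No handler registered for ${ctx.url}`);
        }
        await handler(ctx);
    }
}

// Runs two requests through the router and returns which route handled each.
async function demo(): Promise<string[]> {
    const visited: string[] = [];
    const router = new MiniRouter();
    router.addDefaultHandler(async ({ url }) => {
        visited.push(`default:${url}`);
    });
    router.addHandler('detail', async ({ url }) => {
        visited.push(`detail:${url}`);
    });
    await router.handle({ url: 'https://example.com/' });
    await router.handle({ url: 'https://example.com/p/1', label: 'detail' });
    return visited;
}
```

In the real template the default handler would typically enqueue links with a label such as 'detail', and the labeled handler would extract data from the page, as in the handler shown in step 4.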

Resources

If you're looking for examples or want to learn more visit:

Getting started

For complete information see this article. To run the Actor use the following command:

$ apify run

Deploy to Apify

Connect Git repository to Apify

If you've created a Git repository for the project, you can easily connect it to Apify:

  1. Go to the Actor creation page
  2. Click the Link Git Repository button

Push project on your local machine to Apify

You can also deploy the project from your local machine to Apify without a Git repository.

  1. Log in to Apify. You will need to provide your Apify API Token to complete this action.

    $ apify login
  2. Deploy your Actor. This command will deploy and build the Actor on the Apify Platform. You can find your newly created Actor under Actors -> My Actors.

    $ apify push

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:

You might also like

KR IKEA Scraper

styleindexamerica/kr-ikea-scraper

This actor is intended to extract data from ikea.com/kr/ko/

PopinBorder Castnet

2

KR Linefriends Scraper

styleindexamerica/kr-linefriends-scraper

This actor is intended to extract data from linefriendssquare.com

PopinBorder Castnet

2

KR Uniqlo Scraper

styleindexamerica/kr-uniqlo-scraper

This actor is intended to extract data from uniqlo.com/kr/ko/

PopinBorder Castnet

3

KR Oliveyoung Scraper

styleindexamerica/kr-oliveyoung-scraper

This actor is intended to extract data from oliveyoung.co.kr

PopinBorder Castnet

11

1.0

KR Mizuno Scraper

styleindexamerica/kr-mizuno-scraper

This actor is intended to extract data from kor.mizuno.com/kr-kr/

PopinBorder Castnet

3

KR Andar Scraper

styleindexamerica/kr-andar-scraper

This actor is intended to extract data from andar.co.kr

PopinBorder Castnet

3

KR Musisna Scraper - Tmall

styleindexamerica/kr-musisna-scraper---tmall

This actor is intended to extract data from musinsa.com

PopinBorder Castnet

3

KR Gmarket Scraper

styleindexamerica/kr-gmarket-scraper

This actor is intended to extract data from gmarket.co.kr

PopinBorder Castnet

6

KR Zigzag Scraper

styleindexamerica/kr-zigzag-scraper

This actor is intended to extract data from zigzag.kr

PopinBorder Castnet

2

KR Abcgrandstage Scraper

styleindexamerica/kr-abcgrandstage-scraper

This actor is intended to extract data from grandstage.a-rt.com

PopinBorder Castnet

3