VOOZH about

URL: https://apify.com/xtracto/thestar-scraper

⇱ The Star (Malaysia) Article Scraper Β· Apify


πŸ‘ The Star (Malaysia) Article Scraper avatar

The Star (Malaysia) Article Scraper

Pricing

from $2.00 / 1,000 results

Go to Apify Store

The Star (Malaysia) Article Scraper

Extract article metadata and available content from thestar.com.my. Extracts intro content visible to non-subscribers. No browser needed - HTTP-only.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Farhan Febrian Nauval

Farhan Febrian Nauval

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

11 days ago

Last modified

Categories

Share

Extract article metadata, author, and available text from any thestar.com.my article URL. The Star is Malaysia's largest English-language newspaper covering national news, business, and lifestyle.

Why Use This Actor?

  • Malaysian news monitoring - track the most-read English-language paper in Malaysia.
  • Regional Southeast Asian coverage - unique perspective on Malaysia, ASEAN, and regional politics.
  • Metadata and headline extraction - useful even under the Piano paywall: author, date, title, and intro always available.

How It Works

This actor uses only HTTP requests - no browser, no Selenium, no Playwright. Articles are extracted in seconds with RAM usage well under 256 MB.

Note: The Star uses a Piano paywall. Only the intro paragraphs (typically 1–3) are freely visible. Author, title, and publication date are always extracted from structured metadata.

Input

{
"url":"https://www.thestar.com.my/news/nation/2026/03/06/interactive-malaysias-score-improves-for-gender-equality-but-we-are-still-worst-in-asean",
"urls":[
"https://www.thestar.com.my/news/nation/2026/04/23/article-one",
"https://www.thestar.com.my/business/business-news/2026/04/22/article-two"
],
"mode":"article",
"limit":10
}

Output

{
"url":"https://www.thestar.com.my/news/nation/2026/05/15/singer-namewee-freed-of-drug-related-charges",
"source":"The Star",
"title":"Singer Namewee freed of drug-related charges",
"description":"",
"content":"KUALA LUMPUR: Singer Wee Meng Chee, better known as Namewee, has been acquitted by the Magistrate’s Court of two drug-related charges. Magistrate Khairunnisak Hassni granted a discharge and acquittal after the prosecution informed the court that the Attorney General’s Chambers had accepted Namewee’s second representation to drop the charges. His first representation, submitted last month, was rejected....",
"image":"https://apicms.thestar.com.my/uploads/images/2026/05/15/3908737.jpg",
"language":"en_GB",
"word_count":116,
"published_date":"2026-05-14T16:00:00.000Z",
"modified_date":"2026-05-15T01:31:16.000Z",
"authors":[
"The Star Online"
],
"categories":"",
"tags":""
}

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs and titles from The Star news section instead of extracting a single article.

Input:

{
"mode":"latest",
"limit":10
}

Output - array of objects:

[
{
"url":"https://www.thestar.com.my/news/nation/2026/04/23/37-foreigners-held-in-raid-on-johor-entertainment-outlet",
"title":"37 foreigners held in raid on Johor entertainment outlet",
"source":"The Star"
}
]

Source: https://www.thestar.com.my/news (homepage, no public RSS available)

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

  1. Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI β–Έ Schedules β–Έ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
  2. Webhook the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input β€” Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
  3. The article-mode run extracts the full body, image, authors, and metadata for each URL and appends to your master dataset.

Common cron expressions:

FrequencyCron
Every 15 minutes*/15 * * * *
Hourly0 * * * *
Every 6 hours0 */6 * * *
Daily at 06:00 UTC0 6 * * *

Other News Actors

Need a different news source? All actors in this collection:

ActorSource
aljazeera-scraperAl Jazeera
apnews-scraperAP News
bbc-scraperBBC News
bisnis-scraperBisnis Indonesia
cnbc-scraperCNBC
dataindonesia-scraperData Indonesia
forbes-scraperForbes
fortune-scraperFortune
ft-scraperFinancial Times
guardian-scraperThe Guardian
investors-scraperInvestor's Business Daily
msn-scraperMSN News
nytimes-scraperNew York Times
reuters-scraperReuters
scmp-scraperSouth China Morning Post
smh-scraperSydney Morning Herald
straitstimes-scraperThe Straits Times
techcrunch-scraperTechCrunch
thestar-scraperThe Star (Malaysia)
upi-scraperUPI
yahoo-finance-scraperYahoo Finance

You might also like

investors.com Business Daily Article Scraper

xtracto/investors-scraper

Extract article metadata and visible intro content from investors.com (IBD). Full articles contents, No browser needed - HTTP-only.

πŸ‘ User avatar

Farhan Febrian Nauval

3

Sydney Morning Herald Article Scraper

xtracto/smh-scraper

Extract article metadata and visible intro content from smh.com.au. Full articles require a Nine subscription. No browser needed - HTTP-only.

πŸ‘ User avatar

Farhan Febrian Nauval

2

Article Content Extractor πŸ“„

easyapi/article-content-extractor

Extract clean article content, metadata and structured information from any web page. Supports multiple URLs and returns well-formatted JSON with title, description, content, author, publish date and more. πŸ”πŸ“„

The Straits Times Article Scraper

xtracto/straitstimes-scraper

Extract full article text, authors, dates, and metadata from The Straits Times URLs. No browser needed - fast HTTP-only extraction.

πŸ‘ User avatar

Farhan Febrian Nauval

2

Google News Article Scraper

webscrap18/google-news-article-scraper

Scrape Google News, Extract full content with Title, Article Text, Images and Structured data.

Article Extraction API

tugelbay/article-extractor

Extract clean article text and metadata from URLs as Markdown, text, or HTML for RAG, AI agents, monitoring, and research. Guide: https://konabayev.com/tools/article-extractor/?utm_source=apify_info&utm_medium=referral&utm_campaign=article-extractor

πŸ‘ User avatar

Tugelbay Konabayev

41

Cnbc Article Scraper

xtracto/cnbc-scraper

Scrape full article content, title, authors, and metadata from cnbc.com. Supports `mode: latest` for live CNBC headline feed. HTTP-only, no browser

πŸ‘ User avatar

Farhan Febrian Nauval

3