VOOZH about

URL: https://apify.com/xtracto/nytimes-scraper

⇱ New York Times News Scraper Β· Apify


Pricing

from $15.00 / 1,000 results

Go to Apify Store

New York Times News Scraper

Extracts full New York Times articles while successfully bypassing partial-render paywalls to ensure complete content delivery.

Pricing

from $15.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Farhan Febrian Nauval

Farhan Febrian Nauval

Maintained by Community

Actor stats

0

Bookmarked

20

Total users

8

Monthly active users

11 days ago

Last modified

Share

New York Times Article Scraper

Extract full article text, headline, byline, authors, publication date, categories, tags, and metadata from nytimes.com article URLs. Extracts full article text including content behind the metered soft paywall.

Why Use This Actor?

  • Bypasses soft paywall - extracts full NYT article text available to web visitors.
  • Full article extraction - retrieves complete article text (10,000+ chars) including investigative long-form pieces.
  • Rich metadata - extracts authors, published/modified dates, categories, tags, and description.
  • Investigative journalism research - NYT investigative pieces are the definitive source for deep US stories.
  • Cultural coverage - NYT covers books, film, food, and culture alongside hard news.

How It Works

This actor uses only HTTP requests - no browser, no Selenium, no Playwright. Bypasses Cloudflare protection without a real browser. Articles are extracted in seconds with RAM usage well under 256 MB.

Input

{
"url":"https://www.nytimes.com/2026/04/12/technology/example-article.html",
"urls":[
"https://www.nytimes.com/2026/04/12/us/politics/article-one.html",
"https://www.nytimes.com/2026/04/11/world/article-two.html"
]
,
"mode":"article",
"limit":10
}

Output

{
"url":"https://www.nytimes.com/2026/05/15/nyregion/hezbollah-arrest-nyc-jewish-targets.html",
"source":"New York Times",
"title":"Kataib Hezbollah Commander Accused of Planning Attacks on N.Y.C.",
"description":"Mohammad Baqer Saad Dawood Al-Saadi is accused of directing and urging others to attack American and Israeli interests. Prosecutors say he is a leader of Kataib Hezbollah, an Iraqi militia with ties to Iran.",
"content":"A commander of an Iran-backed militia has been charged with plotting to attack Jewish sites in the United States, including a synagogue in New York City, federal authorities said. A criminal complaint unsealed on Friday accused the commander, Mohammad Baqer Saad Dawood Al-Saadi, of planning at least 18 attacks in Europe and Canada since late February, in retaliation for the United States and Israel’s attacks on Iran. Mr....",
"image":"https://static01.nyt.com/newsgraphics/images/icons/defaultPromoCrop.png",
"language":"en",
"word_count":313,
"published_date":"2026-05-15T16:54:16.351Z",
"modified_date":"",
"authors":[],
"categories":"",
"tags":""
}

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs and titles from New York Times instead of extracting a single article.

Input:

{
"mode":"latest",
"limit":10
}

Output - array of objects:

[
{
"url":"https://www.nytimes.com/2026/04/20/world/asia/asia-pacific-iran-war-oil.html",
"title":"The Iran War Sent Shock Waves Through Asia That Are Likely to Spread",
"published_date":"Mon, 20 Apr 2026 05:39:43 +0000",
"source":"New York Times"
}
//...
]

Source: https://rss.nytimes.com/services/xml/rss/nyt/HomePage.xml (RSS feed)

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

  1. Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI β–Έ Schedules β–Έ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
  2. Webhook the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input β€” Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
  3. The article-mode run extracts the full body, image, authors, and metadata for each URL and appends to your master dataset.

Common cron expressions:

FrequencyCron
Every 15 minutes*/15 * * * *
Hourly0 * * * *
Every 6 hours0 */6 * * *
Daily at 06:00 UTC0 6 * * *

Other News Actors

Need a different news source? All actors in this collection:

ActorSource
aljazeera-scraperAl Jazeera
apnews-scraperAP News
bbc-scraperBBC News
cnbc-scraperCNBC
forbes-scraperForbes
fortune-scraperFortune
ft-scraperFinancial Times
guardian-scraperThe Guardian
msn-scraperMSN News
nytimes-scraperNew York Times
reuters-scraperReuters
scmp-scraperSouth China Morning Post
techcrunch-scraperTechCrunch
upi-scraperUPI
yahoo-finance-scraperYahoo Finance
smart-news-loaderAny URL - adaptive HTTP loader
bloomberg-scraperBloomberg

All actors support mode: "latest" for fetching newest article URLs from each source.

You might also like

New York Times Scraper

theo/new-york-times-scraper

Scrape news data from nytimes.com with this unofficial API. Extract articles, monitor their popularity and performance and automate the fight against fake news. Filter the results by authors, topics, categories, or publication dates. Preview or download the results in your preferred format.

227

Financial Times News Scraper

xtracto/ft-scraper

Seamlessly retrieves full Financial Times articles by bypassing Cloudflare protection without requiring expensive residential proxies.

πŸ‘ User avatar

Farhan Febrian Nauval

29

5.0

FT.com Articles Scraper | Financial Times Headlines

parseforge/ft-com-articles-scraper

Collect Financial Times articles with headline, byline, date, section, summary, and full body. Filter by topic or keyword. Built for finance media monitoring, market intelligence, NLP datasets, and analyst teams tracking global business and economics news.

YELP Extractor 2.1 🚧

jupri/yelp

πŸ’« Scrape Business data from Yelp.com

Bloomberg Full Article Scraper - Cheapest

xtracto/bloomberg-news-article-scraper

Extract full Bloomberg news articles - including premium content - fast and cost-efficient using HTTP-only requests. No proxy, no browser, no login required.

πŸ‘ User avatar

Farhan Febrian Nauval

23

5.0

Ultimate Yelp Scraper

epctex/yelp-scraper

Gather info on millions of businesses, events, and reviews. Scrape businesses and extract descriptions, images, menus, addresses, and all other properties.. Customize with search terms, filters, and mappings for limitless insights. Ultimate Yelp scraper you need. No limits! No limits!

NYT Articles Scraper | New York Times Headlines

parseforge/nyt-articles-scraper

Extract New York Times articles with headline, byline, date, section, abstract, and full content. Filter by section, keyword, or date range. Ideal for media analysts, NLP datasets, sentiment research, and competitive content monitoring across US news.

Yelp Scraper

tri_angle/yelp-scraper

Free Yelp web scraper to extract data from Yelp. Fast Yelp review scraper, but also gets business details and ratings without using the Yelp API.

πŸ‘ User avatar

Tri⟁angle

6.3K

3.2

Article Extractor & News Scraper

web.harvester/article-extractor-news-scraper

Extract articles from any news site, blog, or webpage. Get title, full text, author, date, images & metadata using 7 extraction engines (Newspaper4k, Trafilatura, Goose3). Anti-bot bypass, proxy rotation, automatic fallback. Perfect for news monitoring, NLP datasets & content aggregation.

50

5.0

Bloomberg News Scraper Full Article Cheapest

ahmed_jasarevic/bloomberg-news-scraper

Unlock the full power of Bloomberg insights with our advanced extraction tool. This scraper is engineered to bypass 'subscription-only' restrictions, delivering comprehensive article content and deepmetadata without the need for an active account.

πŸ‘ User avatar

Ahmed Jasarevic

207