VOOZH about

URL: https://apify.com/jungle_synthesizer/nyt-cooking-recipe-scraper

โ‡ฑ NYT Cooking Recipe Scraper ยท Apify


Pricing

Pay per event

Go to Apify Store

NYT Cooking Recipe Scraper

Enumerate all ~25K public NYT Cooking recipes from the official sitemap and extract structured recipe data (ingredients, instructions, nutrition, ratings) from schema.org Recipe JSON-LD.

Pricing

Pay per event

Rating

0.0

(0)

Developer

๐Ÿ‘ BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

Enumerate the complete NYT Cooking recipe catalog (~25K recipes) from the official sitemap and extract structured recipe data from the public schema.org Recipe JSON-LD embedded in each page.

What it collects

Every record contains the following fields:

FieldTypeDescription
recipe_idstringUnique NYT Cooking recipe identifier
urlstringCanonical recipe URL
namestringRecipe title
authorstringNYT Cooking contributor byline
descriptionstringRecipe description / headnote
recipe_yieldstringServing size (e.g. "4 servings")
total_timestringTotal cooking time (e.g. "1 hr 30 min")
prep_timestringPreparation time
cook_timestringActive cooking time
recipe_categorystringMeal category (e.g. "Dinner, Main Course")
recipe_cuisinestringCuisine style (e.g. "Mediterranean Inspired")
recipe_ingredientarrayList of ingredient strings with quantities
recipe_instructionsarrayStep-by-step instructions
nutritionstringJSON-serialized nutrition facts (calories, fat, carbs, protein, sodium, etc.) from schema.org NutritionInformation. null for recipes without nutrition data.
aggregate_ratingnumberAverage user rating (1โ€“5 scale)
rating_countintegerNumber of user ratings
keywordsarrayTags and keywords (ingredient highlights, technique, difficulty, etc.)
image_urlsarrayFull-resolution image URLs
date_publishedstringISO 8601 publication date

Discovery

By default the actor walks the official NYT Cooking sitemap index (https://www.nytimes.com/sitemaps/new/cooking.xml.gz), which contains monthly sub-sitemaps covering the full recipe inventory. Only /recipes/ paths are collected โ€” article and guide pages are excluded.

Inputs

InputTypeDefaultDescription
maxItemsinteger10Maximum number of recipes to collect. Set to 0 for no limit (full catalog run).
startUrlsarrayโ€”Optional list of specific NYT Cooking recipe URLs to scrape directly, bypassing sitemap discovery. Useful for targeted single-recipe or small-batch runs.

Data source

All data is extracted from the schema.org/Recipe JSON-LD markup that NYT Cooking embeds in every public recipe page for SEO purposes. Recipe content โ€” including ingredients, instructions, and metadata โ€” is publicly available. The NYT Cooking paywall only gates account-specific features (recipe box, personal notes, collections) and does not restrict access to recipe markup.

Usage notes

  • For a full catalog run (~25K recipes), use maxItems: 0 and allow sufficient run time.
  • Nutrition data (nutrition field) is present on most recipes but absent on some recently published ones; the field is null in those cases.
  • The sitemap updates frequently (new recipes appear within hours of publication). Re-running with maxItems: 0 against the latest sub-sitemaps will catch additions.

You might also like

NYT Cooking Scraper

harvest/nyt-cooking-scraper

Scrapes recipe data from a New York Times Cooking recipe page. It extracts key details such as the recipe name, ingredients, instructions, cooking time, servings, and nutrition facts.

46

Recipe Scraper โ€” Extract Recipes from 100+ Cooking Websites

studio-amba/recipe-scraper

Scrape recipes with ingredients, instructions, nutrition, ratings, and cooking times from popular recipe websites. Supports allrecipes.com, bbcgoodfood.com, and any site with Schema.org Recipe markup.

BBC Good Food Recipe Scraper

jungle_synthesizer/bbcgoodfood-recipe-scraper

Enumerate and scrape the full BBC Good Food recipe catalogue (~15K+ recipes) from sitemap discovery. Extracts structured recipe data including ingredients, instructions, UK nutrition panels, skill level, dietary tags, ratings, and schema.org/Recipe JSON-LD fields.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Recipe JSON-LD Bulk Harvester

jungle_synthesizer/recipe-jsonld-bulk-harvester

Harvest structured recipe data from any food blog. URL mode: scrape a provided list. Domain mode: auto-discover the sitemap, filter Recipe pages, and crawl them. Extracts name, author, parsed ingredients, instructions, nutrition, and ratings from schema.org/Recipe JSON-LD and hRecipe microformat.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Recipe Scraper (Universal / schema.org)

crawlerbros/recipe-scraper

Scrape any schema.org-compliant recipe site like Epicurious, BBC Good Food, Tasty, NYT Cooking, Serious Eats, Food Network, plus thousands of food blogs. Extracts ingredients, instructions, nutrition, ratings, prep/cook time, yield, author, and images via JSON-LD parsing.

Recipe Data Scraper - Extract from 500+ Cooking Websites

vulnv/recipe-scraper

Powerful recipe scraper that extracts ingredients, instructions, nutrition facts, and cooking metadata from 500+ popular cooking websites including AllRecipes, Food Network, BBC Good Food, Epicurious, and more. Perfect for food apps, meal planning, nutrition analysis, and culinary research.

Food.com Recipe Scraper

jungle_synthesizer/foodcom-recipe-scraper

Scrape recipes from Food.com โ€” one of the largest English community recipe databases with 500K+ recipes. Enumerate the full sitemap or supply specific URLs. Extracts ingredients, instructions, nutrition, ratings, reviews, and tag taxonomy.

๐Ÿ‘ User avatar

BowTiedRaccoon

2