🔎 Yahoo Scraper

Pricing

from $4.99 / 1,000 results

Try for free

Go to Apify Store

👁 🔎 Yahoo Scraper

🔎 Yahoo Scraper

Try for free

Pricing

from $4.99 / 1,000 results

Rating

0.0

(0)

Developer

👁 API Empire

API Empire

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

25 days ago

Last modified

🔎 Yahoo Search Scraper

Scrape Yahoo Search results at scale — titles, URLs, snippets, favicons, in-article sub-links, and a clean Markdown excerpt for every result. Bulk queries, time-window filtering and smart proxy auto-escalation (direct → datacenter → residential) keep your runs fast and unblocked.

⭐ Why Choose Us?

Bulk-first — paste dozens of queries (or full Yahoo URLs) and walk every result page until the cap.
Smart proxy ladder — starts direct, only escalates if Yahoo blocks. You don't pay for residential traffic you didn't need.
Rich back-fill — when Yahoo's snippet is thin, the actor visits the result page and harvests in-article sub-links + a Markdown summary.
Live results — rows stream to the dataset as they're scraped, so a mid-run interruption never loses your data.
Production-grade error handling — 3-tier proxy retries, graceful PPE limit handling, exponential cool-downs.

🔑 Key Features

🌐 Bulk queries — plain keywords or full Yahoo SERP URLs, mixed freely.
📅 Time-window filter — Anytime / Past day / Past week / Past month.
🛡️ Auto-escalating proxy: direct → Apify Datacenter → Apify Residential (3 retries), then sticky.
🧩 Optional second-pass back-fill of sub-links + Markdown excerpts.
📋 Per-section dataset views: Overview, Snippet, Sub-links.
🔄 Custom proxy URLs supported — they go first, then the smart ladder.

🧾 Input

{
"queries":[
"java developer",
"https://search.yahoo.com/search?p=python+jobs"
],
"maxItems":10,
"timePeriod":"Anytime",
"backfillEmptyResults":true,
"backfillConcurrency":8,
"backfillMaxLinks":10,
"proxyConfiguration":{"useApifyProxy":false}
}

Field	Type	Description
`queries`	`string[]`	One or more search terms or Yahoo SERP URLs.
`maxItems`	`integer`	Hard cap on unique results per query (1–500).
`timePeriod`	`string`	`Anytime` / `Past day` / `Past week` / `Past month`.
`backfillEmptyResults`	`boolean`	Visit each result page to harvest sub-links + Markdown excerpt.
`backfillConcurrency`	`integer`	Parallelism for back-fill (1–32).
`backfillMaxLinks`	`integer`	Max in-article sub-links per result page (1–50).
`proxyConfiguration`	`object`	Apify proxy config. Defaults to direct (no proxy).

📤 Output

Each row matches the per-section views in the dataset.

{
"query":"java developer",
"title":"How to become a Java Developer? - GeeksforGeeks",
"url":"https://www.geeksforgeeks.org/gfg-academy/how-to-become-a-java-developer/",
"description":"A Java developer is a software engineer who builds...",
"text":" * Core Java\n\nCore Fundamentals: Learn concepts and practice DSA...\n",
"logo_url":"https://s.yimg.com/pv/.../32x32_7eae5aac8b7f7402.png",
"links":[
"https://www.geeksforgeeks.org/java/java",
"https://www.geeksforgeeks.org/advance-java/spring"
],
"domain":"www.geeksforgeeks.org"
}

Field	Description
`query`	The query (or URL) the row was scraped under.
`title`	The result's headline.
`url`	The clean target URL (Yahoo's tracker is stripped).
`description`	Yahoo's SERP snippet, rendered as Markdown.
`text`	Markdown excerpt — either Yahoo's list block or, after back-fill, an in-article summary.
`logo_url`	The result's favicon.
`links`	Up to N harvested in-article sub-links (after back-fill).
`domain`	The host portion of `url`.

🚀 How to Use (Apify Console)

Open Apify Console → Actors.
Find this actor and open it.
Paste your queries (one per line) into 🌐 Search Queries / URLs.
Pick a 🎁 Maximum results cap and a 📅 Time window.
(Optional) Leave proxy on direct — the actor will auto-escalate only when needed.
Click Start.
Watch live logs — rows appear in the Output tab as they're scraped.
Export results as JSON / CSV / XLSX.

🤖 Use via API

curl-X POST "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN"\
-H"Content-Type: application/json"\
-d'{
 "queries": ["java developer"],
 "maxItems": 10,
 "timePeriod": "Anytime"
 }'

💼 Best Use Cases

SEO & SERP monitoring on Yahoo.
Competitive intelligence — track who appears for a query over time.
Lead generation — feed result URLs into your own enrichment pipeline.
Content discovery — harvest in-article sub-links for further crawling.

💳 Pricing

This actor uses Apify's Pay-per-event model. The primary event is result-item — one charge per result row pushed to the dataset. You pay only for the rows you actually receive; back-fill, retries and failed attempts are not billed.

You also pay the underlying Apify platform usage (compute units, proxy traffic when used). Direct (no-proxy) requests cost no proxy traffic at all — which is why the actor stays on direct until Yahoo forces it to escalate.

❓ Frequently Asked Questions

Does it work when Yahoo blocks me? Yes. The default no-proxy run is the fastest, but the moment Yahoo returns a block (HTTP 429/503 or a captcha page), the actor auto-escalates to the Apify Datacenter pool, then to Residential with up to 3 retries. Once a tier works, it's locked in for the rest of the run.

Can I bring my own proxies? Yes — paste them into the proxy field's Custom proxy URLs. Your URLs are tried first (3 retries), then the datacenter → residential fallback ladder kicks in.

Does it follow pagination? Yes. Yahoo returns ~7 results per page; the actor walks pages until your maxItems cap is hit or 3 consecutive pages return nothing.

What about non-Latin queries? Yahoo handles UTF-8 queries natively — paste them as-is.

Why is my back-filled text empty for some rows? Some sites block all bots (or render with JS only). In that case the actor falls back to a minimal block built from Yahoo's own title + description so the field is never blank.

📨 Support & Feedback

Issues / feature requests → please open a thread on the actor's detail page.
Custom solutions → dev.scraperengine@gmail.com.

🔎 Yahoo Scraper

simpleapi/yahoo-scraper

👁 User avatar

SimpleAPI

🔎 Yahoo Scraper

scrapapi/yahoo-scraper

👁 User avatar

ScrapAPI

👁 🔎 Yahoo Scraper avatar

🔎 Yahoo Scraper

scraper-engine/yahoo-scraper

👁 User avatar

Scraper Engine

🔎 Yahoo Scraper

scrapio/yahoo-scraper

👁 User avatar

Scrapio

🔎 Yahoo Scraper

scrapier/yahoo-scraper

👁 User avatar

Scrapier

Yahoo Finance Scraper

technicaldost/yahoo-finance-scraper

👁 User avatar

Technical Dost Solutions

Yahoo Finance Scraper

quarterly_jingo/yahoo-finance-scraper

👁 User avatar

Petey Boy

👁 Yahoo Search Results Scraper avatar

Yahoo Search Results Scraper

bhansalisoft/yahoo-search-results-scraper

Yahoo Search Results Scraper : Scrape Yahoo Search Engine Results Pages (SERPs). Enter keyword and Select the country and extract organic and paid results from Yahoo.com.

👁 User avatar

bhansalisoft

👁 Yahoo Images Scraper avatar

Yahoo Images Scraper

searchapi/yahoo-images-scraper

Scrapes image results from Yahoo Images Search (images.search.Yahoo.com). Extracts image URL, thumbnail, source, title, dimensions, and more.

👁 User avatar

Search API

👁 Yahoo Search Scraper avatar

Yahoo Search Scraper

searchapi/yahoo-search-scraper

Scrapes organic web search results from Yahoo Search (search.Yahoo.com). Extracts title, link, snippet, domain, displayed URL, date, and more.

👁 User avatar

Search API

URL: https://apify.com/api-empire/yahoo-scraper

⇱ 🔎 Yahoo Scraper · Apify

🔎 Yahoo Scraper

🔎 Yahoo Search Scraper

⭐ Why Choose Us?

🔑 Key Features

🧾 Input

📤 Output

🚀 How to Use (Apify Console)

🤖 Use via API

💼 Best Use Cases

💳 Pricing

❓ Frequently Asked Questions

📨 Support & Feedback

You might also like

🔎 Yahoo Scraper

🔎 Yahoo Scraper

🔎 Yahoo Scraper

🔎 Yahoo Scraper

🔎 Yahoo Scraper

Yahoo Finance Scraper

Yahoo Finance Scraper

Yahoo Search Results Scraper

Yahoo Images Scraper

Yahoo Search Scraper