VOOZH about

URL: https://apify.com/ethereal_wool/zhihu-question-answers-scraper

⇱ Zhihu Question Answers Scraper & API Β· Apify


πŸ‘ ❓ Zhihu Question Answers Scraper avatar

❓ Zhihu Question Answers Scraper

Pricing

$3.00 / 1,000 results

Go to Apify Store

❓ Zhihu Question Answers Scraper

Extract Zhihu question answers data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Pricing

$3.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Jackie Chen

Jackie Chen

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

1

Monthly active users

9 days ago

Last modified

Share

Zhihu Question Answers Scraper

πŸ‘ zhihu-question-answers-scraper

Scrape every answer on a Zhihu (ηŸ₯乎) question β€” with the full answer body, not just a snippet. Each answer becomes one structured item: complete HTML content, a plain-text excerpt, the author's name / handle / follower count, upvotes, comment count, and timestamps. This is ideal raw material for LLM training data, Q&A-pair datasets, and content research.

Unofficial. This Actor is not affiliated with, authorized, or endorsed by Zhihu (ηŸ₯乎 / Zhihu Inc.). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Zhihu's terms and all applicable laws; you are responsible for how you use the retrieved data.

What it does

  • Question answers (primary) β€” give one or more Zhihu question IDs; the Actor paginates through every answer on each question and emits one item per answer with the full body text.
  • Answer comments (optional) β€” give answer IDs to also pull their top-level comments, one item per comment.

Input

FieldTypeDefaultDescription
questionIdsstring[]["19550517"]Zhihu question IDs (numeric id from zhihu.com/question/<id>). Each is paginated independently.
orderenumdefaultdefault (Zhihu ranking) or updated (most recently updated).
answerIdsstring[][]Optional. Answer IDs to pull top-level comments for.
maxItemsinteger50Max total items (answers + comments) across all inputs.
maxCommentsPerAnswerinteger50Cap on comments fetched per answer ID.

Example input

{
"questionIds":["19550517","20278289"],
"order":"default",
"maxItems":200
}

Output

One dataset item per answer:

{
"answerId":"12202199",
"type":"answer",
"url":"https://www.zhihu.com/question/19550517/answer/12202199",
"questionId":19550517,
"questionTitle":"Instagram ζœ‰ε€šε°‘η”¨ζˆ·οΌŸ",
"excerpt":"1500 δΈ‡γ€‚δ»ŠεΉ΄ 3 月份ζŠ₯ε‡Ίζ₯ηš„ζ•°ε­— ...",
"content":"<p>... full answer body as HTML ...</p>",
"voteupCount":2,
"commentCount":7,
"createdTime":1334057938,
"updatedTime":1334057938,
"authorName":"ι»„η»§ζ–°",
"authorUrlToken":"jixin",
"authorId":"...",
"authorFollowerCount":1006585,
"authorHeadline":"ηŸ₯乎 θ”εˆεˆ›ε§‹δΊΊ",
"authorUrl":"https://www.zhihu.com/people/jixin",
"source":"question:19550517"
}

When answerIds is supplied, comment items are also emitted with type: "comment", commentId, answerId, content, voteupCount, childCommentCount, and author fields.

Notes

  • Data is sourced live; the upstream API occasionally returns a transient retry signal, so the Actor retries with exponential backoff and a browser User-Agent.
  • Items are de-duplicated by id within a run.

You might also like

❓ Zhihu Search Scraper

ethereal_wool/zhihu-search-scraper

Extract Zhihu search data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

❓ Zhihu User Content Scraper

ethereal_wool/zhihu-user-content-scraper

Extract Zhihu user content data β€” title, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

πŸ“Έ Instagram Profile Scraper

ethereal_wool/instagram-profile-scraper

Extract Instagram profile data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

🎡 TikTok Profile Scraper

ethereal_wool/tiktok-profile

Extract TikTok profile data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

πŸ“Έ Instagram Reels Scraper

ethereal_wool/instagram-reels-scraper

Extract Instagram reels data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

YouTube Search Scraper | $3/1K Results

ethereal_wool/youtube-search-scraper

Extract YouTube search data β€” title, author, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

πŸŽ₯ YouTube Channel Info Scraper

ethereal_wool/youtube-channel-info-scraper

Extract YouTube channel info data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

TikTok Video Search Scraper | $3/1K Videos

ethereal_wool/tiktok-video-search

Extract TikTok video search data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Threads Search Scraper | $2.5/1K Posts

ethereal_wool/threads-search-scraper

Extract Threads search data β€” title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Zhihu Scraper β€” Q&A, Answers, Articles, Columns

sian.agency/zhihu-scraper

Zhihu scraper β€” extract long-form Mandarin Q&A, expert answers, articles & column posts. Keyword search, question answer threads, article detail, column article list. China market research, LLM training data, competitive intel. Four operations, one clean dataset per run. No API key.

28