Pricing
$3.00 / 1,000 results
❓ Zhihu Search Scraper
Extract Zhihu search data — title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.
Actor stats: 0 bookmarked, 3 total users, 1 monthly active user, last modified 9 days ago.
Zhihu Search Scraper
Search Zhihu (知乎), China's largest Q&A and knowledge-sharing platform, by keyword. The Actor returns clean, structured records for each result: title, excerpt, full HTML content, upvote / comment / favorite counts, the question it belongs to, and author info. Choose one search type per run: general (answers, articles and questions), videos, columns, or users.
Unofficial. This Actor is not affiliated with, authorized, or endorsed by Zhihu (智者天下 / Zhihu Inc.). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Zhihu's terms and all applicable laws; you are responsible for how you use the retrieved data.
What it does
- Keyword search: give one or more keywords; the Actor paginates Zhihu search results for each until it hits `maxItems` or the end of results.
- Search types: pick one:
  - `general` (default): answers, articles and questions.
  - `video`: Zhihu videos (zvideo).
  - `column`: Zhihu columns (专栏).
  - `user`: Zhihu users (people).
- Promo / ad / "related query" / gaokao cards are filtered out automatically; only real content records are returned. Search-highlight `<em>` markup is stripped.
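The behaviour described above can be sketched roughly as follows. This is an illustration only, not the Actor's actual code: `fetch_page` is a hypothetical callable standing in for the upstream search call, and the `is_promo` flag is an assumed marker for promo/ad cards (the real card types are not documented here).

```python
import re
from typing import Callable

# Matches opening and closing <em> tags, i.e. the search-highlight markup.
EM_TAG = re.compile(r"</?em>", re.IGNORECASE)


def strip_highlight(text: str) -> str:
    """Remove <em> highlight tags while keeping the text inside them."""
    return EM_TAG.sub("", text)


def collect(
    keywords: list[str],
    fetch_page: Callable[[str, int], list[dict]],  # hypothetical upstream call
    max_items: int,
) -> list[dict]:
    """Page through each keyword until max_items is hit or results run out."""
    items: list[dict] = []
    for keyword in keywords:
        offset = 0
        while True:
            page = fetch_page(keyword, offset)
            if not page:  # an empty page is treated as the end of results
                break
            offset += len(page)
            for raw in page:
                if raw.get("is_promo"):  # assumed flag for promo/ad cards
                    continue
                cleaned = {**raw, "title": strip_highlight(raw.get("title", ""))}
                items.append(cleaned)
                if len(items) >= max_items:
                    return items  # the cap applies across all keywords
    return items
```

Note that the cap is checked after every appended record, so a run never returns more than `max_items` even when a page straddles the limit.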
Input
| Field | Type | Default | Description |
|---|---|---|---|
| `searchKeywords` | string[] | `["人工智能"]` | Keywords to search. Each is paginated independently. Chinese keywords return the most results. |
| `searchType` | enum | `general` | `general` / `video` / `column` / `user`. |
| `maxItems` | integer | `50` | Max total records across all keywords. |
| `includeContent` | boolean | `true` | Include the full HTML body of answers/articles. Disable to keep items small. |
Example input
{"searchKeywords":["人工智能","心理学"],"searchType":"general","maxItems":100,"includeContent":true}
Output
One dataset item per record:
{"id":"2044932700230914484","kind":"answer","title":"如何评价知乎上以赵泠为首的人工智能唱衰党?","excerpt":"大家一定要对这个专业去魅 ...","url":"https://www.zhihu.com/answer/2044932700230914484","authorName":"赵泠","authorUrlToken":"MarryMea","authorHeadline":"衣沾不足惜,但使愿无违。","voteupCount":2159,"commentCount":126,"favoritesCount":0,"questionTitle":"如何评价知乎上以赵泠为首的人工智能唱衰党?","questionId":"2042974223212213176","createdTime":1780329948,"updatedTime":1780330499,"content":"<p>...full answer HTML...</p>","source":"general:人工智能"}
For user results the item carries authorName, authorUrlToken, authorHeadline,
followerCount, answerCount and articlesCount; for column results it carries
title, articlesCount and followerCount.
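For downstream analysis you will often want plain text and real timestamps rather than raw HTML and integers. The following is a minimal post-processing sketch under two assumptions drawn from the single example item above: that `createdTime` is a Unix timestamp in seconds, and that `source` has the form `<searchType>:<keyword>`. The helper names are illustrative.

```python
from datetime import datetime, timezone
from html.parser import HTMLParser

# Tags treated as line breaks when flattening HTML to text.
BLOCK_TAGS = {"p", "br", "div", "li", "blockquote", "h1", "h2", "h3"}


class _TextExtractor(HTMLParser):
    """Collects text nodes, inserting newlines around block-level tags."""

    def __init__(self) -> None:
        super().__init__()
        self.parts: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html: str) -> str:
    """Flatten an HTML fragment to text, one non-empty line per block."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered text
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)


def summarize(item: dict) -> dict:
    """Return a compact view of one dataset item."""
    # Assumption: source looks like "<searchType>:<keyword>".
    search_type, _, keyword = item.get("source", "").partition(":")
    created = item.get("createdTime")
    return {
        "id": item["id"],
        "kind": item.get("kind"),
        # Assumption: createdTime is Unix seconds.
        "created": datetime.fromtimestamp(created, tz=timezone.utc)
        if created is not None else None,
        "search_type": search_type,
        "keyword": keyword,
        "text": html_to_text(item.get("content") or ""),
    }
```

Items fetched with `includeContent` disabled have no `content`, in which case `text` is simply an empty string.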
Notes
- Data is sourced live. The Zhihu upstream is occasionally flaky and can emit a transient error even for a good keyword; the Actor retries with exponential backoff, so a run that briefly errors usually still returns results.
- Record IDs are de-duplicated within a run.
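The two notes above describe common patterns that are easy to reproduce in your own client code. The sketch below shows one generic way to do both; it is not the Actor's implementation, and the attempt count, base delay and function names are assumptions chosen for illustration.

```python
import time
from typing import Callable, Iterable, Iterator, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    call: Callable[[], T],
    attempts: int = 4,          # assumed; the Actor's real limit is not documented
    base_delay: float = 1.0,    # seconds; doubled after every failure
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `call` until it succeeds, waiting 1s, 2s, 4s, ... between tries."""
    for attempt in range(attempts):
        try:
            return call()
        except Exception:  # real code would catch only known transient errors
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)
    raise ValueError("attempts must be at least 1")


def dedupe_by_id(items: Iterable[dict]) -> Iterator[dict]:
    """Yield each record once, keeping the first occurrence of every id."""
    seen: set[str] = set()
    for item in items:
        if item["id"] in seen:
            continue
        seen.add(item["id"])
        yield item
```

Passing `sleep` as a parameter keeps the retry helper testable without real waiting; in production you would leave the default.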
