Pricing
$3.00 / 1,000 results
Zhihu User Content Scraper
Extract Zhihu user content data: title and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.
Zhihu User Content Scraper
zhihu-user-content-scraper
Scrape Zhihu (知乎) users by their url_token. Get every published article
(with the full HTML body), or switch modes to pull a user's profile, their
followers, their followees, or the columns they follow. Each record is
returned as clean, structured data: title, content, excerpt, vote-up and comment
counts, author info, and the canonical URL.
Unofficial. This Actor is not affiliated with, authorized, or endorsed by Zhihu (知乎 / 智者天下科技). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Zhihu's terms and all applicable laws; you are responsible for how you use the retrieved data.
What it does
Give one or more Zhihu url_tokens (the slug in a profile URL
zhihu.com/people/<token>, e.g. kaifulee for 李开复) and pick a content type:
- Articles (default): every article the user has published, including the full HTML content, title, excerpt, vote-up / comment counts and timestamps. Paginated automatically. Sort by newest or most up-voted.
- Profile: the user's profile object (name, headline, avatar, follower / answer counts, etc.). One record per user.
- Followers / Followees: the people who follow, or are followed by, the user.
- Columns: the columns the user follows.
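Since a url_token is just the slug after /people/ in a profile URL, it can be pulled out of a pasted link before a run. The helper below is a minimal sketch of that step; `url_token_from_profile` is a hypothetical name and is not part of the Actor, and the assumption that anything without a slash is already a bare token is mine.

```python
from urllib.parse import urlparse


def url_token_from_profile(value: str) -> str:
    """Return the url_token from a Zhihu profile URL (hypothetical helper).

    Assumption: a value without any "/" is already a bare token.
    """
    value = value.strip()
    if "/" not in value:
        return value  # e.g. "kaifulee"
    # Tolerate links pasted without a scheme, e.g. "zhihu.com/people/kaifulee".
    parsed = urlparse(value if "://" in value else "https://" + value)
    parts = [p for p in parsed.path.split("/") if p]
    # Profile paths look like /people/<token>[/...]
    if len(parts) >= 2 and parts[0] == "people":
        return parts[1]
    raise ValueError(f"not a Zhihu profile URL: {value!r}")
```

For example, `url_token_from_profile("https://www.zhihu.com/people/kaifulee")` yields the token `kaifulee`, which can then go straight into userUrlTokens.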
Input
| Field | Type | Default | Description |
|---|---|---|---|
| userUrlTokens | string[] | ["kaifulee"] | Zhihu user url_tokens. Each is scraped independently. |
| contentType | enum | articles | articles / profile / followers / followees / columns. |
| sortType | enum | created | Article sort: created (newest) or voteups (most up-voted). Articles only. |
| maxItems | integer | 10 | Max total records across all users. Ignored for profile. |
Example input
{"userUrlTokens":["kaifulee"],"contentType":"articles","sortType":"voteups","maxItems":50}
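To illustrate the fields in the input table, a small Python helper could assemble and sanity-check an input object before it is submitted. This is only a sketch: `build_input` is a hypothetical name, the enum values and defaults are copied from the table, and the extra client-side checks (non-empty token list, positive maxItems) are assumptions rather than rules documented by the Actor.

```python
import json

# Allowed values as listed in the input table.
CONTENT_TYPES = {"articles", "profile", "followers", "followees", "columns"}
SORT_TYPES = {"created", "voteups"}


def build_input(user_url_tokens, content_type="articles",
                sort_type="created", max_items=10):
    """Build an input dict for the Actor (hypothetical helper)."""
    tokens = list(user_url_tokens)
    if not tokens:
        # Assumption: an empty token list is never useful.
        raise ValueError("userUrlTokens must contain at least one token")
    if content_type not in CONTENT_TYPES:
        raise ValueError(f"unknown contentType: {content_type!r}")
    if sort_type not in SORT_TYPES:
        raise ValueError(f"unknown sortType: {sort_type!r}")
    if max_items < 1:
        # Assumption: maxItems is expected to be a positive integer.
        raise ValueError("maxItems must be at least 1")
    return {
        "userUrlTokens": tokens,
        "contentType": content_type,
        "sortType": sort_type,   # only meaningful for articles
        "maxItems": max_items,   # ignored for profile
    }


if __name__ == "__main__":
    payload = build_input(["kaifulee"], sort_type="voteups", max_items=50)
    print(json.dumps(payload))
```

The resulting dict serializes to the same JSON as the example input, so it can be pasted into the console or passed to whichever API client is in use.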
Output
One dataset item per record. Articles look like:
{"id":"606602766","type":"article","title":"ChatGPT引发失业恐慌，这20种工作要避开！","excerpt":"OpenAI新近推出的ChatGPT已经爆火出圈...","content":"<p>...full HTML body...</p>","url":"http://zhuanlan.zhihu.com/p/606602766","articleType":"normal","voteupCount":354,"commentCount":124,"created":1676559521,"updated":1676559615,"author":{"id":"...","name":"李开复","urlToken":"kaifulee","headline":"..."},"authorName":"李开复","userUrlToken":"kaifulee","source":"articles:kaifulee"}
Other content types share the same flat shape (id, type, title, url,
authorName, …) with mode-specific fields (e.g. followerCount for profiles,
itemsCount for columns).
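Downstream code usually wants a readable timestamp and plain text rather than raw HTML. The sketch below post-processes one article record using only the Python standard library. It assumes that created is a Unix timestamp in seconds (the sample value is consistent with that, but the source does not state it), and `html_to_text` and `summarize_article` are hypothetical helper names.

```python
from datetime import datetime, timezone
from html.parser import HTMLParser

# Tags after which a line break is inserted when flattening HTML.
BLOCK_TAGS = {"p", "div", "li", "h1", "h2", "h3", "blockquote"}


class _TextExtractor(HTMLParser):
    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "br":
            self.chunks.append("\n")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.chunks.append("\n")

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html: str) -> str:
    """Flatten an HTML fragment to text, one line per block element."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    lines = (line.strip() for line in "".join(parser.chunks).splitlines())
    return "\n".join(line for line in lines if line)


def summarize_article(record: dict) -> dict:
    """Pick a few fields from an article record (hypothetical helper)."""
    return {
        "id": record["id"],
        "title": record["title"],
        "author": record["authorName"],
        # Assumption: `created` is epoch seconds, interpreted as UTC.
        "created": datetime.fromtimestamp(record["created"], tz=timezone.utc),
        "text": html_to_text(record["content"]),
        "voteupCount": record["voteupCount"],
    }
```

Under that assumption, the sample value 1676559521 corresponds to 2023-02-16 14:58:41 UTC.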
Notes
- Data is sourced live; the upstream occasionally emits transient blocks and an intermittent 400, so the Actor retries with exponential backoff.
- Records are de-duplicated by id within a run.
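The two behaviours in these notes can be sketched generically in Python. This is not the Actor's actual code: `TransientError`, `retry_with_backoff` and `dedupe_by_id` are hypothetical names, and the attempt count, delays and jitter are illustrative assumptions.

```python
import random
import time


class TransientError(Exception):
    """Hypothetical marker for a retryable failure (a block or a 400)."""


def retry_with_backoff(fetch, max_attempts=5, base_delay=1.0,
                       max_delay=30.0, sleep=time.sleep):
    """Call fetch() until it succeeds, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return fetch()
        except TransientError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Up to 10% jitter so parallel workers do not retry in lockstep.
            sleep(delay + random.uniform(0, delay / 10))


def dedupe_by_id(records):
    """Yield each record once, keeping the first occurrence of every id."""
    seen = set()
    for record in records:
        if record["id"] in seen:
            continue
        seen.add(record["id"])
        yield record
```

Passing `sleep` as a parameter keeps the retry loop easy to test without real waiting; a caller that consumes the dataset across several runs could apply `dedupe_by_id` again, since the Actor only de-duplicates within a single run.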
