Pricing
$3.00 / 1,000 results
๐ฐ Toutiao Article Detail Scraper
Extract Toutiao article detail data โ url, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.
Pricing
$3.00 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
10 days ago
Last modified
Categories
Share
Toutiao Article Detail Scraper (ไปๆฅๅคดๆกๆ็ซ ่ฏฆๆ )
Scrape Toutiao (ไปๆฅๅคดๆก) article details by article group ID or article URL. For each article it returns clean, structured data: title, source / author, read / comment / digg / like / repost counts, author and media info, and โ optionally โ the article's HTML content and images.
Unofficial. This Actor is not affiliated with, authorized, or endorsed by Toutiao (ไปๆฅๅคดๆก) or ByteDance (ๅญ่่ทณๅจ). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Toutiao's terms and all applicable laws; you are responsible for how you use the retrieved data.
What it does
Toutiao articles are addressed by a 19-digit group ID (the ByteDance content id). Give the Actor one or more of these IDs โ or full article URLs and it will parse the ID out for you โ and it fetches each article's detail:
- Engagement & metadata โ read count, comment / digg / like / repost / bury counts, title, source, canonical URL, content hash.
- Author / media info โ author name, user id, media id, fan count, avatar, verification.
- Content & images (optional) โ when
includeContentis on (default), the Actor also calls the web endpoint to add the article's HTMLcontent,titleImage, and image list.
Input
| Field | Type | Default | Description |
|---|---|---|---|
groupIds | string[] | ["7036185404340437511"] | Article group IDs (19-digit) or article URLs (toutiao.com/group/<id>/, toutiao.com/article/<id>/). URLs are parsed to their id automatically. |
includeContent | boolean | true | Also fetch the article HTML content + images via the web endpoint (one extra API call per article). |
maxItems | integer | 50 | Max total articles to scrape across all IDs. |
Example input
{"groupIds":["7036185404340437511","https://www.toutiao.com/group/7036185404340437511/"],"includeContent":true,"maxItems":100}
Output
One dataset item per article:
{"groupId":"7036185404340437511","title":"ๅ จๅฎถไธ่ตทๅๅฐ้บป่พฃๆตท้ฒๆฑค๏ผ่ฝฏๅซฉ่น่ๆญ้ ็พๅณ้ฑผ้ฅผ๏ผๅณ้้ฒ็พๆ ๆฏ๏ผ","url":"https://toutiao.com/group/7036185404340437511/","displayUrl":"https://toutiao.com/group/7036185404340437511/","shareUrl":"https://m.toutiaoimg.cn/i7036185404340437511/","source":"ๅ ดๆฃฎไธๅฎถ","readCount":1450943,"commentCount":1010,"diggCount":12049,"likeCount":12049,"repinCount":1226,"buryCount":1,"contentHash":"3b08368c","author":{"userId":"1398172224077304","mediaId":"1655138971451403","name":"ๅ ดๆฃฎไธๅฎถ","fansCount":1501681,"avatarUrl":"https://...","verified":"True","description":"..."},"content":"<p>...</p>","titleImage":"https://p11-sign.toutiaoimg.com/...","imageUrls":["https://..."],"source_endpoint":"app"}
Notes
- Data is sourced live; Toutiao occasionally rate-limits, so the Actor retries transient blocks with exponential backoff and a browser User-Agent.
- This Actor is ID/URL-driven โ there is no keyword search. Collect the group IDs of the articles you want first (the long number in any Toutiao article link).
- Group IDs are de-duplicated within a run.
