URL: https://apify.com/ethereal_wool/toutiao-article-detail

Toutiao Article Detail Scraper & API · Apify


๐Ÿ‘ ๐Ÿ“ฐ Toutiao Article Detail Scraper avatar

📰 Toutiao Article Detail Scraper

Pricing

$3.00 / 1,000 results

Extract Toutiao article detail data: title, URL, engagement counts, author info, and more. Scrape by article URL or group ID (keyword search is not supported). Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Rating: 0.0 (0)

Developer: Jackie Chen (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 10 days ago

Toutiao Article Detail Scraper (今日头条文章详情)

๐Ÿ‘ toutiao-article-detail

Scrape Toutiao (今日头条) article details by article group ID or article URL. For each article it returns clean, structured data: title, source / author, read / comment / digg / like / repost counts, author and media info, and, optionally, the article's HTML content and images.

Unofficial. This Actor is not affiliated with, authorized, or endorsed by Toutiao (今日头条) or ByteDance (字节跳动). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Toutiao's terms and all applicable laws; you are responsible for how you use the retrieved data.

What it does

Toutiao articles are addressed by a 19-digit group ID (the ByteDance content ID). Give the Actor one or more of these IDs, or full article URLs from which it parses the ID, and it fetches each article's detail:

  • Engagement & metadata: read count, comment / digg / like / repost / bury counts, title, source, canonical URL, content hash.
  • Author / media info: author name, user ID, media ID, fan count, avatar, verification.
  • Content & images (optional): when includeContent is on (the default), the Actor also calls the web endpoint to add the article's HTML content, titleImage, and image list.
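
The URL-to-ID parsing described above can be sketched as follows. This is a minimal illustration, not the Actor's actual code: the helper name `extract_group_id` and both regular expressions are assumptions, based only on the 19-digit ID format and the two URL shapes this page documents (`toutiao.com/group/<id>/` and `toutiao.com/article/<id>/`).

```python
import re
from typing import Optional

# Hypothetical patterns, derived from the documented URL shapes only.
_URL_ID = re.compile(r"toutiao\.com/(?:group|article)/(\d{19})(?:/|\?|$)")
_BARE_ID = re.compile(r"\d{19}")


def extract_group_id(value: str) -> Optional[str]:
    """Return the 19-digit group ID from a bare ID or an article URL, else None."""
    value = value.strip()
    if _BARE_ID.fullmatch(value):
        return value  # already a bare group ID
    match = _URL_ID.search(value)
    return match.group(1) if match else None
```

Returning `None` for unrecognized input lets a caller skip bad entries before spending a run on them.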

Input

Field | Type | Default | Description
groupIds | string[] | ["7036185404340437511"] | Article group IDs (19-digit) or article URLs (toutiao.com/group/<id>/, toutiao.com/article/<id>/). URLs are parsed to their ID automatically.
includeContent | boolean | true | Also fetch the article HTML content and images via the web endpoint (one extra API call per article).
maxItems | integer | 50 | Maximum total number of articles to scrape across all IDs.

Example input

{
  "groupIds": [
    "7036185404340437511",
    "https://www.toutiao.com/group/7036185404340437511/"
  ],
  "includeContent": true,
  "maxItems": 100
}
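
One way to submit this input programmatically is through Apify's REST API. The sketch below uses only the Python standard library. It assumes the Actor path `ethereal_wool~toutiao-article-detail` (taken from this page's URL, with `~` in place of `/`) and Apify's general `run-sync-get-dataset-items` endpoint; neither is confirmed by this page, so check both against the Apify API reference. `YOUR_APIFY_TOKEN` is a placeholder.

```python
import json
import urllib.parse
import urllib.request

# Assumptions: Actor path derived from the store URL; endpoint is Apify's
# general "run synchronously and return dataset items" route.
ACTOR_ID = "ethereal_wool~toutiao-article-detail"
API_BASE = "https://api.apify.com/v2"


def build_run_request(run_input: dict, token: str) -> urllib.request.Request:
    """Build a POST request that starts a run with the given input."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def run_actor(run_input: dict, token: str, timeout: float = 300.0) -> list:
    """Send the request and decode the JSON array of items (needs network access)."""
    request = build_run_request(run_input, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


run_input = {
    "groupIds": [
        "7036185404340437511",
        "https://www.toutiao.com/group/7036185404340437511/",
    ],
    "includeContent": True,
    "maxItems": 100,
}
# items = run_actor(run_input, "YOUR_APIFY_TOKEN")  # uncomment to perform a real run
```

Synchronous runs are time-limited, so larger batches may be better served by starting a run asynchronously and reading its dataset afterwards.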

Output

One dataset item per article:

{
  "groupId": "7036185404340437511",
  "title": "全家一起品尝麻辣海鲜汤，软嫩蟹肉搭配美味鱼饼，味道鲜美无比！",
  "url": "https://toutiao.com/group/7036185404340437511/",
  "displayUrl": "https://toutiao.com/group/7036185404340437511/",
  "shareUrl": "https://m.toutiaoimg.cn/i7036185404340437511/",
  "source": "兴森一家",
  "readCount": 1450943,
  "commentCount": 1010,
  "diggCount": 12049,
  "likeCount": 12049,
  "repinCount": 1226,
  "buryCount": 1,
  "contentHash": "3b08368c",
  "author": {
    "userId": "1398172224077304",
    "mediaId": "1655138971451403",
    "name": "兴森一家",
    "fansCount": 1501681,
    "avatarUrl": "https://...",
    "verified": "True",
    "description": "..."
  },
  "content": "<p>...</p>",
  "titleImage": "https://p11-sign.toutiaoimg.com/...",
  "imageUrls": ["https://..."],
  "source_endpoint": "app"
}
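
A dataset item like the one above can be reduced to a few typed fields for analysis. The sketch below is illustrative only: `ArticleSummary`, `summarize`, and the engagement-rate formula are assumptions rather than part of the Actor. In the sample, `diggCount` and `likeCount` hold the same value, so only one is counted, and `verified` arrives as the string "True", so it is converted to a boolean.

```python
from dataclasses import dataclass


@dataclass
class ArticleSummary:
    group_id: str
    title: str
    read_count: int
    engagement_rate: float  # (comments + diggs + repins) / reads; illustrative metric
    author_verified: bool


def summarize(item: dict) -> ArticleSummary:
    """Build a compact summary from one dataset item, tolerating missing fields."""
    reads = int(item.get("readCount") or 0)
    interactions = sum(
        int(item.get(key) or 0)
        for key in ("commentCount", "diggCount", "repinCount")
    )
    verified_raw = (item.get("author") or {}).get("verified")
    return ArticleSummary(
        group_id=str(item["groupId"]),
        title=item.get("title", ""),
        read_count=reads,
        engagement_rate=interactions / reads if reads else 0.0,
        # The sample output encodes this flag as the string "True".
        author_verified=str(verified_raw).strip().lower() == "true",
    )
```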

Notes

  • Data is sourced live. Toutiao occasionally rate-limits requests, so the Actor retries transient blocks with exponential backoff and sends a browser User-Agent.
  • This Actor is ID/URL-driven; there is no keyword search. Collect the group IDs of the articles you want first (the long number in any Toutiao article link).
  • Group IDs are de-duplicated within a run.
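
For readers unfamiliar with the retry strategy mentioned in the notes, exponential backoff can be sketched as below. This is a generic illustration, not the Actor's implementation: the `TransientBlock` exception, the delay values, and the jitter are all assumptions.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientBlock(Exception):
    """Hypothetical marker for a retryable failure, e.g. a rate-limit response."""


def retry_with_backoff(
    fetch: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch(), doubling the wait after each transient failure."""
    for attempt in range(max_attempts):
        try:
            return fetch()
        except TransientBlock:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last failure
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Up to 10% random jitter spreads out simultaneous retries.
            sleep(delay + random.uniform(0, delay / 10))
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the function testable without real waiting.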
