VOOZH about

URL: https://apify.com/joyouscam35875/github-repo-scraper

โ‡ฑ GitHub Repository Scraper โ€” Stars, Forks, Languages & More [DEPRECATED] ยท Apify


๐Ÿ‘ GitHub Repository Scraper โ€” Stars, Forks, Languages & More avatar

GitHub Repository Scraper โ€” Stars, Forks, Languages & More

Deprecated

Pricing

Pay per usage

Go to Apify Store

GitHub Repository Scraper โ€” Stars, Forks, Languages & More

Deprecated

Scrape GitHub repository data using the REST API v3. Get stars, forks, languages, topics, contributors, releases. Search repos by keyword. Perfect for tech stack analysis and competitive intelligence. $0.002/repo.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

๐Ÿ‘ Ken Digital

Ken Digital

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 months ago

Last modified

Categories

Share

Scrape GitHub repository metadata at scale using the official REST API v3. Get structured data on stars, forks, languages, topics, contributors, releases, and more โ€” ready for lead generation, market research, and competitive analysis.

What it does

  • Scrape specific repos โ€” provide a list of owner/repo strings
  • Search GitHub โ€” use any GitHub search syntax to discover repos
  • Enrich with extras โ€” optionally fetch top contributors and latest releases
  • Handle rate limits โ€” automatic back-off with X-RateLimit headers; optional token support for 5,000 req/hr

Output fields

FieldTypeDescription
ownerstringRepository owner login
namestringRepository name
fullNamestringowner/name
urlstringGitHub URL
descriptionstringRepository description
starsintStargazer count
forksintFork count
openIssuesintOpen issue count
languagestringPrimary language
languagesobjectAll languages with byte counts
topicsarrayTopic tags
createdAtstringISO 8601 creation date
updatedAtstringLast update date
pushedAtstringLast push date
licensestringSPDX license identifier
isArchivedboolWhether the repo is archived
isForkboolWhether the repo is a fork
defaultBranchstringDefault branch name
sizeintRepository size in KB
watchersintWatcher/subscriber count
homepagestringHomepage URL
topContributorsarrayTop 30 contributors (opt-in)
latestReleasesarrayLast 5 releases (opt-in)

Input examples

Scrape specific repositories

{
"repos":["apify/crawlee","microsoft/playwright","facebook/react"]
}

Search for Python web scraping tools

{
"searchQuery":"web scraping language:python stars:>100",
"maxRepos":50
}

Full enrichment with auth

{
"repos":["vercel/next.js"],
"searchQuery":"framework language:typescript stars:>5000",
"maxRepos":100,
"includeContributors":true,
"includeReleases":true,
"githubToken":"ghp_xxxxxxxxxxxx"
}

Rate limits

ModeRequests/hourRepos/run (approx)
No token60~20โ€“30 (2โ€“3 API calls per repo)
With token5,000~1,500โ€“2,500

Tip: Create a free personal access token (no scopes needed for public repos) to unlock 5,000 requests/hour.

Pricing

$0.002 per repository scraped (pay per event).

Cost comparison

ReposThis actorGitHub API (your infra)Manual research
10$0.02Free + your time~30 min
100$0.20Free + your time~5 hours
500$1.00Free + your time~2 days
1,000$2.00Free + your time~1 week

You pay for data, not infrastructure. No servers to maintain, no code to write, no rate limits to handle.

Use cases

  • Lead generation โ€” Find companies using specific technologies, contact repo owners
  • Competitive analysis โ€” Track competitor open-source projects, compare stars/forks growth
  • Technology research โ€” Discover trending tools in any language or domain
  • Talent sourcing โ€” Identify top contributors to relevant projects
  • Investment research โ€” Gauge open-source traction for developer tools companies
  • Academic research โ€” Collect repository metadata for software engineering studies
  • Dependency auditing โ€” Assess health (activity, issues, releases) of your dependencies

Technical details

  • Uses GitHub REST API v3 (api.github.com)
  • Automatic rate-limit detection and back-off via X-RateLimit-* headers
  • No browser or proxy needed โ€” pure API calls
  • Async execution with httpx for fast throughput
  • Outputs clean, structured JSON to the Apify dataset

๐Ÿ”— More Scrapers by Ken Digital

ScraperWhat it doesPrice
YouTube Channel ScraperVideos, stats, metadata$0.001/video
France Job ScraperWTTJ + France Travail + Hellowork$0.005/job
France Real Estate Scraper5 sources + DVF price analysis$0.008/listing
Website Content CrawlerHTML โ†’ Markdown for AI/RAG$0.001/page
Google Trends ScraperKeywords, regions, related queries$0.002/keyword
GitHub Repo ScraperStars, forks, languages, topics$0.002/repo
RSS News AggregatorMulti-source feed parsing$0.0005/article
Instagram Profile ScraperFollowers, bio, posts$0.0015/profile
Google Maps ScraperBusinesses, reviews, contacts$0.002/result
TikTok ScraperVideos, likes, shares$0.001/video
Google SERP ScraperSearch results, PAA, snippets$0.003/search
Trustpilot ScraperReviews, ratings, sentiment$0.001/review

๐Ÿ‘‰ View all scrapers

๐Ÿ”— Quick Integration

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("joyouscam35875/github-repo-scraper").call(run_input={...})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'YOUR_API_TOKEN'});
const run =await client.actor('joyouscam35875/github-repo-scraper').call({...});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();

No-code: Make / Zapier / n8n

Search for this actor in the Apify connector. No code needed.

You might also like

โญ GitHub Repo Explorer

dash_authority/github-repo-scraper

Search and analyze GitHub repositories โ€” stars, forks, language, topics, contributors. Filter by language and sort by popularity. Developer tools research made easy. $1/1K results.

๐Ÿ‘ User avatar

Dash Authority

2

GitHub Repository Scraper - Stars, Topics, Trending

logiover/github-repository-scraper

Scrape GitHub repos by search query and export stars, topics, forks & license to CSV/JSON. GitHub data export without an API key - trending repos scraper.

GitHub Trending Repositories Scraper

codingfrontend/github-trending-repositories-scraper

Scrape trending repositories from GitHub Trending. Extracts repo name, owner, description, stars, forks, language, contributors, and current period star gains.

๐Ÿ‘ User avatar

codingfrontend

2

Github Trending Scraper

viralanalyzer/github-trending-scraper

Scrape GitHub trending repositories, stars, forks, languages, and developer profiles. Track open source trends daily/weekly/monthly.

25

5.0

(3)

GitHub Repo Scraper

devilscrapes/github-repo-scraper

Fetch full GitHub repository metadata for one or many repos in one call โ€” stars, forks, languages, topics, license, default branch, latest release, contributor count โ€” export to JSON or CSV. A GitHub repo API wrapper; optional token for higher rate limits.