VOOZH about

URL: https://apify.com/moving_beacon-owner1/github----repository-search-data-scraper

⇱ GitHub β€” Repository Search & Data Scraper Β· Apify


πŸ‘ GitHub β€” Repository Search & Data Scraper avatar

GitHub β€” Repository Search & Data Scraper

Pricing

from $50.00 / 1,000 results

Go to Apify Store

GitHub β€” Repository Search & Data Scraper

Scrapes GitHub repository data via the GitHub REST API v3. Supports keyword search, advanced qualifiers, user/org listing, trending repos, topic browsing, and direct repo lookup. Works without an API key (rate-limited) or with a personal access token for higher throughput.

Pricing

from $50.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Jamshaid Arif

Jamshaid Arif

Maintained by Community

Actor stats

0

Bookmarked

8

Total users

3

Monthly active users

25 days ago

Last modified

Share

πŸ™ GitHub β€” Repository Search & Data Scraper β€” Apify Actor

Scrapes GitHub repository data via the GitHub REST API v3. Supports keyword search, advanced qualifiers, user/org listing, trending repos, topic browsing, and direct repo lookup. Works without an API key (rate-limited) or with a personal access token for higher throughput.

Features

  • 8 search modes β€” keyword search, user repos, org repos, topic browse, direct lookup, and 3 trending periods.
  • Advanced qualifiers β€” language, stars, forks, license, creation date, push date, archived/fork filters.
  • 4 optional extras β€” language breakdown, top contributors, releases with download counts, topic tags.
  • Smart rate limiting β€” automatic backoff on 403, separate delays for search vs. other endpoints, token-aware pacing.
  • 4 output formats β€” enriched (with computed metrics), raw, minimal, CSV-friendly.
  • Up to 1,000 search results β€” auto-paginates across 34 pages.

Search Modes

ModeWhat it does
searchKeyword search with qualifiers (up to 1,000 results)
user_reposAll public repos from a specific GitHub user
org_reposAll public repos from an organization
topic_browseRepos tagged with a specific topic
repos_by_nameFetch specific repos by owner/name
trending_todayRepos created today, sorted by stars
trending_weekRepos created this week, sorted by stars
trending_monthRepos created this month, sorted by stars

Input Examples

Search Python Web Scraping Tools

{
"mode":"search",
"searchQuery":"web scraping",
"language":"Python",
"minStars":100,
"sortBy":"stars",
"maxPages":5,
"fetchExtras":["languages","contributors"],
"outputFormat":"enriched"
}

All Google Org Repos

{
"mode":"org_repos",
"orgName":"google",
"maxPages":10,
"outputFormat":"enriched"
}

Trending Repos This Week

{
"mode":"trending_week",
"minStars":50,
"maxPages":5,
"outputFormat":"enriched"
}

Specific Repos with Releases

{
"mode":"repos_by_name",
"repoFullNames":"facebook/react, vuejs/vue, angular/angular, sveltejs/svelte",
"fetchExtras":["languages","releases","contributors"],
"outputFormat":"enriched"
}

MIT-Licensed TypeScript Projects

{
"mode":"search",
"searchQuery":"dashboard",
"language":"TypeScript",
"license":"mit",
"minStars":500,
"pushedAfter":"2024-06-01",
"sortBy":"stars"
}

Enriched Output Fields

FieldExample
full_namescrapy/scrapy
ownerscrapy
owner_typeOrganization
descriptionScrapy, a fast high-level…
urlhttps://github.com/scrapy/scrapy
homepagehttps://scrapy.org
languagePython
stars53,421
forks13,812
watchers53,421
open_issues487
size_kb23456
licenseBSD-3-Clause
topicspython, web-scraping, crawler
default_branchmaster
is_forkfalse
is_archivedfalse
has_wikitrue
has_pagestrue
created_at2010-02-22T02:23:08Z
pushed_at2025-04-04T18:30:00Z
age_days5520
stars_per_day9.68
fork_to_star_ratio0.259

Extra Fields

ExtraFields Added
Languageslanguages_breakdown (lang β†’ bytes + %), languages_flat, language_count
Contributorstop_contributors (login, commits, avatar Γ— 10), top_contributor
Releasesreleases (tag, date, downloads Γ— 5), latest_release, total_release_downloads
Topicsall_topics (array), topic_count

Rate Limits & Token

Without TokenWith Token
Search API10 requests/min30 requests/min
Other endpoints60 requests/hr5,000 requests/hr
Search results cap1,0001,000

To create a token: github.com β†’ Settings β†’ Developer Settings β†’ Personal Access Tokens β†’ Fine-grained tokens. No special permissions are needed for public repos.

The actor automatically handles rate limiting: it reads x-ratelimit-remaining headers and waits for resets when exhausted.

You might also like

GitHub Repository & Trending Scraper

rupom888/github-repository-scraper

Search GitHub repos, scrape user profiles with repos, get repo details with contributors, or track GitHub trending. Uses public API - optional token for higher rate limits.

GitHub Repository Scraper - Stars, Topics, Trending

logiover/github-repository-scraper

Scrape GitHub repos by search query and export stars, topics, forks & license to CSV/JSON. GitHub data export without an API key - trending repos scraper.

GitHub Scraper

automation-lab/github-scraper

Extract data from GitHub β€” repository details, developer profiles, trending repos, and search results. Stars, forks, languages, topics, and more. No API key needed.

πŸ‘ User avatar

Stas Persiianenko

37

GitHub Repo Scraper

devilscrapes/github-repo-scraper

Fetch full GitHub repository metadata for one or many repos in one call β€” stars, forks, languages, topics, license, default branch, latest release, contributor count β€” export to JSON or CSV. A GitHub repo API wrapper; optional token for higher rate limits.