VOOZH about

URL: https://apify.com/dash_authority/github-repo-scraper

⇱ ⭐ GitHub Repo Explorer [DEPRECATED] Β· Apify


πŸ‘ ⭐ GitHub Repo Explorer avatar

⭐ GitHub Repo Explorer

Deprecated

Pricing

$3.00 / 1,000 results

Go to Apify Store

⭐ GitHub Repo Explorer

Deprecated

Search and analyze GitHub repositories β€” stars, forks, language, topics, contributors. Filter by language and sort by popularity. Developer tools research made easy. $1/1K results.

Pricing

$3.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Dash Authority

Dash Authority

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

2 months ago

Last modified

Share

GitHub Repository Scraper

Search and extract GitHub repository data β€” stars, forks, language, description, topics, license, and owner info. No API key required. Uses GitHub's public REST API under the hood.

Use Cases

Tech Market Research: Find all repos in a niche. How many Python web frameworks have 1,000+ stars? Who's actively maintained? What licenses dominate?

Developer Prospecting: Identify repo owners and contributors by language and topic. Useful for recruiting or partnership outreach.

Open Source Monitoring: Track stars, forks, and issue counts over time. Spot trending projects before they blow up.

Competitive Analysis: Compare repos in the same space. Which CLI tool has the most traction? Which library gets updated most often?


Input

FieldTypeDescription
querystringSearch keywords (required)
languagestringFilter by language (e.g., "Python", "TypeScript")
topicstringFilter by topic (e.g., "machine-learning")
sortstringSort by: stars, forks, updated, or best-match (default)
minStarsintegerMinimum star count filter
maxResultsintegerMax repos to return (default: 30)
proxyConfigurationobjectApify proxy settings

Search Tips

  • GitHub search supports qualifiers directly in query: "web scraper language:Python stars:>500"
  • Use minStars to filter out noise. Setting it to 100+ gives you repos people actually use.
  • sort: "updated" catches recently active projects β€” good for finding maintained alternatives to abandoned ones.

Output

Each result is a repository profile:

{
"name":"crawl4ai",
"fullName":"unclecode/crawl4ai",
"url":"https://github.com/unclecode/crawl4ai",
"description":"Open-source LLM Friendly Web Crawler & Scraper",
"language":"Python",
"stars":64028,
"forks":6562,
"watchers":64028,
"openIssues":69,
"topics":["web-scraping","llm","crawler"],
"license":"Apache-2.0",
"defaultBranch":"main",
"createdAt":"2024-05-09T09:48:50Z",
"updatedAt":"2026-04-15T12:18:34Z",
"pushedAt":"2026-04-11T09:27:40Z",
"size":150467,
"archived":false,
"fork":false,
"owner":{
"login":"unclecode",
"type":"User",
"avatarUrl":"https://avatars.githubusercontent.com/u/12494079?v=4"
}
}

Key Fields

  • fullName β€” owner/repo format, unique identifier
  • stars, forks, watchers β€” traction metrics
  • openIssues β€” active issue count (high can mean active development or poor maintenance)
  • topics β€” GitHub topic tags for categorization
  • license β€” SPDX license identifier
  • pushedAt β€” last commit date β€” the best signal for "is this maintained?"
  • size β€” repo size in KB
  • archived / fork β€” filter out dead or derivative repos

Integrations & API

  • Export formats: JSON, CSV, Excel, XML, HTML
  • Scheduling: Track star growth weekly to spot trends.
  • API: Apify Python or Node.js client for automated analysis pipelines.
  • Combine with: Other scrapers β€” scrape a repo's README, then its contributors, then its dependencies.

FAQ

Does this use GitHub's official API? Yes β€” GitHub's public REST API. No authentication token needed for public repos. Rate limits apply (60 requests/hour unauthenticated).

Can I scrape private repos? No. This scraper only works with public repositories.

How many results can I get? GitHub's search API caps at 1,000 results per query. For larger datasets, break your search into smaller chunks by language or topic.

What's the difference between stars and watchers? They're identical for public repos since GitHub unified them. Both reflect the star count.

Is pushedAt reliable for checking maintenance? It's the best quick signal, but a repo with a recent pushedAt might just have CI config updates. Check the commit history for real activity.


Support

Open an issue in the Issues tab for bugs or feature requests.

You might also like

Crates.io Rust Crates Scraper

parseforge/crates-io-rust-scraper

Search crates.io, the Rust package registry. Returns crate name, version, description, repository, homepage, documentation, license, downloads, recent downloads, last updated, categories, keywords, owner team, dependencies, and yank status. Search by keyword or look up specific crates by name.

Hugging Face Papers Scraper

parseforge/huggingface-papers-scraper

Scrape AI and machine learning research papers from Hugging Face Papers. Get titles, abstracts, authors with affiliations, upvotes, publication dates, ArXiv IDs, and community discussion counts. Search by keyword or browse daily papers.

GitLab Public Projects Scraper | Stars, Forks, Topics

parseforge/gitlab-public-projects-scraper

Harvest records from multiple Gitlab sources in a single run and get a unified, normalized result set. Pull names, identifiers, dates, descriptions, status flags and source links per record. Perfect for research, lead generation and intelligence pipelines.

GitHub repositories Scraper - Low-costπŸ’²πŸ”₯πŸ“¦πŸ™

delectable_incubator/github-repositories-scraper-low-cost

Scrape GitHub repositories πŸ“¦πŸ™ with a powerful developer data scraper. Extract repository names, descriptions, programming languages, stars, topics, forks, and repository URLs from any GitHub profile. Ideal for open-source analysis, developer scouting, technology research and market insights πŸ“ŠπŸš€

OSV & GitHub Security Scraper

taroyamada/oss-vulnerability-monitor

Scrape GitHub Security Advisories and OSV databases to extract CVSS v3.1 base scores, fixed version tags, and patching details for your tech stack.

GitHub Release & Tag Scraper

taroyamada/github-release-monitor

Automate your dependency tracking by monitoring GitHub repositories for new releases, extracting version bumps and changelog summaries.

Airbnb Images Scraper

scraper-engine/airbnb-images-scraper

The Airbnb Images Scraper collects high-quality images from Airbnb listings by URL or search results. It helps researchers, analysts, and marketers access property photos for design inspiration, content creation, and competitive insights, streamlining visual data gathering at scale.

πŸ‘ User avatar

Scraper Engine

13

Bing Search Results Scraper

parseforge/bing-search-scraper

Pull Bing organic search results with rank, title, URL, snippet, displayed URL, deep links, and date. Filter by country, market, language, safe-search, and freshness. Export clean JSON, CSV, or Excel for SEO research, competitor monitoring, content benchmarking, and SERP intelligence.

πŸ” Subdomain Finder & CT Log Scraper

taroyamada/subdomain-finder

Map website architectures by extracting subdomains from public Certificate Transparency logs to find unlinked staging sites.

Tech Events & CFP Calendar Scraper

taroyamada/tech-events-intelligence

Extract upcoming developer conferences, local meetups, and open Call for Papers (CFP) deadlines to build a comprehensive speaking schedule for your DevRel team.