VOOZH about

URL: https://apify.com/njoylab/git-commit-authors-emails

⇱ Git Commit Authors & Emails Β· Apify


Pricing

from $0.50 / 1,000 results

Go to Apify Store

Git Commit Authors & Emails

Extract commit emails from one or more Git repositories and aggregate commit counts per email. Process multiple repos in one run, detect no-reply addresses, map author aliases, and publish both dataset rows and a structured OUTPUT record for fast analysis, exports, and automation workflows at scale

Pricing

from $0.50 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ njoylab

njoylab

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

2 months ago

Last modified

Share

Git Commit Emails Extractor (Apify Actor)

Extract commit emails from one or more Git repositories and aggregate commit counts per email.

What this actor does

  • Accepts multiple repositories as input.
  • Supports:
    • GitHub URL (https://github.com/owner/repo)
    • Git URL (git@github.com:owner/repo.git)
    • GitHub shorthand (owner/repo)
  • Reads commit history from HEAD.
  • Outputs:
    • Dataset rows with commit-email aggregates (one row per email, no type field)
    • Repository summaries in key-value store under OUTPUT

Input

{
"repositories":[
"https://github.com/<user>/<repository>"
],
"excludeNoReply":true,
"normalizeEmails":true,
"includeAuthorAliases":true
}

Input fields

  • repositories (required): list of repositories to process.
  • excludeNoReply (optional, default false): exclude *@users.noreply.github.com and *@noreply.*.
  • normalizeEmails (optional, default true): lowercase emails before grouping.
  • includeAuthorAliases (optional, default true): include all author names seen for each email.
  • branch (optional): branch name used when cloning remote repositories. If the branch does not exist in a repository, the actor automatically falls back to that repository's default branch.

Output examples

Dataset email item

{
"repositoryInput":"apify/crawlee",
"repositoryName":"apify/crawlee",
"resolvedRepository":"https://github.com/apify/crawlee",
"email":"[EMAIL_ADDRESS]",
"commitCount":123,
"isNoReply":false,
"authorName":"<author name>",
"authorAliases":[
"<author name>"
]
}

OUTPUT (key-value store) summary

{
"startedAt":"2026-02-12T00:00:00.000Z",
"finishedAt":"2026-02-12T00:00:15.000Z",
"settings":{
"excludeNoReply":true,
"normalizeEmails":true,
"includeAuthorAliases":true,
"branch":null
},
"repositorySummaries":[
{
"repositoryInput":"https://github.com/apify/crawlee",
"repositoryName":"apify/crawlee",
"resolvedRepository":"https://github.com/apify/crawlee.git",
"totalCommitsInHead":9339,
"totalCommitRowsRead":9339,
"totalCommitsAfterFilters":9339,
"uniqueEmails":450,
"durationMs":1450,
"success":true
}
]
}

Notes

  • If contributors use masked emails (for example GitHub noreply), the actor cannot infer private real emails.
  • Large repositories can produce large datasets.

Disclaimer

This actor extracts commit metadata that may include personal email addresses. Its purpose is to raise awareness about how data such as email addresses can be exposed in Git repositories. Use the output only for legitimate and compliant purposes, and always follow applicable privacy laws, platform terms, and anti-spam rules. You are responsible for how extracted data is stored, shared, and used.

You might also like

Git Commit Parser Spider

getdataforme/git-commit-parser-spider

Powerful Apify Actor for scraping GitHub commit data. Extracts detailed info on authors, messages, timestamps, and OIDs for project analysis. Features robust batch processing, JSON output, error handling, and seamless integration for developers and researchers.

GitHub Push Event Scraper

krab/github-push-event-scraper

Extracts GitHub push events with your specified date range. Scrapes commit metadata, including author info (name, email), commit messages, and URLs for users, commits, and repos. Ideal for monitoring repository activity or tracking user contributions.

GitHub Scraper - Repos, Stars, Issues & Profiles

cryptosignals/github-scraper

Scrape GitHub repositories, profiles, and issues β€” extract stars, forks, contributors, README, commit history, and topics. CSV/JSON output. No login.

27

GitHub Repo Stats. Stars, Forks, Languages, Contributors

seemuapps/github-repo-stats-scraper

Get stars, forks, issues, language breakdown, license, last commit, and contributor counts for any GitHub repository. Bulk-process a list of repos in one run.

All In One Social Media Email Scraper

scraper-engine/all-in-one-social-media-email-scraper

All In One Social Media Email Scraper extracts publicly available email addresses from multiple social media platforms in one run. Build targeted contact lists across networks for sales, marketing, and outreach campaigns.

πŸ‘ User avatar

Scraper Engine

2

Related articles

Best email extractors for sales ops
Read more
8 best email scrapers in 2026
Read more