
URL: https://apify.com/josef.prochazka/commit-historian-agent



Pricing: Pay per event

Commit Historian Agent

A simple tool for analyzing GitHub repository commits. It checks out the repository and collects all relevant commit messages, then uses OpenAI to answer the questions asked by the user. It is built with the PydanticAI framework.


Rating: 0.0 (0)

Developer: Josef Procházka

Maintained by: Community

Actor stats

Bookmarked: 0

Total users: 2

Monthly active users: 1

Last modified: a year ago


How to run it

You can find this Actor in Apify Store and run it on the Apify platform.

Enter the repository name and your question, then start the Actor. Optionally, you can choose a specific branch if your question does not concern the repository's default branch.

If you do not provide your own OpenAI API key, the Actor uses our API key, which adds to the cost of each run. Passing your own OpenAI API key significantly reduces the run costs.
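The Actor can also be started from code instead of the web console. The following is a minimal sketch, assuming the `apify-client` Python package is installed and that the input fields are named `prompt` and `repository`, as in the example inputs on this page. The `branch` key is a guess at the field name for the optional branch input, and where the Actor stores its answer is not stated here, so the sketch only returns the run record.

```python
from typing import Optional

# Actor identifier taken from the store URL of this page.
ACTOR_ID = "josef.prochazka/commit-historian-agent"


def build_run_input(prompt: str, repository: str, branch: Optional[str] = None) -> dict:
    """Assemble the Actor input.

    `prompt` and `repository` match the example inputs on this page.
    `branch` is a hypothetical field name for the optional branch input.
    """
    run_input = {"prompt": prompt, "repository": repository}
    if branch:
        run_input["branch"] = branch  # assumption: actual key name may differ
    return run_input


def run_actor(apify_token: str, run_input: dict) -> Optional[dict]:
    """Start the Actor, wait for it to finish, and return the run record."""
    # Imported lazily so that building the input does not require the package.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(apify_token)
    return client.actor(ACTOR_ID).call(run_input=run_input)
```

Calling `run_actor(token, build_run_input("Show several most complicated changes done last month.", "apify/crawlee-python"))` with a valid Apify API token would start a run equivalent to filling in the form by hand.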

Example

Inputs:

prompt: Show several most complicated changes done last month.

repository: apify/crawlee-python

Result:

Here are some of the most complicated changes from last month in the apify/crawlee-python repository:

  1. Status Code Handling Update: This refactor involved removing parameters and methods related to HTTP error status codes in HTTP clients, moving logic to a different class, and updating tests to ensure proper handling of session blocking status codes and error codes that require retries or retires. This was a significant change due to the impact on multiple components such as Session, SessionPool, PlaywrightCrawler, and HttpCrawler (details here).

  2. Session Cookie Management: The approach to handling cookies in a session was changed from using a plain dictionary to a more sophisticated SessionCookies class incorporating CookieJar. This supports basic cookie parameters and multiple domains, requiring extensive updates to tests and support for multi-domain scenarios (details here).

  3. Fingerprint Integration: Integration of the browserforge package to enable fingerprint and header generation in PlaywrightCrawler was implemented. This added significant functionality to enhance the crawling process by using generated fingerprints (details here).

These complex changes involved substantial modifications to multiple parts of the codebase, including handling complex data structures, refactoring logic spread across different modules, and careful testing to ensure stability.

How does it work

This Actor defines one main AI agent that is responsible for processing the prompt and returning the desired output. The main agent uses one tool, which supplies it with commit summaries.

The commit summary tool suggests the time scope relevant to the prompt, fetches the raw commit messages within that time scope, and prefilters the commits according to whether they seem relevant to the main prompt. It uses two further AI agents through what the PydanticAI documentation describes as programmatic agent hand-off:

  • An agent responsible for suggesting the time scope of the prompt.
  • An agent responsible for deciding whether an individual commit is relevant to the prompt.
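The flow of the commit summary tool can be sketched in plain Python. This is an illustration under stated assumptions, not the Actor's source code: the two helper agents are replaced by ordinary callables (`suggest_time_scope` and `is_relevant`, both hypothetical names), the `Commit` type is invented for the example, and the sample commits and dates are made up. In the Actor itself each callable would be an LLM-backed PydanticAI agent.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Iterable, List, Tuple


@dataclass(frozen=True)
class Commit:
    """Hypothetical minimal commit record used only for this sketch."""
    sha: str
    day: date
    message: str


# Stand-ins for the two helper agents (LLM-backed in the real Actor).
TimeScopeAgent = Callable[[str], Tuple[date, date]]
RelevanceAgent = Callable[[str, Commit], bool]


def get_commit_summaries(
    prompt: str,
    commits: Iterable[Commit],
    suggest_time_scope: TimeScopeAgent,
    is_relevant: RelevanceAgent,
) -> List[str]:
    """Hand off to one agent for the time scope, then to another per commit."""
    # Step 1: ask the first agent which date range the prompt refers to.
    since, until = suggest_time_scope(prompt)
    # Step 2: keep only the raw commits inside that range.
    in_scope = [c for c in commits if since <= c.day <= until]
    # Step 3: ask the second agent about each remaining commit.
    return [f"{c.sha[:7]} {c.message}" for c in in_scope if is_relevant(prompt, c)]


# Deterministic stubs so the sketch runs without any LLM.
def demo_time_scope(prompt: str) -> Tuple[date, date]:
    return date(2025, 1, 1), date(2025, 1, 31)


def demo_is_relevant(prompt: str, commit: Commit) -> bool:
    return "refactor" in commit.message.lower()


sample_commits = [
    Commit("a1b2c3d4", date(2025, 1, 10), "Refactor status code handling"),
    Commit("e5f6a7b8", date(2025, 1, 12), "Fix typo in docs"),
    Commit("c9d0e1f2", date(2024, 12, 20), "Refactor session pool"),
]
summaries = get_commit_summaries(
    "Show several most complicated changes done last month.",
    sample_commits,
    demo_time_scope,
    demo_is_relevant,
)
print(summaries)  # ['a1b2c3d Refactor status code handling']
```

Filtering by date before asking about relevance means the per-commit agent is only called for commits inside the suggested time scope, which keeps the number of model calls proportional to the scope of the question rather than to the size of the repository history.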
