VOOZH about

URL: https://www.analyticsvidhya.com/blog/2018/02/top-5-github-repositories-january-2018/

⇱ Top 5 Data Science, Machine Learning GitHub Repositories Jan 2018


India's Most Futuristic AI Conference Is Back – Bigger, Sharper, Bolder

  • d
  • :
  • h
  • :
  • m
  • :
  • s

Reading list

Top 5 Data Science & Machine Learning Repositories on GitHub in Jan 2018

Pranav Dar Last Updated : 05 Jun, 2020
4 min read

Introduction

Breakthroughs in data science and machine learning are happening at a break-neck pace. If you are working in this field, it’s extremely important to keep yourself updated with what’s new.

Following GitHub repositories is one such way to do so. You can see the latest developments, interesting projects and their applications. I can not tell how much learning can happen through this.

👁 Image

You can download the code and run it on your own machine or simply just keep it as a reference point for your project. Whatever the application, GitHub communities are invaluable resources.

In this post, we look at 5 GitHub repositories created in January 2018 that you must follow. This is part of a series from Analytics Vidhya that will run every month.

Detectron

👁 Image

Detectron is a software system developed by Facebook’s AI Research team (FAIR) that “implements state-of the art object detection algorithms”. It is written in Python and leverages the Caffee2 deep learning framework underneath.

Along with the Python code, FAIR has also released performance baselines for over 70 pre-trained models. Once the model(s) is trained, it can be deployed on the cloud and even on mobile devices.

Detectron has been covered by us here.

DeepReinforcementLearning

👁 Image

This is a replica of the AlphaZero methodology developed in Python. The author has written the code to train an algorithm to play the Connect4 game. It’s not quite as complex as the famed ‘Go’ game, but there are 4,531,985,219,092 possible game positions so it’s perfect for this situation.

The main advantages of this repository are two-fold, namely:

  1. How you can build a replica of the AlphaZero methodology to play the game Connect4
  2. How you can adapt the code to plug in other games

Run it and you will see the beauty in AlphaGo!

Caire

Caire is a content-aware image resizing library. Currently, most applications either give you the option of cropping an image or changing it’s aspect ratio. This often leads to either the main parts being left out or the image becoming blurred. This is where Caire comes into play.

It has support for both shrinking and enlarging any image, resizing it horizontally or vertically and does not require any third party library. It uses edge detection to generate an energy map of the image. Based on that, it finds seams in the image and uses it’s algorithm accordingly. The process of how this works has been illustrated in the three images below:

👁 Image

It is based on the Seam Carving for Content-Aware Image Resizing paper. This has been covered by Analytics Vidhya here.

Minigo

Covered by Analytics Vidhya here, this is an open-source Python implementation inspired by DeepMind’s AlphaGo. It’s a Neural Network based AI, developed using Tensorflow.

👁 Image

                                                              Source: WIRED

The goals of this project, as described by the authors, are listed below:

  1. Provide a clear set of learning examples using Tensorflow, Kubernetes, and Google Cloud Platform for establishing Reinforcement Learning pipelines on various hardware accelerators.
  2. Reproduce the methods of the original DeepMind AlphaGo papers as faithfully as possible, through an open-source implementation and open-source pipeline tools.
  3. Provide our data, results, and discoveries in the open to benefit the Go, machine learning, and Kubernetes communities.

You can access the entire Python code on this GitHub repository.

Alpha Pose

👁 Image

Alpha Pose is a remarkably accurate tool to estimate the poses of multiple people (you can see this in their GitHub’s GIFs). It’s the first open-source systems that has achieved 70+ mAP on the COCO dataset 80+ mAP on the MPII dataset. Additionally, the authors have also developed ‘Pose Flow’, which is an online pose tracker.

And here are two bonus repositories for you!

VisualDL

👁 Image

VisualDL is a tool that can visualize the entire deep learning process for us. It’s an incredibly powerful visualization tool that helps us design deep learning jobs. VisualDL was built to support Python. Just by adding a few lines of Python code and inserting them into our neural network model, we can generate plenty of visualizations to understand the framework. VisualDL has also been written in low level C++.

Currently, VisualDL provides four components (more will be added soon):

  • graph
  • scalar
  • image
  • histogram

You can read more about these components, and how VisualDL works, in our post here.

TensorFlow Project Template

There are a ton of things to do when starting a TensorFlow project. The underlying idea behind this repository is to wrap up thonse things into a simple and well-defined structure. The TensorFlow Project Template combines simplicity, best practices for creating and maintaining folder structure and excellent OOP design.

Do you know of any other repositories created last month that we should be aware of? Feel free to let us know in the comments below.

Senior Editor at Analytics Vidhya.Data visualization practitioner who loves reading and delving deeper into the data science and machine learning arts. Always looking for new ways to improve processes using ML and AI.

Login to continue reading and enjoy expert-curated content.

Free Courses

Exploratory Data Analysis with Python & GenAI

Learn EDA with Python: Transform data into insights using PandasAI & more.

Data Science Course

Build a powerful 2026-ready data science resume using AI tools.

Understanding the working of Neural Networks

Learn the neural network basics, concepts, layers, and activation functions.

No Code Predictive Analytics with Orange

No-code AI course for business pros with real-world ML use cases.

Adaptive Email Agents with DSPy

Build adaptive email agents with DSPy using context and smart learning.

Responses From Readers

raymond doctor

Hi, Am looking for a simple tools for prediction using AI. I have a large trained data with the following format A=B in which A is the source and B is the target language. Am not very good in coding. I work with C, Perl. Awk. Sed. . Is there a simple tool in Windows by which data can be trained using 300,000 samples and then used to predict from a test data? Many thanks for the infor

Top 5 Data Science Machine Learning Repositories on GitHub in Jan 2018

[…] This is part of a series from Analytics Vidhya that will run every month. Detectron is a software system developed by Facebook’s AI Research team (FAIR) that “implements state-of the art object detection algorithms”. Read more from analyticsvidhya.com… […]

Saad Raja

I am fond of Github such a great platform to discover amazing repositories. Today we all need to connect with latest technology advancements and Github is the main boost for us to discover what we are looking for.

Flagship Programs

GenAI Pinnacle Program| GenAI Pinnacle Plus Program| AI/ML BlackBelt Program| Agentic AI Pioneer Program

Free Courses

Generative AI| DeepSeek| OpenAI Agent SDK| LLM Applications using Prompt Engineering| DeepSeek from Scratch| Stability.AI| SSM & MAMBA| RAG Systems using LlamaIndex| Building LLMs for Code| Python| Microsoft Excel| Machine Learning| Deep Learning| Mastering Multimodal RAG| Introduction to Transformer Model| Bagging & Boosting| Loan Prediction| Time Series Forecasting| Tableau| Business Analytics| Vibe Coding in Windsurf| Model Deployment using FastAPI| Building Data Analyst AI Agent| Getting started with OpenAI o3-mini| Introduction to Transformers and Attention Mechanisms

Popular Categories

AI Agents| Generative AI| Prompt Engineering| Generative AI Application| News| Technical Guides| AI Tools| Interview Preparation| Research Papers| Success Stories| Quiz| Use Cases| Listicles

Generative AI Tools and Techniques

GANs| VAEs| Transformers| StyleGAN| Pix2Pix| Autoencoders| GPT| BERT| Word2Vec| LSTM| Attention Mechanisms| Diffusion Models| LLMs| SLMs| Encoder Decoder Models| Prompt Engineering| LangChain| LlamaIndex| RAG| Fine-tuning| LangChain AI Agent| Multimodal Models| RNNs| DCGAN| ProGAN| Text-to-Image Models| DDPM| Document Question Answering| Imagen| T5 (Text-to-Text Transfer Transformer)| Seq2seq Models| WaveNet| Attention Is All You Need (Transformer Architecture) | WindSurf| Cursor

Popular GenAI Models

Llama 4| Llama 3.1| GPT 4.5| GPT 4.1| GPT 4o| o3-mini| Sora| DeepSeek R1| DeepSeek V3| Janus Pro| Veo 2| Gemini 2.5 Pro| Gemini 2.0| Gemma 3| Claude Sonnet 3.7| Claude 3.5 Sonnet| Phi 4| Phi 3.5| Mistral Small 3.1| Mistral NeMo| Mistral-7b| Bedrock| Vertex AI| Qwen QwQ 32B| Qwen 2| Qwen 2.5 VL| Qwen Chat| Grok 3

AI Development Frameworks

n8n| LangChain| Agent SDK| A2A by Google| SmolAgents| LangGraph| CrewAI| Agno| LangFlow| AutoGen| LlamaIndex| Swarm| AutoGPT

Data Science Tools and Techniques

Python| R| SQL| Jupyter Notebooks| TensorFlow| Scikit-learn| PyTorch| Tableau| Apache Spark| Matplotlib| Seaborn| Pandas| Hadoop| Docker| Git| Keras| Apache Kafka| AWS| NLP| Random Forest| Computer Vision| Data Visualization| Data Exploration| Big Data| Common Machine Learning Algorithms| Machine Learning| Google Data Science Agent
👁 Av Logo White

Continue your learning for FREE

Forgot your password?
👁 Av Logo White

Enter OTP sent to

Edit

Wrong OTP.

Enter the OTP

Resend OTP

Resend OTP in 45s

👁 Popup Banner
👁 AI Popup Banner