VOOZH about

URL: https://www.analyticsvidhya.com/blog/2018/03/top-5-github-repositories-february-2018/

⇱ Top Data Science, Machine Learning GitHub Repositories Feb 2018


India's Most Futuristic AI Conference Is Back – Bigger, Sharper, Bolder

  • d
  • :
  • h
  • :
  • m
  • :
  • s

Reading list

Top 5 Data Science & Machine Learning Repositories on GitHub in Feb 2018

Pranav Dar Last Updated : 05 Jun, 2020
3 min read

Introduction

Continuing our theme of collecting and sharing the top machine learning GitHub repositories every month, the February edition is fresh off the shelves ready for you!

GitHub repositories are one of the easiest and best things for all the people working in data science to keep ourselves updated with the latest developments and projects. It’s also an awesome collaboration tool where we can connect with other like minded data scientists on various projects.👁 Image

Without any further ado, let’s dive into this month’s list.

This is part of a series from Analytics Vidhya that will run every month. You can check out the top 5 repositories that we picked out in January here.

FastPhotoStyle

👁 Image

FastPhotoStyle is a python library developed by NVIDIA. The model takes a content photo and a style photo as inputs. It then transfers the style of the style photo to the content photo.

The developers have cited two examples to show how the algorithm works. The first is a very simple iteration – you download a content and a style image, re-size them, and then simply run the photorealistic image stylization code. In the second example, semantic label maps are used to create the stylized image.

You can read more about this library on Analytics Vidhya’s blog here.

Twitter Scraper

👁 Image

If you’ve ever scraped tweets from Twitter, you have experience working with it’s API. It has it’s limitations and is not easy to work with. This python library was created with that in mind – it has no API rate limits (does not require authentication), no limitations, and is ultra quick. You can use this library to scrape the tweets of any user trivially

The developer has mentioned that it can be used for making Markov Chains. Do note that it works only with python version 3.6+.

Handwriting Synthesis

👁 Image

This is an implementation of the handwriting synthesis experiments presented in the ‘Generating Sequences with Recurrent Neural Networks’ paper by Alex Graves. As the name of the repository suggests, you can generate different styles of handwriting. The model is based on priming and biasing. Priming controls the style of the samples and biasing controls the neatness of the samples.

The samples presented by the author on the GitHub page are truly fascinating in their diversity. He is looking for contributors to enhance the repository so if you’re interested, get in touch with him!

ENAS PyTorch

👁 Image

This is a PyTorch implementation of “Efficient Neural Architecture Search (ENAS) via Parameters Sharing”. What do ENAS do? They reduce the computational requirement, that is, the GPU Hours of the Neural Architecture Search by an incredible 1000 times. They do this via parameter sharing between models that are subgraphs within a large computational graph.

The process of how to use it have been neatly explained on the GitHub page. The prerequisites for implementing this library are:

  • Python 3.6+
  • PyTorch
  • tqdm, imageio, graphviz, tqdm, tensorboardX

Sign Language

👁 Image

                                                     Source: Wikipedia

This is a relatively straightforward, yet utterly fascinating, use of machine learning. Using a convolutional neural network in python, the developer has built a model that can recognize the hand gestures and convert it into text on the machine.

The author of this repository built the CNN model using both TensorFlow and Keras. He has specified, in detail, how he went about creating this project and each step he followed. It’s definitely worth checking out and trying once on your own machine.

Did you find these helpful? Or are you aware of any other GitHub repositories the AV community should know about? Let us know in the comments section below!

Senior Editor at Analytics Vidhya.Data visualization practitioner who loves reading and delving deeper into the data science and machine learning arts. Always looking for new ways to improve processes using ML and AI.

Login to continue reading and enjoy expert-curated content.

Free Courses

AI Interview Questions & Answers Masterclass

Master AI interview questions with expert answers.

A Complete MLops Journey

Start your MLOps Journey! Learn MLOPs fundamentals with free certification.

Building Smarter LLMs with Mamba and State Space Model

Master Mamba's state space model for LLMs: Efficient, scalable training

Model Deployment using FastAPI; Prepare, Train, and Test FastAPI Application

Deploy a fastapi machine learning model with XGBoost and Docker APIs.

Build Data Pipelines with Apache Airflow

Learn ETL pipeline building and workflow orchestration with Airflow.

Responses From Readers

raymond doctor

Hi, I need a simple prediction tool using CNN /Tensorflow+Back propagation which will allow me to train data . My data at present is Sindhi written in Arabic script and mapped to Devanagari script. A small sample شرِڙاٽُ=शरिड़ाटु شرڌانجليِ=श्रद्धांजली شرڙاٽُ=शरड़ाटु شرڻارٿيِ=शरणार्थी شسترشالا=शस्त्रशाला شسترهيڻُ=शस्त्रहीणु شسترُ=शस्त्रु ششماهيِ=शशमाही ششُ=शिषु I have around 300,000 samples At present I am writing rules to handle this, but am sure that a tool in Python can solve this. Any pointers to such a tool will be most welcome. Thanks in advance.

123 2
Faizan Shaikh

Hi Raymond, You would have to build a machine translation model from scratch for this data, as I don't think you would find pretrained models for a similar problem. You can refer this article for pointers

123 456
VB

I came across a white paper once which have been implemented on arabic text recognition. You have to google that and I think it claims to be powerful. If you are lucky, you might find github repo for that implementation.

123 456

Flagship Programs

GenAI Pinnacle Program| GenAI Pinnacle Plus Program| AI/ML BlackBelt Program| Agentic AI Pioneer Program

Free Courses

Generative AI| DeepSeek| OpenAI Agent SDK| LLM Applications using Prompt Engineering| DeepSeek from Scratch| Stability.AI| SSM & MAMBA| RAG Systems using LlamaIndex| Building LLMs for Code| Python| Microsoft Excel| Machine Learning| Deep Learning| Mastering Multimodal RAG| Introduction to Transformer Model| Bagging & Boosting| Loan Prediction| Time Series Forecasting| Tableau| Business Analytics| Vibe Coding in Windsurf| Model Deployment using FastAPI| Building Data Analyst AI Agent| Getting started with OpenAI o3-mini| Introduction to Transformers and Attention Mechanisms

Popular Categories

AI Agents| Generative AI| Prompt Engineering| Generative AI Application| News| Technical Guides| AI Tools| Interview Preparation| Research Papers| Success Stories| Quiz| Use Cases| Listicles

Generative AI Tools and Techniques

GANs| VAEs| Transformers| StyleGAN| Pix2Pix| Autoencoders| GPT| BERT| Word2Vec| LSTM| Attention Mechanisms| Diffusion Models| LLMs| SLMs| Encoder Decoder Models| Prompt Engineering| LangChain| LlamaIndex| RAG| Fine-tuning| LangChain AI Agent| Multimodal Models| RNNs| DCGAN| ProGAN| Text-to-Image Models| DDPM| Document Question Answering| Imagen| T5 (Text-to-Text Transfer Transformer)| Seq2seq Models| WaveNet| Attention Is All You Need (Transformer Architecture) | WindSurf| Cursor

Popular GenAI Models

Llama 4| Llama 3.1| GPT 4.5| GPT 4.1| GPT 4o| o3-mini| Sora| DeepSeek R1| DeepSeek V3| Janus Pro| Veo 2| Gemini 2.5 Pro| Gemini 2.0| Gemma 3| Claude Sonnet 3.7| Claude 3.5 Sonnet| Phi 4| Phi 3.5| Mistral Small 3.1| Mistral NeMo| Mistral-7b| Bedrock| Vertex AI| Qwen QwQ 32B| Qwen 2| Qwen 2.5 VL| Qwen Chat| Grok 3

AI Development Frameworks

n8n| LangChain| Agent SDK| A2A by Google| SmolAgents| LangGraph| CrewAI| Agno| LangFlow| AutoGen| LlamaIndex| Swarm| AutoGPT

Data Science Tools and Techniques

Python| R| SQL| Jupyter Notebooks| TensorFlow| Scikit-learn| PyTorch| Tableau| Apache Spark| Matplotlib| Seaborn| Pandas| Hadoop| Docker| Git| Keras| Apache Kafka| AWS| NLP| Random Forest| Computer Vision| Data Visualization| Data Exploration| Big Data| Common Machine Learning Algorithms| Machine Learning| Google Data Science Agent
👁 Av Logo White

Continue your learning for FREE

Forgot your password?
👁 Av Logo White

Enter OTP sent to

Edit

Wrong OTP.

Enter the OTP

Resend OTP

Resend OTP in 45s

👁 Popup Banner
👁 AI Popup Banner