VOOZH about

URL: https://www.analyticsvidhya.com/blog/2018/05/top-5-github-reddit-data-science-machine-learning-april-2018/

โ‡ฑ Best GitHub Repositories | Reddit Discussion April 2018


India's Most Futuristic AI Conference Is Back โ€“ Bigger, Sharper, Bolder

  • d
  • :
  • h
  • :
  • m
  • :
  • s

Reading list

Top 5 GitHub Repositories and Reddit Discussions for Data Science & Machine Learning (April 2018)

Pranav Dar Last Updated : 31 May, 2020
5 min read

Introduction

GitHub and Reddit are two of the most popular platforms when it comes to data science and machine learning. The former is an awesome tool for sharing and collaborating on codes and projects while the latter is the best platform out there for engaging with data science enthusiasts from around the world.

This year, we have covered the top GitHub repositories each month and from this month onwards, we will be including the top Reddit threads as well that generated the most interesting and intriguing discussions in the machine learning space.

April saw some amazing python libraries being open sourced. From Deep Painterly Harmonization, a library that makes manipulated images look ultra realistic, to Swift for TensorFlow, this article covers the best from last month.
๐Ÿ‘ Image

Letโ€™s look at Aprilโ€™s top repositories and most interesting Reddit discussions.

You can check out the top GitHub repositories for the last three months below:

GitHub Repositories

Deep Painterly Harmonization

๐Ÿ‘ Image

The task of manipulating images and still making them look realistic has been around for ages. But with deep learning, this is becoming far more efficient and remarkably life-like. A developer has come up with an algorithm that takes a painting, adds an external element to it, and harmonizes it to make it look almost undistinguishable from the original painting.

Just look at the above image โ€“ the third frame is the final output and if we didnโ€™t have the preceding two images, we would never be able to tell the balloon is an external object! This algorithm produces far more precise results than photo compositing or global stylization techniques and it achieves levels of edits that have so far been very difficult to achieve.

You can read more about this library on AVBytes here.

Swift for TensorFlow

๐Ÿ‘ Image

Swift for TensorFlow was demoโ€™d at the TensorFlow Developer Summit last month and the team behind the technology has now open sourced the code on GitHub for the entire community. Their aim is to provide a new interface to TensorFlow that will build on itโ€™s already awesome capabilities, while taking itโ€™s usability to a whole new level.

This is still in itโ€™s very nascent stages so it isnโ€™t ready to be written into deep learning models yet. The team admits that the goals it has in mind while launching this are still a while away from being achieved. But there is a lot of potential here that is as yet untapped.

We have covered Swift for TensorFlow here for your reference.

MUNIT: Multimodal UNsupervised Image-to-image Translation

๐Ÿ‘ Image

A team of researchers from Cornell University have proposed a Multimodal Unsupervised Image-to-image Translation (MUNIT) framework for translating images from one domain to another. The aim is to take an image and generate a new image from it that is from a new category (for instance, transforming an image of a dog to a cat).

The previously existing approaches are able to perform only one-to-one mapping of the given an image and thus fail to produce diverse outputs of the same. MUNIT, on the other hand is able to provide more than one output. Exciting times!

We covered this on AVBytes and you can read about how it works here.

GluonNLP

๐Ÿ‘ Image

Deep Learning in the field of Natural Language Processing has taken off in a big way recently. There is a plethora of text available on the internet, dating back to centuries! GluonNLP is a toolkit that aims to make NLP tasks easier for a data scientist. It makes text preprocessing easier, along with loading the dataset(s) and building the deep learning neural models. This enables you to to do your NLP research faster and in a more efficient manner.

This repository has a nice documentation, along with a detailed example of how to use the library. They even have a nicely packaged 60-minute crash course for folks who are new to Gluon.

PyTorch GAN

This repository is a goldmine. Itโ€™s a collection of PyTorch implementations of GANs (or Generative Adversarial Networks) that have been presented in research papers. Currently the repository lists 24 different implementations, each adding to your knowledge in its unique way. The list contains implementations like Adversarial Autoencoders, CycleGAN, Least Squares GAN, Pix2Pix, etc.

Reddit Discussions

Helping Users in Understanding Research Papers

If youโ€™re having trouble trying to understand any research paper, the Reddit machine learning community is willing to help you out. This is an awesome idea that has already helped a bunch of people in extracting valuable information where before they used to give up and move on.

But when you post there, ensure you provide as much detail as you can, like a summary of the paper, where you are stuck, what research have you done to find out by yourself, etc. This line sums it up well โ€“ โ€œThink of each paper as an invite to an open study group for that paper, not just a queue for an expert to come along and answer it.โ€

Statement on Nature Machine Intelligence 

The debate about whether research should be open sourced or closed has been raging on for decades. Recently, the popular Nature magazine announced itโ€™ll be publishing a closed-access journal. This has led to a major campaign against them, with a lot of big names (Jeff Dean, Ian Goodfellow, among others) adding their signatures to a petition stating they will not write for such a publication.

This discussion thread has diverse and knowledgeable opinions about whether research should have open or closed access. Itโ€™s a fascinating read and I highly recommend going through the entire thread to see what the ML community thinks about this topic.

Michael Jordanโ€™s Lecture on the Current State of AI

Michael Jordan is a celebrated professor from Berkeley and in a recent talk he spoke at length about how we are miles away from reaching true intelligence in machines. Itโ€™s a sobering presentation and really makes one think about the topic.

This thread has generated more than 100 comments, with users weighing in with their opinions about where they perceive AI to be. What makes this a fascinating read is the depth of comments which some users have gone into. Go ahead, read it and participate in the still active discussion.

Scientists Planning Huge European AI Hub to Compete with US

This looks like a reasonably straightforward topic right? Wait till you dive into the thread. Data scientists and machine learning researchers from all over Europe and the USA are involved in an intense discussion about how the structure of ML is shaping in both continents, and what the salary figures look like. You will gain a lot of perspective about the architecture of ML projects and prospective salaries.

Measuring the Intrinsic Dimension of Objective Landscapes

This thread was launched from Uberโ€™s video on developing intrinsic dimension as a fundamental property of neural networks. If you have any doubts regarding the content presented in the video, the community has answered those questions in detail. The biggest poisitve seems to be that people love that a research paper was turned into a video, which makes it easier to understand the research.

Have you used any of the GitHub libraries before? And whatโ€™s your take on the Reddit discussions? If you have any feedback or suggestions, or need clarification on anything, get involved in the comments section below!

Senior Editor at Analytics Vidhya.Data visualization practitioner who loves reading and delving deeper into the data science and machine learning arts. Always looking for new ways to improve processes using ML and AI.

Login to continue reading and enjoy expert-curated content.

Free Courses

Exploratory Data Analysis with Python & GenAI

Learn EDA with Python: Transform data into insights using PandasAI & more.

Data Science Course

Build a powerful 2026-ready data science resume using AI tools.

Understanding the working of Neural Networks

Learn the neural network basics, concepts, layers, and activation functions.

No Code Predictive Analytics with Orange

No-code AI course for business pros with real-world ML use cases.

Adaptive Email Agents with DSPy

Build adaptive email agents with DSPy using context and smart learning.

Responses From Readers

Flagship Programs

GenAI Pinnacle Program| GenAI Pinnacle Plus Program| AI/ML BlackBelt Program| Agentic AI Pioneer Program

Free Courses

Generative AI| DeepSeek| OpenAI Agent SDK| LLM Applications using Prompt Engineering| DeepSeek from Scratch| Stability.AI| SSM & MAMBA| RAG Systems using LlamaIndex| Building LLMs for Code| Python| Microsoft Excel| Machine Learning| Deep Learning| Mastering Multimodal RAG| Introduction to Transformer Model| Bagging & Boosting| Loan Prediction| Time Series Forecasting| Tableau| Business Analytics| Vibe Coding in Windsurf| Model Deployment using FastAPI| Building Data Analyst AI Agent| Getting started with OpenAI o3-mini| Introduction to Transformers and Attention Mechanisms

Popular Categories

AI Agents| Generative AI| Prompt Engineering| Generative AI Application| News| Technical Guides| AI Tools| Interview Preparation| Research Papers| Success Stories| Quiz| Use Cases| Listicles

Generative AI Tools and Techniques

GANs| VAEs| Transformers| StyleGAN| Pix2Pix| Autoencoders| GPT| BERT| Word2Vec| LSTM| Attention Mechanisms| Diffusion Models| LLMs| SLMs| Encoder Decoder Models| Prompt Engineering| LangChain| LlamaIndex| RAG| Fine-tuning| LangChain AI Agent| Multimodal Models| RNNs| DCGAN| ProGAN| Text-to-Image Models| DDPM| Document Question Answering| Imagen| T5 (Text-to-Text Transfer Transformer)| Seq2seq Models| WaveNet| Attention Is All You Need (Transformer Architecture) | WindSurf| Cursor

Popular GenAI Models

Llama 4| Llama 3.1| GPT 4.5| GPT 4.1| GPT 4o| o3-mini| Sora| DeepSeek R1| DeepSeek V3| Janus Pro| Veo 2| Gemini 2.5 Pro| Gemini 2.0| Gemma 3| Claude Sonnet 3.7| Claude 3.5 Sonnet| Phi 4| Phi 3.5| Mistral Small 3.1| Mistral NeMo| Mistral-7b| Bedrock| Vertex AI| Qwen QwQ 32B| Qwen 2| Qwen 2.5 VL| Qwen Chat| Grok 3

AI Development Frameworks

n8n| LangChain| Agent SDK| A2A by Google| SmolAgents| LangGraph| CrewAI| Agno| LangFlow| AutoGen| LlamaIndex| Swarm| AutoGPT

Data Science Tools and Techniques

Python| R| SQL| Jupyter Notebooks| TensorFlow| Scikit-learn| PyTorch| Tableau| Apache Spark| Matplotlib| Seaborn| Pandas| Hadoop| Docker| Git| Keras| Apache Kafka| AWS| NLP| Random Forest| Computer Vision| Data Visualization| Data Exploration| Big Data| Common Machine Learning Algorithms| Machine Learning| Google Data Science Agent
๐Ÿ‘ Av Logo White

Continue your learning for FREE

Forgot your password?
๐Ÿ‘ Av Logo White

Enter OTP sent to

Edit

Wrong OTP.

Enter the OTP

Resend OTP

Resend OTP in 45s

๐Ÿ‘ Popup Banner
๐Ÿ‘ AI Popup Banner