VOOZH about

URL: https://www.analyticsvidhya.com/blog/2018/06/facebook-has-open-sourced-the-python-code-for-densepose-download-now/

โ‡ฑ Facebook has Open Sourced the Python Code for DensePose - Download it now!


India's Most Futuristic AI Conference Is Back โ€“ Bigger, Sharper, Bolder

  • d
  • :
  • h
  • :
  • m
  • :
  • s

Facebook has Open Sourced the Python Code for DensePose โ€“ Download it now!

Aishwarya Singh Last Updated : 24 Jun, 2018
3 min read

Overview

  • Facebook has open sourced the code for DensePose, a technique that can understand human images in terms of surface-based models.
  • The DensePose project includes DensePose-COCO and Densepose-RCNN
  • It has been implemented using Facebookโ€™s Detectron framework, and is powered by Caffe2

Introduction

Imagine a world where you open an apparel application on your phone, tap on clothes you like, and the app shows images of you with those clothes on. Sounds like magic, right? On the contrary, we are very close to seeing this kind of technology turning into a real-life application.

Currently, data scientists are able to annotate images, but the existing approaches locate a sparse set of joints, like the wrists or elbows, which are often used for applications like gesture or action recognition. Facebookโ€™s AI Research division (FAIR) has taken this technique to another level altogether.

In order to map all human pixels in 2D images to a 3D surface-based model of the body, they have pioneered a new approach called DensePose. The current approaches in human pose estimation operate with 10 or 20 human joints (such as wrists, elbows, etc.) whereas DensePose identifies the human body in more than 5000 nodes! The below image illustrates my point pretty well.

๐Ÿ‘ Image

As mentioned by the researchers in the paper DensePose: Dense Human Pose Estimation In The Wild, presented at the Computer Vision and Pattern Recognition conference (CVPR) 2018 in Utah, the DensePose project includes:

DensePose-COCO: A large-scale dataset with image-to-surface correspondences. The team has gathered annotations for 50K humans, collecting more than 5 million manually annotated correspondences. The exact same train/validation/test split as in the COCO challenge has been followed. Below is an example of a visualization of annotation from the validation set.

๐Ÿ‘ Image

DensePose-RCNN: This is a variant of Mask-RCNN, with Feature Pyramid Network and Region-of-Interest Pooling followed by fully-convolutional processing (architecture shown below). This is done to obtain dense part labels and coordinates within each of the selected regions.

๐Ÿ‘ Image

The team has shared a GitHub repository in which they have open sourced the code to train and evaluate DensePose-RCNN. Also the notebooks used to visualize the collected DensePose-COCO dataset have been provided. This technique has been implemented using Facebookโ€™s own Detectron framework and is powered by Caffe2.

Below is a video in which they have provided an overview of the technique.

Our Take on this

I can see this technique being put to good use for improving virtual reality experiences or for motion capture devices. And not just that, it can help doctors make decisions regarding physical ailments in patients, accelerate the recent advancement in sports science, among other things.

As usual, the code is available on code for you to play around with. Can you improve on what theyโ€™ve released? Where else can you apply this transcendent technique? Share your thoughts in the comments section below.

Subscribe to AVBytes here to get regular data science, machine learning and AI updates in your inbox!

An avid reader and blogger who loves exploring the endless world of data science and artificial intelligence. Fascinated by the limitless applications of ML and AI; eager to learn and discover the depths of data science.

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

Responses From Readers

Flagship Programs

GenAI Pinnacle Program| GenAI Pinnacle Plus Program| AI/ML BlackBelt Program| Agentic AI Pioneer Program

Free Courses

Generative AI| DeepSeek| OpenAI Agent SDK| LLM Applications using Prompt Engineering| DeepSeek from Scratch| Stability.AI| SSM & MAMBA| RAG Systems using LlamaIndex| Building LLMs for Code| Python| Microsoft Excel| Machine Learning| Deep Learning| Mastering Multimodal RAG| Introduction to Transformer Model| Bagging & Boosting| Loan Prediction| Time Series Forecasting| Tableau| Business Analytics| Vibe Coding in Windsurf| Model Deployment using FastAPI| Building Data Analyst AI Agent| Getting started with OpenAI o3-mini| Introduction to Transformers and Attention Mechanisms

Popular Categories

AI Agents| Generative AI| Prompt Engineering| Generative AI Application| News| Technical Guides| AI Tools| Interview Preparation| Research Papers| Success Stories| Quiz| Use Cases| Listicles

Generative AI Tools and Techniques

GANs| VAEs| Transformers| StyleGAN| Pix2Pix| Autoencoders| GPT| BERT| Word2Vec| LSTM| Attention Mechanisms| Diffusion Models| LLMs| SLMs| Encoder Decoder Models| Prompt Engineering| LangChain| LlamaIndex| RAG| Fine-tuning| LangChain AI Agent| Multimodal Models| RNNs| DCGAN| ProGAN| Text-to-Image Models| DDPM| Document Question Answering| Imagen| T5 (Text-to-Text Transfer Transformer)| Seq2seq Models| WaveNet| Attention Is All You Need (Transformer Architecture) | WindSurf| Cursor

Popular GenAI Models

Llama 4| Llama 3.1| GPT 4.5| GPT 4.1| GPT 4o| o3-mini| Sora| DeepSeek R1| DeepSeek V3| Janus Pro| Veo 2| Gemini 2.5 Pro| Gemini 2.0| Gemma 3| Claude Sonnet 3.7| Claude 3.5 Sonnet| Phi 4| Phi 3.5| Mistral Small 3.1| Mistral NeMo| Mistral-7b| Bedrock| Vertex AI| Qwen QwQ 32B| Qwen 2| Qwen 2.5 VL| Qwen Chat| Grok 3

AI Development Frameworks

n8n| LangChain| Agent SDK| A2A by Google| SmolAgents| LangGraph| CrewAI| Agno| LangFlow| AutoGen| LlamaIndex| Swarm| AutoGPT

Data Science Tools and Techniques

Python| R| SQL| Jupyter Notebooks| TensorFlow| Scikit-learn| PyTorch| Tableau| Apache Spark| Matplotlib| Seaborn| Pandas| Hadoop| Docker| Git| Keras| Apache Kafka| AWS| NLP| Random Forest| Computer Vision| Data Visualization| Data Exploration| Big Data| Common Machine Learning Algorithms| Machine Learning| Google Data Science Agent
๐Ÿ‘ Av Logo White

Continue your learning for FREE

Forgot your password?
๐Ÿ‘ Av Logo White

Enter OTP sent to

Edit

Wrong OTP.

Enter the OTP

Resend OTP

Resend OTP in 45s

๐Ÿ‘ Popup Banner
๐Ÿ‘ AI Popup Banner