VOOZH about

URL: https://thenewstack.io/cncf-dragonfly-speeds-container-model-sharing-with-p2p/

⇱ CNCF Dragonfly Speeds Container, Model Sharing with P2P - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-01-16 06:15:19
CNCF Dragonfly Speeds Container, Model Sharing with P2P
Cloud Native Ecosystem / Networking / Open Source / Operations

CNCF Dragonfly Speeds Container, Model Sharing with P2P

Dragonfly project has graduated from CNCF, proving its maturity as a high-scale peer-to-peer system for distributing container images and massive AI models across thousands of nodes.
Jan 16th, 2026 6:15am by Joab Jackson
👁 Featued image for: CNCF Dragonfly Speeds Container, Model Sharing with P2P

The Dragonfly project, an open source peer-to-peer image and file distribution system, has graduated from the Cloud Native Computing Foundation‘s program for incubating new cloud native technologies.

The open source technology, under CNCF’s wing since 2018, has shown that it can work in production settings, with its ability to copy containers and large AI models across a network at scale, according to the organization. Built to run on Kubernetes, it has found use by organizations managing large-scale AI workloads, and has found a home in other environs as well, including CI/CD and edge computing.

CNCF Dragonfly, originally developed for internal use by Alibaba Cloud, provides a way for organizations to distribute images across a network. It can copy container images to thousands of nodes nearly simultaneously.

It also works well with files, caches and logs.

Overall, 271 individuals across 130 companies have contributed 26,000 commits to building out the project.

“Looking back on this journey over the past eight years, every step has embodied the open source spirit and the tireless efforts of the many contributors,” said Zuozheng Hu, founder of Dragonfly and emeritus maintainer, in a statement.

The Power of P2P

A peer-to-peer file sharing mechanism could help cloud native deployments in distributing new and updated container images across a cluster more quickly and with less stress to the upstream network.

P2P, first popularized by music sharing programs such as Napster over two decades ago, can make full use of the cluster’s bandwidth while eliminating the possible bottleneck of having a single server respond to all the requests for a new image.

In a P2P network, each node, or “peer,” can share files with each other, rather than all the nodes saturating the bandwidth to the image server by downloading identical copies of a single image.

Dragonfly is not a pure P2P technology; It still requires a supernode, to schedule and control distribution within the peer network. An agent on each node, dfget, downloads the file pieces. Another component, the dfdaemon proxy, intercepts image downloading requests from a container engine to dfget.

Dragonfly’s Robust Support Stack

As a CNCF project, the development team has built a robust support stack in the past decade. The Dragonfly can be installed via Helm, and monitored with Prometheus and OpenTelemetry.

To speed transfers, it can run on the gRPC protocol. Images can be “preheated” for faster sharing via the Harbor open source registry.

Dragonfly also supports CNCF’s ModelPack specification for tidier AI model distribution.

One Dragonfly subproject, called Nydus, has brought considerable value to the software by further accelerating model distribution.

“The combination of Dragonfly and Nydus substantially shortens launch times for container images and AI models, enhancing system resilience and efficiency,” said Jiang Liu, Nydus maintainer, in a statement.

Use Cases for Dragonfly

Dragonfly has found a home across some of the most innovative cloud native services, many located in Asia. CNCF provided a few key examples.

It has become a core component of the container image and data distribution system for Alibaba, providing support for the annual Double 11 (Singles’ Day) shopping festival, as well as an ongoing role in model data distribution and cache acceleration.

It has saved considerable transmission bandwidth across the 10,000 Kubernetes nodes of the Asian financial company Ant Group. Nydus, in particular, helped the organization reduce image pull time to near zero, and the technology is used for large language model movement as well.

For the Datadog observability firm, Dragonfly with Nydus cut the time it takes node daemonsets to start up within seconds, whereas the image pulls would previously drag that time out to five minutes.

Chinese mobile technology company DiDi uses Dragonfly for large-scale file synchronization and image distribution for enterprises.

And container registry service Kuaishou is about to use Dragonfly to support image distribution capabilities for tens of thousands of services and hundreds of thousands of servers.

TRENDING STORIES
Joab Jackson is a senior editor for The New Stack, covering cloud native computing and system operations. He has reported on IT infrastructure and development for over 30 years, including stints at IDG and Government Computer News. Before that, he...
Read more from Joab Jackson
SHARE THIS STORY
TRENDING STORIES
The Cloud Native Computing Foundation is a sponsor of The New Stack. 
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.