Hot take: Git was the wrong abstraction for 90% of ML data.
Checkpoints, optimizer states, training logs, agent traces - none of this needs version control. It needs fast, cheap, mutable storage.
So we built Buckets. S3-like storage on the @huggingface Hub with Xet dedup and
