SimpleSD Simple Self-Distillation Text Generation • 4B • Updated Apr 7 • 63 • 5 Text Generation • 31B • Updated Apr 7 • 199 • 6 Text Generation • 4B • Updated Apr 7 • 234 • 5
FastVLM Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 77 FastVLM WebGPU 🍎 446 Real-time video captioning powered by FastVLM Text Generation • 0.8B • Updated Sep 3, 2025 • 5.99k • 394 Text Generation • 2B • Updated Sep 3, 2025 • 1.43k • 80
DiffuCoder 8B • Updated Dec 8, 2025 • 409 • 319 8B • Updated Dec 8, 2025 • 1.41k • 63 Paper • 2506.20639 • Published Jun 25, 2025 • 32 8B • Updated Dec 8, 2025 • 168 • 31
Core ML Gallery Models Depth Estimation • Updated Jun 24, 2024 • 669 • 97 Depth Estimation • Updated Jun 13, 2024 • 52 • 39 Image Segmentation • Updated Aug 9, 2024 • 83 • 33 Image Classification • Updated Jun 13, 2024 • 16 • 17
OpenELM Pretrained Models Text Generation • 0.3B • Updated Feb 28, 2025 • 1.52k • 76 Text Generation • 0.5B • Updated Feb 28, 2025 • 291 • 26 Text Generation • 1B • Updated Feb 28, 2025 • 1.9k • 34 Text Generation • 3B • Updated Feb 28, 2025 • 218 • 130
TiC-CLIP Benchmark for the design of efficient continual learning of image-text models over years. Paper • 2310.16226 • Published Oct 24, 2023 • 10 Preview • Updated Jun 13, 2024 • 905 • 4 Zero-Shot Image Classification • Updated Feb 24, 2025 • 47 • 3 Zero-Shot Image Classification • Updated Feb 24, 2025 • 3 • 1
Core ML Stable Diffusion Updated Jul 29, 2023 • 10 • 30 Text-to-Image • Updated Jul 27, 2023 • 56 • 70 Text-to-Image • Updated May 1, 2023 • 141 • 55 Text-to-Image • Updated Dec 29, 2022 • 13 • 4
Core ML Depth Anything Depth Estimation • Updated Jun 24, 2024 • 669 • 97 Depth Estimation • Updated Jun 13, 2024 • 52 • 39
AIM AIM: Autoregressive Image Models Image Classification • Updated Feb 28, 2025 • 20 • 19 Image Classification • Updated Feb 28, 2025 • 13 • 6 Image Classification • Updated Feb 28, 2025 • 12 • 4 Image Classification • Updated Feb 28, 2025 • 11 • 25
Core ML Segment Anything 2 Mask Generation • Updated Oct 1, 2024 • 124 • 24 Mask Generation • Updated Oct 1, 2024 • 214 • 3 Mask Generation • Updated Oct 1, 2024 • 83 • 4 Mask Generation • Updated Oct 1, 2024 • 164 • 10
CLaRa CLaRa models Updated Dec 11, 2025 • 181 Updated Dec 8, 2025 • 18 Updated Nov 23, 2025 • 3 Updated Dec 8, 2025 • 28
MobileCLIP2 MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B Paper • 2508.20691 • Published Aug 28, 2025 • 9 Updated Oct 9, 2025 • 78 • 49 Updated Oct 9, 2025 • 60 • 19 Updated Oct 9, 2025 • 22 • 4
AIMv2 A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. Image Feature Extraction • 0.3B • Updated Jul 8, 2025 • 1.23k • 62 Image Feature Extraction • 0.7B • Updated Jul 8, 2025 • 60 • 13 Image Feature Extraction • 1B • Updated Jul 8, 2025 • 113 • 8 Image Feature Extraction • 3B • Updated Jul 8, 2025 • 206 • 4
OpenELM Instruct Models Text Generation • 0.3B • Updated Feb 28, 2025 • 978 • 145 Text Generation • 0.5B • Updated Feb 28, 2025 • 910 • 51 Text Generation • 1B • Updated Feb 28, 2025 • 1.2M • 75 Text Generation • 3B • Updated Feb 28, 2025 • 1.34k • 340
MobileCLIP Models + DataCompDR Data MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. Paper • 2311.17049 • Published Nov 28, 2023 • 8 Image Classification • Updated Feb 28, 2025 • 169 • 12 Image Classification • Updated Feb 28, 2025 • 28 • 3 Image Classification • Updated Feb 28, 2025 • 35 • 6
DepthPro Models Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Depth Estimation • 1.0B • Updated Feb 28, 2025 • 27.4k • 106 Depth Estimation • Updated Feb 28, 2025 • 6.42k • 517 Depth Estimation • 1.0B • Updated Feb 28, 2025 • 20 • 8 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 8.83M • 2.04k
Core ML FastViT Image Classification • Updated Jun 13, 2024 • 16 • 17 Image Classification • Updated Jun 13, 2024 • 7 • 11
DFN Models + Data CLIP Models trained using DFN-2B/DFN-5B datasets Updated Feb 28, 2025 • 469k • 109 Updated Feb 28, 2025 • 13.6k • 17 Updated Feb 28, 2025 • 757k • 13 Updated Feb 28, 2025 • 41.3k • 47
DCLM DCLM Models + Datasets 7B • Updated Jul 26, 2024 • 79 • 832 7B • Updated Aug 6, 2024 • 17 • 45 Preview • Updated Jul 22, 2024 • 606k • 287 1B • Updated Jul 25, 2024 • 5 • 13
SimpleSD Simple Self-Distillation Text Generation • 4B • Updated Apr 7 • 63 • 5 Text Generation • 31B • Updated Apr 7 • 199 • 6 Text Generation • 4B • Updated Apr 7 • 234 • 5
CLaRa CLaRa models Updated Dec 11, 2025 • 181 Updated Dec 8, 2025 • 18 Updated Nov 23, 2025 • 3 Updated Dec 8, 2025 • 28
FastVLM Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 77 FastVLM WebGPU 🍎 446 Real-time video captioning powered by FastVLM Text Generation • 0.8B • Updated Sep 3, 2025 • 5.99k • 394 Text Generation • 2B • Updated Sep 3, 2025 • 1.43k • 80
MobileCLIP2 MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B Paper • 2508.20691 • Published Aug 28, 2025 • 9 Updated Oct 9, 2025 • 78 • 49 Updated Oct 9, 2025 • 60 • 19 Updated Oct 9, 2025 • 22 • 4
DiffuCoder 8B • Updated Dec 8, 2025 • 409 • 319 8B • Updated Dec 8, 2025 • 1.41k • 63 Paper • 2506.20639 • Published Jun 25, 2025 • 32 8B • Updated Dec 8, 2025 • 168 • 31
AIMv2 A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. Image Feature Extraction • 0.3B • Updated Jul 8, 2025 • 1.23k • 62 Image Feature Extraction • 0.7B • Updated Jul 8, 2025 • 60 • 13 Image Feature Extraction • 1B • Updated Jul 8, 2025 • 113 • 8 Image Feature Extraction • 3B • Updated Jul 8, 2025 • 206 • 4
Core ML Gallery Models Depth Estimation • Updated Jun 24, 2024 • 669 • 97 Depth Estimation • Updated Jun 13, 2024 • 52 • 39 Image Segmentation • Updated Aug 9, 2024 • 83 • 33 Image Classification • Updated Jun 13, 2024 • 16 • 17
OpenELM Instruct Models Text Generation • 0.3B • Updated Feb 28, 2025 • 978 • 145 Text Generation • 0.5B • Updated Feb 28, 2025 • 910 • 51 Text Generation • 1B • Updated Feb 28, 2025 • 1.2M • 75 Text Generation • 3B • Updated Feb 28, 2025 • 1.34k • 340
OpenELM Pretrained Models Text Generation • 0.3B • Updated Feb 28, 2025 • 1.52k • 76 Text Generation • 0.5B • Updated Feb 28, 2025 • 291 • 26 Text Generation • 1B • Updated Feb 28, 2025 • 1.9k • 34 Text Generation • 3B • Updated Feb 28, 2025 • 218 • 130
MobileCLIP Models + DataCompDR Data MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. Paper • 2311.17049 • Published Nov 28, 2023 • 8 Image Classification • Updated Feb 28, 2025 • 169 • 12 Image Classification • Updated Feb 28, 2025 • 28 • 3 Image Classification • Updated Feb 28, 2025 • 35 • 6
TiC-CLIP Benchmark for the design of efficient continual learning of image-text models over years. Paper • 2310.16226 • Published Oct 24, 2023 • 10 Preview • Updated Jun 13, 2024 • 905 • 4 Zero-Shot Image Classification • Updated Feb 24, 2025 • 47 • 3 Zero-Shot Image Classification • Updated Feb 24, 2025 • 3 • 1
DepthPro Models Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Depth Estimation • 1.0B • Updated Feb 28, 2025 • 27.4k • 106 Depth Estimation • Updated Feb 28, 2025 • 6.42k • 517 Depth Estimation • 1.0B • Updated Feb 28, 2025 • 20 • 8 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 8.83M • 2.04k
Core ML Stable Diffusion Updated Jul 29, 2023 • 10 • 30 Text-to-Image • Updated Jul 27, 2023 • 56 • 70 Text-to-Image • Updated May 1, 2023 • 141 • 55 Text-to-Image • Updated Dec 29, 2022 • 13 • 4
Core ML FastViT Image Classification • Updated Jun 13, 2024 • 16 • 17 Image Classification • Updated Jun 13, 2024 • 7 • 11
Core ML Depth Anything Depth Estimation • Updated Jun 24, 2024 • 669 • 97 Depth Estimation • Updated Jun 13, 2024 • 52 • 39
DFN Models + Data CLIP Models trained using DFN-2B/DFN-5B datasets Updated Feb 28, 2025 • 469k • 109 Updated Feb 28, 2025 • 13.6k • 17 Updated Feb 28, 2025 • 757k • 13 Updated Feb 28, 2025 • 41.3k • 47
AIM AIM: Autoregressive Image Models Image Classification • Updated Feb 28, 2025 • 20 • 19 Image Classification • Updated Feb 28, 2025 • 13 • 6 Image Classification • Updated Feb 28, 2025 • 12 • 4 Image Classification • Updated Feb 28, 2025 • 11 • 25
DCLM DCLM Models + Datasets 7B • Updated Jul 26, 2024 • 79 • 832 7B • Updated Aug 6, 2024 • 17 • 45 Preview • Updated Jul 22, 2024 • 606k • 287 1B • Updated Jul 25, 2024 • 5 • 13
Core ML Segment Anything 2 Mask Generation • Updated Oct 1, 2024 • 124 • 24 Mask Generation • Updated Oct 1, 2024 • 214 • 3 Mask Generation • Updated Oct 1, 2024 • 83 • 4 Mask Generation • Updated Oct 1, 2024 • 164 • 10