Probably function calling datasets Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. Function Calling Datasets Explorer 💻 11 Browse and view datasets from a Hugging Face collection Viewer • Updated Jun 7, 2024 • 1k • 1.4k • 10 Viewer • Updated Jun 7, 2024 • 1k • 245 • 22 Viewer • Updated Jun 23, 2024 • 125k • 72 • 10
smol models Models where the size of the model file (model.safetensors or pytorch_model.bin) < 50mb Object Detection • 6.49M • Updated Apr 10, 2024 • 103k • 281 Fill-Mask • 0.3B • Updated 22 days ago • 3.57k • • 299 Image Segmentation • 3.75M • Updated Jan 14, 2024 • 262k • • 190 Updated Oct 27, 2021 • 1.17M • 146
paper-related-spaces Spaces which focus on creating recommendations for papers or working with Hugging Face papers Recommend Similar Papers 🌖 191 Get similar paper recommendations from a Hugging Face link Collection Reading List Generator 📚 9 Collection Papers Extractor 📄 5 Claim Papers 🌍 5
Image Preference Optimization Datasets Datasets suitable for Image Preference Optimization based on their colum names Preview • Updated Oct 14, 2025 • 1.33k • 215 Viewer • Updated Jul 3, 2024 • 101k • 1.41k • 20 Viewer • Updated May 15, 2024 • 12.7k • 13 Viewer • Updated Jul 26, 2024 • 1.81k • 26
Alpaca Style Datasets Datasets which follow the Alpaca Style format based on having 'instruction', 'input', and 'output' columns Alpaca Style Datasets Explorer 💻 2 Viewer • Updated Apr 12, 2023 • 15k • 1.37k • 22 Viewer • Updated Feb 10, 2024 • 52k • 4.58k • 324 Viewer • Updated Jun 7, 2024 • 51.7k • 543 • 5
Probably oasst Style Datasets Datasets in the OpenAssistant format {"INSTRUCTION": "...", "RESPONSE": "..."} Viewer • Updated May 13, 2023 • 16.8k • 703 • 11 Viewer • Updated Feb 11, 2023 • 8.79k • 624 • 52 Viewer • Updated Apr 23, 2023 • 1.01M • 1.02k • 261 Viewer • Updated May 20, 2023 • 419k • 349 • 36
Top 10% instruction tuning datasets Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes Viewer • Updated Dec 23, 2022 • 7.15M • 7k • 84 Viewer • Updated Feb 11, 2023 • 8.79k • 624 • 52 Viewer • Updated Feb 28, 2023 • 327 • 1.16k • 65 Viewer • Updated Oct 16, 2024 • 323k • 799 • 61
Text datasets with missing language information Viewer • Updated Sep 2, 2023 • 51.8k • 37 Updated Jun 20, 2023 • 558 • 12 Viewer • Updated Dec 29, 2025 • 465k • 29 • 1 Updated Jan 7, 2024 • 1.98k • 23
Direct Preference Optimization Datasets Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api Direct-Preference-Optimization-Datasets-explorer 💻 2 Browse and view datasets from a Hugging Face collection Viewer • Updated Jul 16, 2024 • 7.56k • 8.15k • 184 Viewer • Updated Oct 16, 2024 • 187k • 14k • 341 Viewer • Updated Sep 9, 2024 • 8.11k • 5.68k • 108
Direct-Preference-Optimization-Datasets-explorer 💻 2 Browse and view datasets from a Hugging Face collection
Hub Card Data Viewer • Updated about 4 hours ago • 615k • 690 • 23 Viewer • Updated 10 days ago • 515k • 373 • 16
Probably function calling datasets Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. Function Calling Datasets Explorer 💻 11 Browse and view datasets from a Hugging Face collection Viewer • Updated Jun 7, 2024 • 1k • 1.4k • 10 Viewer • Updated Jun 7, 2024 • 1k • 245 • 22 Viewer • Updated Jun 23, 2024 • 125k • 72 • 10
Probably oasst Style Datasets Datasets in the OpenAssistant format {"INSTRUCTION": "...", "RESPONSE": "..."} Viewer • Updated May 13, 2023 • 16.8k • 703 • 11 Viewer • Updated Feb 11, 2023 • 8.79k • 624 • 52 Viewer • Updated Apr 23, 2023 • 1.01M • 1.02k • 261 Viewer • Updated May 20, 2023 • 419k • 349 • 36
smol models Models where the size of the model file (model.safetensors or pytorch_model.bin) < 50mb Object Detection • 6.49M • Updated Apr 10, 2024 • 103k • 281 Fill-Mask • 0.3B • Updated 22 days ago • 3.57k • • 299 Image Segmentation • 3.75M • Updated Jan 14, 2024 • 262k • • 190 Updated Oct 27, 2021 • 1.17M • 146
Top 10% instruction tuning datasets Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes Viewer • Updated Dec 23, 2022 • 7.15M • 7k • 84 Viewer • Updated Feb 11, 2023 • 8.79k • 624 • 52 Viewer • Updated Feb 28, 2023 • 327 • 1.16k • 65 Viewer • Updated Oct 16, 2024 • 323k • 799 • 61
paper-related-spaces Spaces which focus on creating recommendations for papers or working with Hugging Face papers Recommend Similar Papers 🌖 191 Get similar paper recommendations from a Hugging Face link Collection Reading List Generator 📚 9 Collection Papers Extractor 📄 5 Claim Papers 🌍 5
Text datasets with missing language information Viewer • Updated Sep 2, 2023 • 51.8k • 37 Updated Jun 20, 2023 • 558 • 12 Viewer • Updated Dec 29, 2025 • 465k • 29 • 1 Updated Jan 7, 2024 • 1.98k • 23
Image Preference Optimization Datasets Datasets suitable for Image Preference Optimization based on their colum names Preview • Updated Oct 14, 2025 • 1.33k • 215 Viewer • Updated Jul 3, 2024 • 101k • 1.41k • 20 Viewer • Updated May 15, 2024 • 12.7k • 13 Viewer • Updated Jul 26, 2024 • 1.81k • 26
Direct Preference Optimization Datasets Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api Direct-Preference-Optimization-Datasets-explorer 💻 2 Browse and view datasets from a Hugging Face collection Viewer • Updated Jul 16, 2024 • 7.56k • 8.15k • 184 Viewer • Updated Oct 16, 2024 • 187k • 14k • 341 Viewer • Updated Sep 9, 2024 • 8.11k • 5.68k • 108
Direct-Preference-Optimization-Datasets-explorer 💻 2 Browse and view datasets from a Hugging Face collection
Alpaca Style Datasets Datasets which follow the Alpaca Style format based on having 'instruction', 'input', and 'output' columns Alpaca Style Datasets Explorer 💻 2 Viewer • Updated Apr 12, 2023 • 15k • 1.37k • 22 Viewer • Updated Feb 10, 2024 • 52k • 4.58k • 324 Viewer • Updated Jun 7, 2024 • 51.7k • 543 • 5
Hub Card Data Viewer • Updated about 4 hours ago • 615k • 690 • 23 Viewer • Updated 10 days ago • 515k • 373 • 16