Quantized versions of IBM Granite models. • 47 items • Updated • 36
Granite 4.0 H-Tiny (GGUF)
This repository contains models that have been converted to the GGUF format with various quantizations from an IBM Granite base model.
Please reference the base model's full model card here: https://huggingface.co/ibm-granite/granite-4.0-h-tiny
- Downloads last month
- 2,220
GGUF
Model size
7B params
Architecture
granitehybrid
Hardware compatibility
Log In to add your hardware
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for ibm-granite/granite-4.0-h-tiny-GGUF
Base model
ibm-granite/granite-4.0-h-tiny