Granite-Vision-4.1-4B (GGUF)
This repository contains GGUF conversions of an IBM Granite base model, provided at several quantization levels.
For full details, see the base model's model card: https://huggingface.co/ibm-granite/granite-vision-4.1-4b
Requirements
- llama.cpp build: b9534
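The build requirement above can be checked programmatically. The following is a minimal sketch, assuming that `b9534` is a minimum build (the list does not say whether newer or older builds also work) and that the installed build is available as a string containing either a `b<number>` tag or a `version: <number>` prefix. The helper names `parse_build` and `meets_requirement` are hypothetical and not part of llama.cpp.

```python
import re

# Build tag taken from the Requirements list; treated here as a minimum (assumption).
REQUIRED_BUILD = "b9534"


def parse_build(text: str) -> int:
    """Extract the integer build number from text such as 'b9534' or 'version: 9534 (abc1234)'."""
    match = re.search(r"\b(?:b|version:\s*)(\d+)\b", text)
    if match is None:
        raise ValueError(f"no build number found in {text!r}")
    return int(match.group(1))


def meets_requirement(installed: str, required: str = REQUIRED_BUILD) -> bool:
    """Return True if the installed build number is at least the required one."""
    return parse_build(installed) >= parse_build(required)


if __name__ == "__main__":
    # Hypothetical installed build strings, for illustration only.
    for candidate in ("b9000", "b9534", "version: 9600 (abc1234)"):
        status = "ok" if meets_requirement(candidate) else "too old"
        print(f"{candidate}: {status}")
```

If the requirement turns out to be an exact build rather than a minimum, the comparison in `meets_requirement` would need to be `==` instead of `>=`.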
GGUF
- Model size: 3B params
- Architecture: granite
- Quantizations: 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
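As a rough guide to choosing among the listed bit widths, the weight storage can be estimated as parameters × bits ÷ 8. The sketch below assumes the rounded "3B params" figure from the listing and a uniform bit width for every weight. Real GGUF files typically mix precisions and carry metadata, so actual file sizes will differ, usually upward. The function name `estimate_gb` is hypothetical.

```python
# Rounded parameter count from the listing ("3B params"); an approximation.
PARAMS = 3_000_000_000

# Bit widths listed for this repository.
BIT_WIDTHS = [4, 5, 6, 8, 16]


def estimate_gb(params: int, bits: int) -> float:
    """Estimate weight storage in gigabytes (10^9 bytes), assuming a uniform bit width."""
    return params * bits / 8 / 1e9


if __name__ == "__main__":
    for bits in BIT_WIDTHS:
        print(f"{bits:>2}-bit: ~{estimate_gb(PARAMS, bits):.1f} GB")
```

These figures cover weights only. Memory needed at inference time also depends on context length and runtime buffers, which this estimate does not model.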
Model tree for ibm-granite/granite-vision-4.1-4b-GGUF
- Base model: ibm-granite/granite-vision-4.1-4b