What is this?
These are dynamic gguf quants, they take slightly more VRAM, but the attention layers are of a much higher quality.
- Downloads last month
- 17
GGUF
Model size
12B params
Architecture
llama
Hardware compatibility
Log In to add your hardware
We're not able to determine the quantization variants.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for SicariusSicariiStuff/Impish_Nemo_12B_GGUF_HA
Base model
mistralai/Mistral-Nemo-Base-2407 Finetuned
mistralai/Mistral-Nemo-Instruct-2407 Finetuned
SicariusSicariiStuff/Impish_Nemo_12B