Eve 2 family of models; 272M parameter MoE IT Specialist Agents • 19 items • Updated • 1
Eve-2-MoE-IT-272M - GGUF
GGUF quantizations of anthonym21/Eve-2-MoE-IT-272M.
Quantization Variants
| Quantization | Filename | Size |
|---|---|---|
| Q8_0 | Eve-2-MoE-IT-272M-Q8_0.gguf | 318.3 MB |
| Q4_K_M | Eve-2-MoE-IT-272M-Q4_K_M.gguf | 204.0 MB |
Usage with Ollama
ollama run anthonym21/eve-2-moe-it-272m
Usage with llama.cpp
llama-cli -m Eve-2-MoE-IT-272M-Q4_K_M.gguf -p "Your prompt here"
Architecture
- Type: DeepSeek-style Mixture of Experts (MoE)
- Parameters: 272M total
- Layers: 12
- Hidden dim: 512
- Experts: 8 routed (top-2) + 1 shared per layer
- Context: 2048 tokens
- Tokenizer: GPT-2
Parent Model
This is a quantized version of anthonym21/Eve-2-MoE-IT-272M.
- Downloads last month
- 15
GGUF
Model size
0.3B params
Architecture
deepseek
Hardware compatibility
Log In to add your hardware
4-bit
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for anthonym21/Eve-2-MoE-IT-272M-GGUF
Base model
anthonym21/Eve-2-MoE-272M Finetuned
anthonym21/Eve-2-MoE-IT-272M