This is an experimental version
Use the following to run:
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
git fetch origin pull/24523/head:minimax-m3
git checkout minimax-m3
cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j --target llama-cli llama-server
- Downloads last month
- 129
GGUF
Model size
426B params
Architecture
minimax-m3
Hardware compatibility
Log In to add your hardware
2-bit
3-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for SicariusSicariiStuff/Minimax-M3-abliterated_GGUF
Base model
MiniMaxAI/MiniMax-M3