mistral-small-r1-tensopolis

This model is a reasoning fine-tune of unsloth/mistral-small-24b-instruct-2501-unsloth-bnb-4bit, trained on a single A100 GPU for about 100 hours. Please refer to the base model and dataset for more information about the license, prompt format, etc.

Base model: mistralai/Mistral-Small-24B-Instruct-2501

Dataset: ServiceNow-AI/R1-Distill-SFT

Basic Instruct Template (V7-Tekken)

<s>[SYSTEM_PROMPT]<system prompt>[/SYSTEM_PROMPT][INST]<user message>[/INST]<assistant response></s>[INST]<user message>[/INST]
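
The template above can be assembled by hand as a plain string. The following is a minimal sketch, not the model's official tooling: the helper name `build_prompt` and its arguments are hypothetical, and it only concatenates the control markers exactly as they appear in the template. In practice the chat template shipped with the base model's tokenizer is likely the safer choice, since it also controls how these markers are tokenized.

```python
from typing import List, Optional, Tuple


def build_prompt(
    system_prompt: Optional[str],
    turns: List[Tuple[str, Optional[str]]],
) -> str:
    """Assemble a V7-Tekken style prompt string (hypothetical helper).

    `turns` is a list of (user_message, assistant_response) pairs.
    Use None as the assistant response of the final pair to leave the
    prompt open for the model to complete.
    """
    parts = ["<s>"]  # beginning-of-sequence marker, emitted once
    if system_prompt:
        parts.append(f"[SYSTEM_PROMPT]{system_prompt}[/SYSTEM_PROMPT]")
    for user_message, assistant_response in turns:
        parts.append(f"[INST]{user_message}[/INST]")
        if assistant_response is not None:
            # Each completed assistant turn is closed with </s>
            parts.append(f"{assistant_response}</s>")
    return "".join(parts)


prompt = build_prompt(
    "Be brief.",
    [("Hi", "Hello!"), ("What is 2+2?", None)],
)
print(prompt)
```

Passing `None` for the last assistant response leaves the string ending in `[/INST]`, which matches the tail of the template and is where generation would begin.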

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Downloads last month: 9

Safetensors

Model size: 24B params

Tensor type: BF16

Model tree for tensopolis/mistral-small-r1-tensopolis

Base model: mistralai/Mistral-Small-24B-Base-2501

Finetuned: mistralai/Mistral-Small-24B-Instruct-2501

Finetuned (76): this model

Quantizations: 3 models

URL: https://huggingface.co/tensopolis/mistral-small-r1-tensopolis
