xCoT-Distill: Cross-Lingual Chain-of-Thought Distillation for Arabic Reasoning

This model is a QLoRA fine-tune of Qwen/Qwen3-8B trained to reason in English and answer in Arabic.
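Because the model reasons in English and answers in Arabic, downstream code usually needs to separate the two parts of a generation. The following is a minimal sketch, not the authors' code. It assumes the reasoning is wrapped in `<think>...</think>` tags, as in Qwen3's default chat format; this card does not confirm the fine-tune keeps that convention. The names `split_output`, `SplitOutput`, and `arabic_ratio` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class SplitOutput(NamedTuple):
    reasoning: str  # English chain of thought (empty if no think block was found)
    answer: str     # final answer, expected to be in Arabic


# Assumption: reasoning is delimited by <think>...</think>, as in Qwen3's chat format.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_output(text: str) -> SplitOutput:
    """Split a raw generation into (reasoning, answer)."""
    match = _THINK_RE.search(text)
    if match is None:
        # No think block: treat the whole text as the answer.
        return SplitOutput("", text.strip())
    reasoning = match.group(1).strip()
    answer = text[match.end():].strip()
    return SplitOutput(reasoning, answer)


def arabic_ratio(text: str) -> float:
    """Fraction of alphabetic characters that fall in the basic Arabic Unicode block."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    arabic = sum(1 for ch in letters if "\u0600" <= ch <= "\u06FF")
    return arabic / len(letters)
```

A check such as `arabic_ratio(split_output(text).answer)` could serve as a rough sanity test that the final answer is in Arabic while the reasoning is not. The threshold to apply is left to the user.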

Method

xCoT-Distill generates (Arabic question, English CoT, Arabic answer) triples using a Qwen3-80B teacher model, then fine-tunes a student model using:

~17K triples from:

Mark Kashirskiy, Artiom Lipinski, Ilya Makarov

