Voozh

VOOZH

URL: https://huggingface.co/RuleReasoner/RuleReasoner-4B

⇱ RuleReasoner/RuleReasoner-4B · Hugging Face

If you use the model in your research, please cite the original papers as below.

@article{liu2025rulereasoner,
 title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling}, 
 author={Yang Liu and Jiaqi Li and Zilong Zheng},
 year={2025},
 eprint={2506.08672},
 archivePrefix={arXiv},
 primaryClass={cs.CL},
 url={https://arxiv.org/abs/2506.08672}, 
}

Code: https://github.com/bigai-nlco/RuleReasoner

Downloads last month: 27

Safetensors

Model size

4B params

Tensor type

BF16

·

Model tree for RuleReasoner/RuleReasoner-4B

Base model

Qwen/Qwen3-4B-Base

Finetuned

(337)

this model

Quantizations

Dataset used to train RuleReasoner/RuleReasoner-4B

Paper for RuleReasoner/RuleReasoner-4B

Paper • 2506.08672 • Published Jun 10, 2025 • 30