
URL: https://huggingface.co/RuleReasoner/RuleReasoner-4B

RuleReasoner/RuleReasoner-4B · Hugging Face


If you use this model in your research, please cite the original paper:

@article{liu2025rulereasoner,
 title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling}, 
 author={Yang Liu and Jiaqi Li and Zilong Zheng},
 year={2025},
 eprint={2506.08672},
 archivePrefix={arXiv},
 primaryClass={cs.CL},
 url={https://arxiv.org/abs/2506.08672}, 
}

Code: https://github.com/bigai-nlco/RuleReasoner

Downloads last month: 27
Format: Safetensors
Model size: 4B params
Tensor type: BF16
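The figures above (4B parameters stored as BF16) give a rough idea of how much memory the weights need. The sketch below works through that arithmetic and shows one plausible way to load the checkpoint. It assumes the repository loads through the standard `transformers` auto classes, which this page does not confirm; `PARAM_COUNT` is the rounded "4B" figure rather than an exact count, and the helper names are illustrative.

```python
MODEL_ID = "RuleReasoner/RuleReasoner-4B"  # repository name from this page
PARAM_COUNT = 4_000_000_000  # rounded "4B params" figure; exact count not given here
BYTES_PER_BF16 = 2  # bfloat16 stores each value in 16 bits


def estimated_weight_gib(param_count: int = PARAM_COUNT,
                         bytes_per_param: int = BYTES_PER_BF16) -> float:
    """Rough size of the weights alone in GiB (excludes activations and KV cache)."""
    return param_count * bytes_per_param / 2**30


def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model in BF16.

    Assumption: the checkpoint works with the generic causal-LM auto classes.
    Imports are local so the size estimate runs without torch installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


if __name__ == "__main__":
    # Only the estimate runs here; call load_model() explicitly to download weights.
    print(f"Approximate weight size: {estimated_weight_gib():.2f} GiB")
```

The estimate covers the weights only, so actual memory use during inference will be higher. Calling `load_model()` downloads the checkpoint from the Hub on first use.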

Model tree for RuleReasoner/RuleReasoner-4B

Finetuned (337): this model
Quantizations: 1 model
