URL: https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat

shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat


❗️❗️❗️NOTICE: For optimal performance, we refrain from fine-tuning the model's identity. Thus, inquiries such as "Who are you" or "Who developed you" may yield random responses that are not necessarily accurate.

Model Summary

Mistral-7B-v0.3-Chinese-Chat is an instruction-tuned language model for Chinese and English users, with abilities such as role-playing and tool use. It is built on mistralai/Mistral-7B-Instruct-v0.3.

Developers: Shenzhi Wang*, Yaowei Zheng*, Guoyin Wang (in.ai), Shiji Song, Gao Huang. (*: Equal Contribution)

1. Introduction

This is the first model fine-tuned specifically for Chinese and English users on top of mistralai/Mistral-7B-Instruct-v0.3. The fine-tuning algorithm is ORPO [1].

Compared to the original mistralai/Mistral-7B-Instruct-v0.3, our Mistral-7B-v0.3-Chinese-Chat model significantly reduces the issues of "Chinese questions with English answers" and the mixing of Chinese and English in responses.

[1] Hong, Jiwoo, Noah Lee, and James Thorne. "Reference-free Monolithic Preference Optimization with Odds Ratio." arXiv preprint arXiv:2403.07691 (2024).
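
To make the ORPO objective concrete, the following is a minimal sketch of the per-example loss as described in [1]: a supervised (negative log-likelihood) term on the chosen response plus a weighted odds-ratio term that favors the chosen response over the rejected one. It is an illustration only, not the LLaMA-Factory implementation. The function names and the use of plain Python floats are assumptions; the inputs are assumed to be length-normalized (average per-token) log-probabilities, and `beta=0.05` is the value listed in the training details.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is an average per-token log-probability and must be < 0,
    so that p < 1 and the odds are finite.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def orpo_loss(avg_logp_chosen: float, avg_logp_rejected: float, beta: float = 0.05) -> float:
    """Sketch of the ORPO loss for one (chosen, rejected) pair.

    loss = NLL(chosen) + beta * (-log sigmoid(log odds ratio))
    """
    sft_term = -avg_logp_chosen
    log_odds_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(z)) equals softplus(-z).
    odds_ratio_term = softplus(-log_odds_ratio)
    return sft_term + beta * odds_ratio_term
```

The odds-ratio term shrinks as the chosen response becomes more likely than the rejected one, while the supervised term keeps the model fitting the chosen response; no separate reference model is needed.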

Training framework: LLaMA-Factory.

Training details:

  • epochs: 3
  • learning rate: 3e-6
  • learning rate scheduler type: cosine
  • warmup ratio: 0.1
  • cutoff len (i.e. context length): 32768
  • orpo beta (i.e. $\lambda$ in the ORPO paper): 0.05
  • global batch size: 128
  • fine-tuning type: full parameters
  • optimizer: paged_adamw_32bit
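
As an illustration of how the learning rate, scheduler type, and warmup ratio above interact, here is a small sketch of a cosine schedule with linear warmup. It assumes the common formulation (linear ramp from 0 to the peak rate, then cosine decay to 0); the exact rounding of the warmup step count in the training framework may differ, and `total_steps` is a hypothetical value because the card does not state the number of optimizer steps.

```python
import math


def lr_at_step(step: int, total_steps: int, peak_lr: float = 3e-6, warmup_ratio: float = 0.1) -> float:
    """Learning rate at a given optimizer step (sketch, not the framework's code)."""
    warmup_steps = int(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear warmup from 0 up to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Cosine decay from peak_lr down to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total_steps = 1000  # hypothetical; not stated in the model card
    for step in (0, 50, 100, 550, 1000):
        print(step, lr_at_step(step, total_steps))
```

With a warmup ratio of 0.1, the first 10% of steps ramp the rate up to 3e-6, after which it decays smoothly toward zero.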

2. Usage

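from transformers import pipeline

# Chat history in the role/content message format.
messages = [
    {
        "role": "system",
        "content": "You are a helpful assistant.",
    },
    # User prompt: "Briefly explain what machine learning is."
    {"role": "user", "content": "简要地介绍一下什么是机器学习"},
]

# Build a text-generation pipeline for the model.
# max_length uses the same value as the training cutoff length (32768).
chatbot = pipeline(
    "text-generation",
    model="shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat",
    max_length=32768,
)
print(chatbot(messages))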
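
The pipeline call above prints the raw result object. The sketch below shows one way to pull out just the assistant's reply and continue a multi-turn conversation. It assumes a recent transformers release in which a chat-style input returns a list whose first item has a "generated_text" field containing the full message list; older versions may return a plain string instead, which the helper also handles. The helper names `last_reply` and `chat_turn` are hypothetical.

```python
def last_reply(outputs):
    """Return the assistant text from a text-generation pipeline result."""
    generated = outputs[0]["generated_text"]
    if isinstance(generated, str):
        # Plain-string output: nothing to unpack.
        return generated
    # Chat-style output: the last message is the new assistant turn.
    return generated[-1]["content"]


def chat_turn(chatbot, messages, user_text):
    """Append a user message, call the pipeline, and return the updated history."""
    messages = messages + [{"role": "user", "content": user_text}]
    reply = last_reply(chatbot(messages))
    return messages + [{"role": "assistant", "content": reply}]
```

For example, `history = chat_turn(chatbot, [{"role": "system", "content": "You are a helpful assistant."}], "你好")` would return the history with the assistant's reply appended, ready to be passed to the next `chat_turn` call.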

3. Examples

The following examples were generated by our Mistral-7B-v0.3-Chinese-Chat. They cover role-playing, function calling, math, RuoZhiBa (弱智吧), safety, writing, and coding.








Citation

If our Mistral-7B-v0.3-Chinese-Chat is helpful, please cite it as:

@misc{shenzhi_wang_2024,
  author    = {Wang, Shenzhi and Zheng, Yaowei and Wang, Guoyin and Song, Shiji and Huang, Gao},
  title     = {Mistral-7B-v0.3-Chinese-Chat (Revision 754841d)},
  year      = 2024,
  url       = {https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat},
  doi       = {10.57967/hf/2317},
  publisher = {Hugging Face}
}

Model size: 7B params. Tensor type: BF16. Weights format: Safetensors.