VOOZH about

URL: https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

⇱ shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16 · Hugging Face


❗️❗️❗️NOTICE: For optimal performance, we refrain from fine-tuning the model's identity. Thus, inquiries such as "Who are you" or "Who developed you" may yield random responses that are not necessarily accurate.

This is the official f16 GGUF files for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat.

Updates

Model Summary

Mistral-7B-v0.3-Chinese-Chat is an instruction-tuned language model for Chinese & English users with various abilities such as roleplaying & tool-using built upon the mistralai/Mistral-7B-Instruct-v0.3.

Developers: Shenzhi Wang*, Yaowei Zheng*, Guoyin Wang (in.ai), Shiji Song, Gao Huang. (*: Equal Contribution)

1. Introduction

This is the first model specifically fine-tuned for Chinese & English user based on the mistralai/Mistral-7B-Instruct-v0.3. The fine-tuning algorithm used is ORPO [1].

Compared to the original mistralai/Mistral-7B-Instruct-v0.3, our Mistral-7B-v0.3-Chinese-Chat model significantly reduces the issues of "Chinese questions with English answers" and the mixing of Chinese and English in responses.

[1] Hong, Jiwoo, Noah Lee, and James Thorne. "Reference-free Monolithic Preference Optimization with Odds Ratio." arXiv preprint arXiv:2403.07691 (2024).

Training framework: LLaMA-Factory.

Training details:

  • epochs: 3
  • learning rate: 3e-6
  • learning rate scheduler type: cosine
  • Warmup ratio: 0.1
  • cutoff len (i.e. context length): 32768
  • orpo beta (i.e. $\lambda$ in the ORPO paper): 0.05
  • global batch size: 128
  • fine-tuning type: full parameters
  • optimizer: paged_adamw_32bit

2. Usage

from transformers import pipeline

messages = [
 {
 "role": "system",
 "content": "You are a helpful assistant.",
 },
 {"role": "user", "content": "简要地介绍一下什么是机器学习"},
]
chatbot = pipeline(
 "text-generation",
 model="shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat",
 max_length=32768,
)
print(chatbot(messages))

3. Examples

The following are some examples generated by our Mistral-7B-v0.3-Chinese-Chat, including examples of role playing, function calling, math, RuoZhiBa (弱智吧), safety, writing, and coding, etc.








Citation

If our Mistral-7B-v0.3-Chinese-Chat is helpful, please kindly cite as:

@misc {shenzhi_wang_2024,
 author = {Wang, Shenzhi and Zheng, Yaowei and Wang, Guoyin and Song, Shiji and Huang, Gao},
 title = { Mistral-7B-v0.3-Chinese-Chat (Revision 754841d) },
 year = 2024,
 url = { https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat },
 doi = { 10.57967/hf/2317 },
 publisher = { Hugging Face }
}
Downloads last month
43
GGUF
Model size
7B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

16-bit

Model tree for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

Quantized
(263)
this model

Space using shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16 1

Collection including shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

Paper for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16