❗️❗️❗️NOTICE: For optimal performance, we refrain from fine-tuning the model's identity. Thus, inquiries such as "Who are you" or "Who developed you" may yield random responses that are not necessarily accurate.

This is the official f16 GGUF files for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat.

Updates

🚀🚀🚀 [May 26, 2024] We now introduce Mistral-7B-v0.3-Chinese-Chat, which is the first model fine-tuned specifically for Chinese and English users based on mistralai/Mistral-7B-Instruct-v0.3! Full-parameter fine-tuned on a mixed Chinese-English dataset of ~100K preference pairs, the Chinese ability of our Mistral-7B-v0.3-Chinese-Chat is significantly better than mistralai/Mistral-7B-Instruct-v0.3! Besides, our Mistral-7B-v0.3-Chinese-Chat has great performance in mathematics, roleplay, tool use, etc.
🔥 We provide the official q4 GGUF version of Mistral-7B-v0.3-Chinese-Chat at shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-q4!
🔥 We provide the official q8 GGUF version of Mistral-7B-v0.3-Chinese-Chat at shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-q8!
🔥 We provide the official f16 GGUF version of Mistral-7B-v0.3-Chinese-Chat at shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16!

Model Summary

Mistral-7B-v0.3-Chinese-Chat is an instruction-tuned language model for Chinese & English users with various abilities such as roleplaying & tool-using built upon the mistralai/Mistral-7B-Instruct-v0.3.

Developers: Shenzhi Wang*, Yaowei Zheng*, Guoyin Wang (in.ai), Shiji Song, Gao Huang. (*: Equal Contribution)

License: Apache License 2.0
Base Model: mistralai/Mistral-7B-Instruct-v0.3
Model Size: 7.25B
Context length: 32K

1. Introduction

This is the first model specifically fine-tuned for Chinese & English user based on the mistralai/Mistral-7B-Instruct-v0.3. The fine-tuning algorithm used is ORPO [1].

Compared to the original mistralai/Mistral-7B-Instruct-v0.3, our Mistral-7B-v0.3-Chinese-Chat model significantly reduces the issues of "Chinese questions with English answers" and the mixing of Chinese and English in responses.

[1] Hong, Jiwoo, Noah Lee, and James Thorne. "Reference-free Monolithic Preference Optimization with Odds Ratio." arXiv preprint arXiv:2403.07691 (2024).

Training framework: LLaMA-Factory.

Training details:

epochs: 3
learning rate: 3e-6
learning rate scheduler type: cosine
Warmup ratio: 0.1
cutoff len (i.e. context length): 32768
orpo beta (i.e. $\lambda$ in the ORPO paper): 0.05
global batch size: 128
fine-tuning type: full parameters
optimizer: paged_adamw_32bit

2. Usage

from transformers import pipeline

messages = [
 {
 "role": "system",
 "content": "You are a helpful assistant.",
 },
 {"role": "user", "content": "简要地介绍一下什么是机器学习"},
]
chatbot = pipeline(
 "text-generation",
 model="shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat",
 max_length=32768,
)
print(chatbot(messages))

3. Examples

The following are some examples generated by our Mistral-7B-v0.3-Chinese-Chat, including examples of role playing, function calling, math, RuoZhiBa (弱智吧), safety, writing, and coding, etc.

Citation

If our Mistral-7B-v0.3-Chinese-Chat is helpful, please kindly cite as:

@misc {shenzhi_wang_2024,
 author = {Wang, Shenzhi and Zheng, Yaowei and Wang, Guoyin and Song, Shiji and Huang, Gao},
 title = { Mistral-7B-v0.3-Chinese-Chat (Revision 754841d) },
 year = 2024,
 url = { https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat },
 doi = { 10.57967/hf/2317 },
 publisher = { Hugging Face }
}

Downloads last month: 43

GGUF

Model size

7B params

Architecture

llama

Hardware compatibility

16-bit

Model tree for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

Base model

mistralai/Mistral-7B-v0.3

Finetuned

mistralai/Mistral-7B-Instruct-v0.3

Quantized

(263)

this model

Space using shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16 1

Collection including shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

4 items • Updated Jul 26, 2024 • 3

Paper for shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

Paper • 2403.07691 • Published Mar 12, 2024 • 73

URL: https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16

⇱ shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat-f16 · Hugging Face