Model Card for lianghsun/Llama-3.2-Taiwan-3B-Instruct

[👋 歡迎加入 Discord 討論 🎉]

👁 image/png
圖像生成來自 OpenArt：An anime-style 🦙 standing proudly atop the summit of Taiwan’s Yushan (Jade Mountain), gazing forward.

採用 lianghsun/Llama-3.2-Taiwan-3B 為基礎模型（foundation model），使用大量中華民國台灣的繁體中文對話集和多國語言對話集進行模型指令微調（instruction fine-tuning）和多輪迭代直接偏好優化（direct preference optimization, DPO），旨在訓練出具有中華民國台灣知識及風格的小語言模型（small langugae model, SLM）之對話模型。

Model Details

Model Description

Developed by: Huang Liang Hsun
Model type: LlamaForCausalLM
Language(s) (NLP): Tranditional Chinese (zh-tw), English
License: llama3.2
Fine-tuned from model: lianghsun/Llama-3.2-Taiwan-3B

Model Sources

Repository: lianghsun/Llama-3.2-Taiwan-3B
Paper: (WIP, show me the time)
Playground: 🦉 Tawian SmolLM Chat 👈🏼 來玩看看 😻
Demo:

user: 請介紹台灣
assistant: 台灣，位於亞洲東部，地處太平洋與菲律賓海之間，面積約36,000平方公里，人口約2,300萬，是民主自由的國家，經濟實力強勁，擁有世界第10大經濟體。台灣以美食、文化、自然美景著稱，還有豐富的歷史與傳統，吸引全球遊客。台灣語為官方語言，但中文也廣為使用，英語也常用於國際交流。台灣政治多元，執政黨為民進黨，台灣是全球科技產業的重鎮，擁有先進的製造業與服務業。台灣氣候溫暖潮濕，四季分明，夏季炎熱，冬季涼爽，雨季則在5月至10月。台灣的美食以小吃為主，如滷肉飯、珍珠

Uses

Direct Use

本模型已經具備有繁體中文對話能力，使用者可以直接部署推論端點使用。

Downstream Use

若需強化模型在特定領域的知識，可透過微調進一步提升其性能與專業能力。

Out-of-Scope Use

本模型旨在提供資訊，不參與任何政治或法律問題的評斷或立場表達。

Bias, Risks, and Limitations

語言模型的生成內容可能因訓練集的多樣性而帶有偏見、特定立場，或包含與事實不符的言論，請使用者務必在使用過程中仔細確認內容的準確性與中立性。

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

要使用 vLLM Docker image 來啟動此模型，您可以按照以下操作：

docker run --runtime nvidia --gpus all \
 -v ~/.cache/huggingface:/root/.cache/huggingface \
 --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
 -p 8000:8000 \
 --ipc=host \
 vllm/vllm-openai:latest \
 --model lianghsun/Llama-3.2-Taiwan-3B-Instruct

請注意，如果想要使用不同版本的 checkpoint，請加上 --revision <tag_name>

docker run --runtime nvidia --gpus all \
 -v ~/.cache/huggingface:/root/.cache/huggingface \
 --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
 -p 8000:8000 \
 --ipc=host \
 vllm/vllm-openai:latest \
 --model lianghsun/Llama-3.2-Taiwan-3B-Instruct --revision <tag_name>

Training Details

Training Data

Training Procedure

Preprocessing

(WIP)

Training Hyperparameters

Speeds, Sizes, Times

Evaluation

Testing Data, Factors & Metrics

Model Examination [optional]

[More Information Needed]

Environmental Impact

Hardware Type: 🚀
Hours used: ⏳⏳⌛
Cloud Provider: 鴻鵠國際股份有限公司
Compute Region: 🇹🇼
Carbon Emitted: ♻️

Technical Specifications

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

CPU count: 32
Logical CPU count: 64
GPU count: 4
GPU type: NVIDIA H100 NVL

Software

OS version: Linux-5.15.0-124-generic-x86_64-with-glibc2.35
Python version: 3.12.7

Citation

@misc{lianghsun2024llama32taiwan3binstruct,
 author = {Huang, Liang Hsun},
 title = {Llama-3.2-Taiwan-3B-Instruct},
 year = {2024},
 publisher = {Hugging Face},
 howpublished = {\url{https://huggingface.co/lianghsun/Llama-3.2-Taiwan-3B-Instruct}},
 note = {Accessed: 2024-11-25}
}

Glossary [optional]

N/A

More Information

Acknowledge

👁 image/png
在此致謝鴻鵠國際股份有限公司蔡長明先生無償地贊助算力，以及曾經幫忙過：廖振翔、chweng、Ben、kevin、Maxxchu、Lam 和陳林彥…等朋友們，才能讓這個模型得以訓練完成，提供算力者乃人生父母。

Usage

如果你基於此指令模型進行微調，希望能不吝嗇在 模型卡片(model card) 裡標註 基礎模型 為：

base_model: lianghsun/Llama-3.2-Taiwan-3B-Instruct

標註和 ❤️ 是給予我們最大的鼓勵，謝謝。😀

Model Card Authors

Huang Liang Hsun

Model Card Contact

Huang Liang Hsun

Framework versions

Transformers 4.45.2
Pytorch 2.4.1+cu121
Datasets 2.21.0
Tokenizers 0.20.0

Downloads last month: 108

Safetensors

Model size

4B params

Tensor type

BF16

Model tree for lianghsun/Llama-3.2-Taiwan-3B-Instruct

Base model

meta-llama/Llama-3.2-3B

Finetuned

lianghsun/Llama-3.2-Taiwan-3B

Finetuned

(3)

this model

Finetunes

1 model

Quantizations

5 models

Datasets used to train lianghsun/Llama-3.2-Taiwan-3B-Instruct

Spaces using lianghsun/Llama-3.2-Taiwan-3B-Instruct 23

Collection including lianghsun/Llama-3.2-Taiwan-3B-Instruct

Based on the meta-llama/Llama-3.2-*B model, we continue pre-training on a large corpus of Traditional Chinese and non-Chinese language data. • 6 items • Updated Mar 2 • 1

Paper for lianghsun/Llama-3.2-Taiwan-3B-Instruct

Paper • 2305.18290 • Published May 29, 2023 • 66

Evaluation results

single choice on tw-legal-benchmark-v1
self-reported
31.100
single choice on (Society) Formosa Taiwan Knowledge Bench
test set self-reported
60.420
single choice on (Governmnt) Formosa Taiwan Knowledge Bench
test set self-reported
44.250
single choice on (Geography) Formosa Taiwan Knowledge Bench
test set self-reported
47.540
single choice on (History) Formosa Taiwan Knowledge Bench
test set self-reported
60.000
single choice on (geography_of_taiwan) tmmlu++
test set self-reported
36.200
single choice on (dentistry) tmmlu++
test set self-reported
33.830
single choice on (technical) tmmlu++
test set self-reported
35.070

URL: https://huggingface.co/lianghsun/Llama-3.2-Taiwan-3B-Instruct

⇱ lianghsun/Llama-3.2-Taiwan-3B-Instruct · Hugging Face

Model Card for lianghsun/Llama-3.2-Taiwan-3B-Instruct

Model Details

Model Description

Model Sources

Uses

Direct Use

Downstream Use

Out-of-Scope Use

Bias, Risks, and Limitations

Recommendations

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Preprocessing

Training Hyperparameters

Speeds, Sizes, Times

Evaluation

Testing Data, Factors & Metrics

Model Examination [optional]

Environmental Impact

Technical Specifications

Model Architecture and Objective

Compute Infrastructure

Hardware

Software

Citation

Glossary [optional]

More Information

Acknowledge

Usage

Model Card Authors

Model Card Contact

Framework versions

Model tree for lianghsun/Llama-3.2-Taiwan-3B-Instruct

Datasets used to train lianghsun/Llama-3.2-Taiwan-3B-Instruct

Spaces using lianghsun/Llama-3.2-Taiwan-3B-Instruct 23

Collection including lianghsun/Llama-3.2-Taiwan-3B-Instruct

Paper for lianghsun/Llama-3.2-Taiwan-3B-Instruct

Evaluation results