VOOZH about

URL: https://huggingface.co/macadeliccc/Samantha-Qwen-2-7B

⇱ macadeliccc/Samantha-Qwen-2-7B · Hugging Face


Samantha Qwen2 7B

Trained on 2x4090 using QLoRa and FSDP

Launch Using VLLM

python -m vllm.entrypoints.openai.api_server \
 --model macadeliccc/Samantha-Qwen-2-7B \
 --chat-template ./examples/template_chatml.jinja \
from openai import OpenAI
# Set OpenAI's API key and API base to use vLLM's API server.
openai_api_key = "EMPTY"
openai_api_base = "http://localhost:8000/v1"

client = OpenAI(
 api_key=openai_api_key,
 base_url=openai_api_base,
)

chat_response = client.chat.completions.create(
 model="macadeliccc/Samantha-Qwen-2-7B",
 messages=[
 {"role": "system", "content": "You are a helpful assistant."},
 {"role": "user", "content": "Tell me a joke."},
 ]
)
print("Chat response:", chat_response)

Prompt Template

<|im_start|>system
You are a friendly assistant.<|im_end|>
<|im_start|>user
What is the capital of France?<|im_end|>
<|im_start|>assistant
The capital of France is Paris.

Quants

👁 Built with Axolotl


Downloads last month
10
Safetensors
Model size
8B params
Tensor type
F16
·

Model tree for macadeliccc/Samantha-Qwen-2-7B

Base model

Qwen/Qwen2-7B
Finetuned
(76)
this model
Merges
3 models
Quantizations
8 models

Datasets used to train macadeliccc/Samantha-Qwen-2-7B

Spaces using macadeliccc/Samantha-Qwen-2-7B 2