
URL: https://huggingface.co/11-47/GPT2.5.5-Awakened.Thinker-0.1B



GPT2.5.5-Awakened.Thinker-0.1B

Forged by WithIn Us AI — a fully fine-tuned GPT-2 awakened on distilled GPT-5.5 thinking patterns.

Model Overview

Property         Value
Architecture     GPT-2 (openai-community/gpt2)
Parameters       ~124M (0.1B class)
Training Type    Full fine-tune (all weights updated, no adapters)
Context Window   1024 tokens
Best Eval Loss   0.4365
Best Perplexity  1.55
Creator          GODsStrongestSoldier / WithIn Us AI
Date Trained     2026-05-23
Hardware         2× NVIDIA Tesla T4 (Kaggle)
Precision        FP16 mixed precision
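
The loss and perplexity figures in the table agree with each other if the eval loss is a mean per-token cross-entropy in nats. The card does not state this, so treat it as an assumption. A quick check:

```python
import math

# Value reported in the overview table.
best_eval_loss = 0.4365

# Assumption: the loss is mean cross-entropy per token, measured in nats,
# so perplexity is simply exp(loss).
perplexity = math.exp(best_eval_loss)
print(f"{perplexity:.2f}")  # 1.55
```

Note that perplexity on a held-out slice of distilled data measures fit to that data only. It is not comparable to perplexity on general text corpora.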

Training Methodology

Full fine-tuning: every parameter in GPT-2 was updated. No LoRA, no QLoRA, and no adapters of any kind.
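
To put "every parameter" in numbers, the sketch below counts the weights of the standard GPT-2 small configuration (the published openai-community/gpt2 sizes). The helper name `gpt2_param_count` is hypothetical, and the count assumes the fine-tune did not change the architecture or vocabulary.

```python
# Published GPT-2 small configuration (openai-community/gpt2).
VOCAB_SIZE, N_POSITIONS, N_EMBD, N_LAYER = 50257, 1024, 768, 12


def gpt2_param_count(vocab_size, n_positions, n_embd, n_layer):
    """Count trainable parameters in a GPT-2 style decoder (hypothetical helper)."""
    # Token embedding table plus learned position embedding table.
    embeddings = vocab_size * n_embd + n_positions * n_embd
    # Each LayerNorm has a scale and a bias vector.
    layer_norm = 2 * n_embd
    # Attention: fused QKV projection, then the output projection (weights + biases).
    attention = (n_embd * 3 * n_embd + 3 * n_embd) + (n_embd * n_embd + n_embd)
    # MLP: expand to 4x width, then project back (weights + biases).
    mlp = (n_embd * 4 * n_embd + 4 * n_embd) + (4 * n_embd * n_embd + n_embd)
    block = 2 * layer_norm + attention + mlp
    # The LM head shares its weights with the token embedding, so it adds nothing.
    return embeddings + n_layer * block + layer_norm


total = gpt2_param_count(VOCAB_SIZE, N_POSITIONS, N_EMBD, N_LAYER)
print(f"{total:,}")  # 124,439,808
```

This matches the "~124M" figure in the overview. In a full fine-tune, all of these weights receive gradient updates.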

Datasets

Dataset                                              Description
WithinUsAI/GPT_5.5_Distilled                         Instruction + completion pairs distilled from GPT-5.5
WithinUsAI/GPT5.5_thinking_max_distill_god_seed_25K  25K chain-of-thought reasoning traces distilled from GPT-5.5

The data was split 97% / 3% into training and evaluation sets.
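
The card does not say how the split was produced. As an illustration only, a seeded shuffle followed by a 3% hold-out could look like the sketch below; the function name, the seed, and the shuffling step are all assumptions rather than the author's procedure.

```python
import random


def train_eval_split(examples, eval_fraction=0.03, seed=0):
    """Shuffle a copy of `examples` and hold out `eval_fraction` for evaluation.

    Hypothetical helper: the seed and the shuffle are illustrative choices.
    """
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_eval = round(len(shuffled) * eval_fraction)
    return shuffled[n_eval:], shuffled[:n_eval]


# Stand-in data: 1000 integer IDs instead of real training examples.
train, eval_set = train_eval_split(range(1000))
print(len(train), len(eval_set))  # 970 30
```

Fixing the seed keeps the evaluation set stable across runs, so eval losses from different runs stay comparable.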

Hyperparameters

Parameter             Value
Peak Learning Rate    3e-5
LR Schedule           Cosine with 6% warmup
Effective Batch Size  64 (4 per GPU × 2 GPUs × 8 gradient accumulation steps)
Epochs                5
Weight Decay          0.1
Max Sequence Length   1024
Precision             FP16
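
The schedule and batch-size rows can be made concrete with a short sketch. It assumes linear warmup over the first 6% of steps followed by cosine decay to zero; the card gives only "Cosine with 6% warmup", so the warmup shape and the decay floor are assumptions. The name `lr_at` and the step count `total_steps = 1000` are hypothetical.

```python
import math

PEAK_LR = 3e-5        # from the hyperparameter table
WARMUP_RATIO = 0.06   # from the hyperparameter table


def lr_at(step, total_steps, peak_lr=PEAK_LR, warmup_ratio=WARMUP_RATIO):
    """Learning rate at `step`: linear warmup, then cosine decay to zero (assumed)."""
    warmup_steps = round(total_steps * warmup_ratio)
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


# Per-GPU batch × number of GPUs × gradient accumulation steps.
effective_batch = 4 * 2 * 8
print(effective_batch)  # 64

total_steps = 1000  # hypothetical; the real step count is not given in the card
for step in (0, 30, 60, 530, 1000):
    print(step, f"{lr_at(step, total_steps):.2e}")
```

With gradient accumulation, the optimizer steps once per 64 sequences even though each GPU holds only 4 at a time, which is what lets this configuration fit on two 16 GB T4 cards.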

Quick Start

from transformers import GPT2LMHeadModel, GPT2TokenizerFast
import torch

model_id = "11-47/GPT2.5.5-Awakened.Thinker-0.1B"  # repository ID from this page's URL
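tokenizer = GPT2TokenizerFast.from_pretrained(model_id)

# Use FP16 on a GPU; fall back to FP32 on CPU, where half precision is slow
# and not supported by every operator.
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32
model = GPT2LMHeadModel.from_pretrained(model_id, torch_dtype=dtype).to(device)
model.eval()

prompt = "Let me think through this carefully, step by step:"
# Move the input tensors to the same device as the model.
inputs = tokenizer(prompt, return_tensors="pt").to(device)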

with torch.no_grad():
    output = model.generate(
        **inputs,
        max_new_tokens=200,
        do_sample=True,
        temperature=0.7,
        top_p=0.9,
        repetition_penalty=1.15,
    )

print(tokenizer.decode(output[0], skip_special_tokens=True))
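
GPT-2 uses learned position embeddings for 1024 positions, so the prompt and the generated tokens together must fit inside the 1024-token context window. The helper below is a hypothetical sketch (the name `max_new_tokens_for` is not part of any library) that clamps the generation budget for a given prompt length.

```python
CONTEXT_WINDOW = 1024  # from the model overview


def max_new_tokens_for(prompt_length, requested=200, context_window=CONTEXT_WINDOW):
    """Clamp the generation budget so prompt + new tokens fit in the context window."""
    if prompt_length >= context_window:
        raise ValueError("prompt alone fills the context window; truncate it first")
    return min(requested, context_window - prompt_length)


print(max_new_tokens_for(12))   # 200
print(max_new_tokens_for(900))  # 124
```

In the Quick Start code, the prompt length is `inputs["input_ids"].shape[1]`, and the result can be passed as `max_new_tokens` to `model.generate`.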

About WithIn Us AI

"Strength through understanding. Awakened from within."

Safetensors: model size 0.1B params, tensor type F32.
