VOOZH about

URL: https://huggingface.co/datasets/hyunseoki/qwen3-0p6b-openthoughts-self-distill-1k

⇱ hyunseoki/qwen3-0p6b-openthoughts-self-distill-1k · Datasets at Hugging Face


Dataset Preview
Duplicate
problem
string
prompt
string
generation
string
extracted_answer
string
gt_answer
string
passed_gt_match
bool
domain
string
source
string
difficulty
int64
label_char_len
int64
dataset_name
string
task_type
string
task_format
string
Let $g(m)$ be the product of the positive integer divisors of $m$. For how many values of $m$, where $1 \le m \le 30$, is $g(m)$ a perfect square?
<|im_start|>system You are a careful problem solver. Think step by step and put your final answer inside \boxed{} at the end of your response.<|im_end|> <|im_start|>user Let $g(m)$ be the product of the positive integer divisors of $m$. For how many values of $m$, where $1 \le m \le 30$, is $g(m)$ a perfect square?<|im...
<think> Okay, so I need to figure out how many numbers between 1 and 30 inclusive have their product of positive divisors, denoted as g(m), be a perfect square. Hmm, let me start by recalling some properties of divisors and perfect squares. First, I remember that for any number m, the product of its positive divisors ...
23
13
false
math
ai2-adapt-dev/openmath-2-math
null
42,650
self_distill_openthoughts
reason
generation
Find the number of triangles with integer sides that have one side equal to the arithmetic mean of the other two sides.
<|im_start|>system You are a careful problem solver. Think step by step and put your final answer inside \boxed{} at the end of your response.<|im_end|> <|im_start|>user Find the number of triangles with integer sides that have one side equal to the arithmetic mean of the other two sides.<|im_end|> <|im_start|>assistan...
<think> Okay, so I need to find the number of triangles with integer sides where one side is the arithmetic mean of the other two. Let me start by recalling what an arithmetic mean is. If I have two sides, say a and b, their arithmetic mean is (a + b)/2. So, the third side, which is supposed to be the arithmetic mean, ...
finite, and I need to check again
\infty
false
math
ai2-adapt-dev/openmath-2-math
null
46,563
self_distill_openthoughts
reason
generation
A circle of radius $r$ is inscribed in a square. Two circles of radius $r$ are inscribed in two of the triangles formed by the center of the square and two of its adjacent vertices. These two circles are tangent to each other and to the original circle. What is the length of the segment $PQ$, which is the diameter of...
<|im_start|>system You are a careful problem solver. Think step by step and put your final answer inside \boxed{} at the end of your response.<|im_end|> <|im_start|>user A circle of radius $r$ is inscribed in a square. Two circles of radius $r$ are inscribed in two of the triangles formed by the center of the square an...
<think> Okay, so there's this problem about a square with an inscribed circle of radius r. Then there are two more circles of the same radius r inscribed in two of the triangles formed by the center of the square and two adjacent vertices. These two circles are tangent to each other and to the original circle. We need ...
2r
2r(1 + \sqrt{2})
false
math
ai2-adapt-dev/openmath-2-math
null
54,103
self_distill_openthoughts
reason
generation
A circle is inscribed in an isosceles triangle, whose vertex angle is 60 degrees and whose base measures 8 units. Determine the perimeter of the triangle if one of its legs is a tangent to the inscribed circle at a point which divides the leg into two segments, whose lengths are in the ratio 1:3.
<|im_start|>system You are a careful problem solver. Think step by step and put your final answer inside \boxed{} at the end of your response.<|im_end|> <|im_start|>user A circle is inscribed in an isosceles triangle, whose vertex angle is 60 degrees and whose base measures 8 units. Determine the perimeter of the trian...
<think> Okay, so I need to find the perimeter of an isosceles triangle. The triangle has a vertex angle of 60 degrees and a base of 8 units. Also, one of its legs is tangent to the inscribed circle at a point that divides the leg into two segments with lengths in the ratio 1:3. Hmm, let me try to visualize this. First...
24
\dfrac{56}{3}
false
math
ai2-adapt-dev/openmath-2-math
null
29,529
self_distill_openthoughts
reason
generation
"Let $A,$ $B,$ $C$ be the angles of a triangle. Evaluate\n\\[\\begin{vmatrix} \\sin A & \\cos B & \(...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, so I need to evaluate the determinant of this 3x3 matrix:\n\n$$\n\\begin{vmatrix} \n(...TRUNCATED)
-0.375
null
null
math
ai2-adapt-dev/openmath-2-math
null
44,599
self_distill_openthoughts
reason
generation
"Consider the sequence $2000, y, 2000 - y,\\ldots$, where each term after the first two is obtained (...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, so I need to find the positive integer y such that the sequence 2000, y, 2000 - y, .(...TRUNCATED)
2000
1236
false
math
ai2-adapt-dev/openmath-2-math
null
32,982
self_distill_openthoughts
reason
generation
"I'm trying to understand the wave function of two identical particles with spin but unfortunately m(...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, let me try to work through this. So, the user is asking about the wave function for (...TRUNCATED)
the same as for fermions
"\\text{The degeneracy increases due to the orbital spin components combined symmetrically or anti-s(...TRUNCATED)
false
science
stackexchange-physics
null
12,736
self_distill_openthoughts
reason
generation
"Find the sum of all positive integers that can be expressed as a palindrome in both base 2 and base(...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, so I need to find all positive integers that can be expressed as palindromes in both(...TRUNCATED)
valid
1
false
math
ai2-adapt-dev/openmath-2-math
null
34,295
self_distill_openthoughts
reason
generation
"The sum of the volumes of all tetrahedrons whose vertices are also vertices of a regular octahedron(...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, so I need to find the sum of the volumes of all tetrahedrons whose vertices are also(...TRUNCATED)
8*(1/3) + 7*0 = 8/3
null
null
math
ai2-adapt-dev/openmath-2-math
null
47,859
self_distill_openthoughts
reason
generation
"A rectangular prism has a total of 16 vertices. Each face of the prism is a rectangle and has eithe(...TRUNCATED)
"<|im_start|>system\nYou are a careful problem solver. Think step by step and put your final answer (...TRUNCATED)
"<think>\nOkay, let's try to figure out this problem. So, we have a rectangular prism with 16 vertic(...TRUNCATED)
8
null
null
math
ai2-adapt-dev/openmath-2-math
null
56,958
self_distill_openthoughts
reason
generation
End of preview.

Qwen3-0.6B Self-Distillation on OpenThoughts3 (1k)

Self-distillation trajectories generated by Qwen/Qwen3-0.6B on problems sampled from the deduplicated OpenThoughts3 pool (hyunseoki/openthoughts3-dedup-index).

The intended use is reasoning retention supervision in selective-forgetting experiments: the model's own generation distribution is preserved under distillation because the label sequence was produced by the same base model. See the upstream research repo memory_reasoning_split for training-loop integration.

How it was built

  1. Streamed the full 1.2M rows of open-thoughts/OpenThoughts3-1.2M.
  2. Deduplicated by normalized problem text → 65,047 unique questions.
  3. Uniformly sampled 1,000 unique problems (seed=13) across all three upstream domains (math, code, science).
  4. Rendered prompts with the Qwen3 chat template + an explicit solver system prompt, then generated one trajectory per problem with vLLM using Qwen3's recommended thinking-mode sampling: temperature=0.6, top_p=0.95, top_k=20, min_p=0.0, max_new_tokens=16384.
  5. Extracted a final answer from each generation via \boxed{...} / "final answer" heuristics.
  6. Compared the extracted answer against the upstream solver's boxed GT when available.

Schema

Field Type Notes
problem str Original problem text
prompt str Chat-templated prompt actually fed to the model
generation str Raw model output (the label_text / distillation target)
extracted_answer str Final-answer extraction from generation
gt_answer str or null \boxed{...} extracted from the upstream solver's response; null when absent
passed_gt_match bool or null true/false when gt_answer is present; null when no GT was available (e.g. code-golf problems) — this is not a correctness signal, it means "could not be compared"
domain str math / code / science
source str Upstream source (e.g. ai2-adapt-dev/openmath-2-math)
difficulty str or null Upstream difficulty
label_char_len int Character length of generation
dataset_name str self_distill_openthoughts
task_type str reason
task_format str generation

Summary statistics

  • Total rows: 1000
  • By domain:
    • math: 803
    • science: 105
    • code: 92
  • passed_gt_match values:
    • true: 144
    • false: 560
    • blank: 296
  • Pass rate by domain (within rows that have GT):
    • math: 134/600 = 22.3% (+ 203 rows without GT)
    • science: 8/83 = 9.6% (+ 22 rows without GT)
    • code: 2/21 = 9.5% (+ 71 rows without GT)

Notes

  • No train/eval split is shipped; downstream consumers should split as needed. The whole 1,000-row corpus is intended as one coherent distillation set for the 0.6B self-retention experiments.
  • passed_gt_match is a nullable boolean column: true, false, or null. null means "could not be compared" (no upstream boxed GT available) — it is not a correctness signal. In particular most code rows and any math/science row whose upstream solver response lacks a boxed target will be null.
  • For a sandboxed code-correctness evaluation see the discussion in the research repo — not included here.

Reproducibility

  • Sampling seed: 13
  • Base model: Qwen/Qwen3-0.6B
  • Sampling config: Qwen3 thinking-mode defaults
  • Upstream dedup index: hyunseoki/openthoughts3-dedup-index (commit-pinned on HF)

License / Attribution

Problems and GT answers are derived from open-thoughts/OpenThoughts3-1.2M; please follow its license terms. The generated trajectories are model output from Qwen/Qwen3-0.6B.

Downloads last month
18