VOOZH about

URL: https://huggingface.co/datasets/mkurman/medical-reasoning-synthlabs-I

⇱ mkurman/medical-reasoning-synthlabs-I · Datasets at Hugging Face


query
stringlengths
5
325
reasoning
stringlengths
891
4.99k
reasoning_content
stringlengths
891
6.67k
answer
stringlengths
64
15k
modelUsed
stringclasses
2 values
isMultiTurn
bool
1 class
alcohol withdrawal
### 1. Query Decomposition The query "alcohol withdrawal" requires a structured explanation of its **definition**, **risk factors**, **clinical progression**, **management**, and **red flags**. The user likely seeks actionable information for clinical decision-making, patient education, or personal health awareness. ...
### 1. Query Decomposition The query "alcohol withdrawal" requires a structured explanation of its **definition**, **risk factors**, **clinical progression**, **management**, and **red flags**. The user likely seeks actionable information for clinical decision-making, patient education, or personal health awareness. ...
Alcohol withdrawal is a potentially life-threatening condition characterized by physical and mental symptoms following abrupt cessation or reduction of heavy alcohol use. It arises from the central nervous system (CNS) adapting to alcohol’s depressant effects, leading to hyperexcitability when alcohol is removed. Key a...
other/Qwen/Qwen3-14B
false
general parasitology
### 1. Query Decomposition The input query "general parasitology" requires a structured overview of the field, encompassing definitions, classifications, life cycles, and clinical relevance. → Break down into core components: - Definition and scope of parasitology - Host-parasite relationship dynamics - Paras...
### 1. Query Decomposition The input query "general parasitology" requires a structured overview of the field, encompassing definitions, classifications, life cycles, and clinical relevance. → Break down into core components: - Definition and scope of parasitology - Host-parasite relationship dynamics - Paras...
**GENERAL PARASITOLOGY** A comprehensive overview of the biology, ecology, and impact of parasites. --- ### **1. Definition and Scope** - **Parasitology**: The study of parasites, their hosts, and the interactions between them. - **Parasite**: An organism that lives temporarily or permanently on/in a **host**, deri...
other/Qwen/Qwen3-14B
false
collection of dermatological disorders
### 1. Query Decomposition The query seeks a **clinically-oriented collection of dermatological disorders** organized for quick reference. This implies the need to categorize conditions by **morphology** (e.g., papulosquamous, vesiculobullous) and **etiology** (e.g., autoimmune, infectious), with emphasis on **key di...
### 1. Query Decomposition The query seeks a **clinically-oriented collection of dermatological disorders** organized for quick reference. This implies the need to categorize conditions by **morphology** (e.g., papulosquamous, vesiculobullous) and **etiology** (e.g., autoimmune, infectious), with emphasis on **key di...
Here is a clinically-oriented **collection of common dermatological disorders**, grouped by **morphology** and **etiology**, with classic features and key associations. Use it as a quick-reference atlas or foundation for deeper study. --- ### **1. Papulosquamous Disorders** | **Disorder** | **Key Features** | **Class...
other/Qwen/Qwen3-14B
false
strain counterstrain
### 1. Query Decomposition The query "strain counterstrain" seeks an explanation of a therapeutic technique. This requires identifying: - **Definition**: What is SCS? - **Mechanism**: How does it work? - **Application**: Conditions treated and benefits. - **Comparison**: How it differs from other methods. ...
### 1. Query Decomposition The query "strain counterstrain" seeks an explanation of a therapeutic technique. This requires identifying: - **Definition**: What is SCS? - **Mechanism**: How does it work? - **Application**: Conditions treated and benefits. - **Comparison**: How it differs from other methods. ...
**Strain Counterstrain (SCS)** is a **manual therapy technique** used to treat **muscle and joint pain** by **releasing tender points** without aggressive stretching or thrusting. ### **Key Concepts:** - **Origin:** Developed by **Lawrence Jones, DO** in the 1950s. - **Mechanism:** Positions the body to **shorten the ...
other/Qwen/Qwen3-14B
false
mechanical ventilation
``` ### 1. Query Decomposition The query "mechanical ventilation" → requires a structured overview of its clinical application, technical modes, and associated risks. ● Mechanical ventilation is a critical care intervention for respiratory failure → decompose into: 1. **Definition and purpose** 2. **Clinical ind...
### 1. Query Decomposition The query "mechanical ventilation" → requires a structured overview of its clinical application, technical modes, and associated risks. ● Mechanical ventilation is a critical care intervention for respiratory failure → decompose into: 1. **Definition and purpose** 2. **Clinical indicat...
**Mechanical Ventilation** Mechanical ventilation is a life-support technique that assists or replaces spontaneous breathing when a patient cannot maintain adequate gas exchange (oxygenation and CO₂ removal). It involves delivering controlled breaths via a ventilator through an artificial airway (e.g., endotracheal ...
other/Qwen/Qwen3-14B
false
abdominal examination
### 1. Query Decomposition The query "abdominal examination" → request for a structured clinical procedure guide. ※ Constraint: Focus on standard medical practice (not specialized subspecialties). ? Ambiguity: Does the user need a step-by-step guide, mnemonics, or differential diagnosis highlights? --- ### 2....
### 1. Query Decomposition The query "abdominal examination" → request for a structured clinical procedure guide. ※ Constraint: Focus on standard medical practice (not specialized subspecialties). ? Ambiguity: Does the user need a step-by-step guide, mnemonics, or differential diagnosis highlights? --- ### 2....
Abdominal examination is a systematic, step-wise assessment of the abdomen that is performed as part of a routine physical exam or when a patient presents with abdominal complaints. It is traditionally divided into **four classic steps**—**inspection, auscultation, percussion, and palpation**—in that order (except in a...
other/Qwen/Qwen3-14B
false
head and neck region
### 1. Query Decomposition The query "head and neck region" → anatomical overview request ※ Constraint: Focus on structural and functional components ? Ambiguity: Potential clinical relevance vs. basic anatomy ### 2. Knowledge Assessment ● The head and neck region includes cranial bones, facial bones, and ce...
### 1. Query Decomposition The query "head and neck region" → anatomical overview request ※ Constraint: Focus on structural and functional components ? Ambiguity: Potential clinical relevance vs. basic anatomy ### 2. Knowledge Assessment ● The head and neck region includes cranial bones, facial bones, and ce...
The **head and neck region** is a complex anatomical area that includes vital structures for sensory, motor, and autonomic functions. Here's a concise overview: ### **1. Bones & Skull** - **Cranium**: Protects the brain (frontal, parietal, temporal, occipital, sphenoid, ethmoid). - **Facial bones**: Maxilla, mandible,...
other/Qwen/Qwen3-14B
false
cancer of unknown primary
### 1. Query Analysis The query seeks an explanation of "Cancer of Unknown Primary (CUP)" and its clinical management. The goal is to synthesize key diagnostic, therapeutic, and prognostic information from the provided response into a structured reasoning trace. ### 2. Knowledge Assessment The provided model res...
### 1. Query Analysis The query seeks an explanation of "Cancer of Unknown Primary (CUP)" and its clinical management. The goal is to synthesize key diagnostic, therapeutic, and prognostic information from the provided response into a structured reasoning trace. ### 2. Knowledge Assessment The provided model res...
Cancer of Unknown Primary (CUP) Quick Definition CUP is a metastatic cancer for which standardized imaging, laboratory tests, endoscopy, and histopathology fail to identify the primary site. Key Points for Clinicians and Patients 1. Epidemiology • ~3-5 % of all invasive cancers. • Median age at diagnosis...
other/Qwen/Qwen3-14B
false
case 38 ankle pain
### 1. Query Decomposition The query "case 38 ankle pain" → clinical case analysis for ankle pain. ※ Constraint: No specific details provided (location, mechanism, duration). ? Ambiguity: Could refer to a textbook case, patient presentation, or diagnostic challenge. --- ### 2. Knowledge Assessment ● Ankle p...
### 1. Query Decomposition The query "case 38 ankle pain" → clinical case analysis for ankle pain. ※ Constraint: No specific details provided (location, mechanism, duration). ? Ambiguity: Could refer to a textbook case, patient presentation, or diagnostic challenge. --- ### 2. Knowledge Assessment ● Ankle p...
**Case 38: Ankle Pain** **Brief Symptom:** Ankle pain (location, onset, severity, and mechanism not specified). --- ### **Differential Diagnosis** Ankle pain can stem from **traumatic**, **inflammatory**, **infectious**, **degenerative**, or **referred** causes. Key considerations: #### **1. Traumatic (Most ...
other/Qwen/Qwen3-14B
false
case 22 problems with swallowing
### 1. Query Decomposition The query "case 22 problems with swallowing" refers to a clinical scenario involving **dysphagia** (difficulty swallowing). The model response provides a structured approach to evaluating and managing this condition. → Break down the case into: 1. Clarifying the patient’s specific swall...
### 1. Query Decomposition The query "case 22 problems with swallowing" refers to a clinical scenario involving **dysphagia** (difficulty swallowing). The model response provides a structured approach to evaluating and managing this condition. → Break down the case into: 1. Clarifying the patient’s specific swall...
Case 22 – Problems with swallowing (dysphagia) 1. Clarify the complaint • “Swallowing problems” can mean: – Difficulty starting the swallow (oral/pharyngeal) – Food sticking or pain on the way down (esophageal) – Coughing, choking, nasal regurgitation, aspiration pneumonia (neuromuscular) ...
other/Qwen/Qwen3-14B
false
cerebellum
### 1. Query Decomposition The query "cerebellum" targets a specific anatomical structure in neuroscience. To address it, the response must cover: - **Anatomical location**: Where is the cerebellum situated relative to other brain regions? - **Functional roles**: What are its primary and emerging functions? - *...
### 1. Query Decomposition The query "cerebellum" targets a specific anatomical structure in neuroscience. To address it, the response must cover: - **Anatomical location**: Where is the cerebellum situated relative to other brain regions? - **Functional roles**: What are its primary and emerging functions? - *...
The **cerebellum** is a distinct structure of the brain located at the back of the skull, underneath the **cerebrum** and behind the **brainstem**. It plays a crucial role in: - **Motor control**: Coordinating voluntary movements like posture, balance, and coordination. - **Motor learning**: Helping refine movements t...
other/Qwen/Qwen3-14B
false
nausea and vomiting
### 1. Query Decomposition The query "nausea and vomiting" is a clinical symptom set requiring differential diagnosis. This necessitates: - Identifying **red flags** for urgent conditions (e.g., intracranial pathology, metabolic derangements) - Listing **common causes** (gastroenteritis, medications, pregnancy) ...
### 1. Query Decomposition The query "nausea and vomiting" is a clinical symptom set requiring differential diagnosis. This necessitates: - Identifying **red flags** for urgent conditions (e.g., intracranial pathology, metabolic derangements) - Listing **common causes** (gastroenteritis, medications, pregnancy) ...
Nausea and vomiting can stem from many causes—ranging from benign (motion sickness, overeating) to serious (infections, medication side effects, intracranial pressure, metabolic disorders). Here’s a concise clinical framework: **1. Red flags (seek immediate care):** - Severe, sudden-onset headache or stiff neck - ...
other/Qwen/Qwen3-14B
false
gastrointestinal tract
### 1. Query Decomposition The input query "gastrointestinal tract" requires a structured explanation of its anatomy, function, and related disorders. This involves: - Defining the GI tract and its role in digestion → - Segmenting it into upper and lower sections with specific organs → - Including accessory org...
### 1. Query Decomposition The input query "gastrointestinal tract" requires a structured explanation of its anatomy, function, and related disorders. This involves: - Defining the GI tract and its role in digestion → - Segmenting it into upper and lower sections with specific organs → - Including accessory org...
The **gastrointestinal (GI) tract**—also called the **digestive tract** or **alimentary canal**—is a continuous, muscular tube that extends from the **mouth to the anus**, responsible for **ingesting**, **digesting**, **absorbing**, and **eliminating** food. It is divided into two main sections: --- ### **Upper GI Tr...
other/Qwen/Qwen3-14B
false
diagnostic evaluation of the kidney and urinary tract
### 1. Query Decomposition The query seeks a structured diagnostic approach for kidney and urinary tract evaluation. This involves identifying essential tests, imaging modalities, and clinical scenarios. The response provides a comprehensive roadmap, so the analysis must map its components to clinical utility. ###...
### 1. Query Decomposition The query seeks a structured diagnostic approach for kidney and urinary tract evaluation. This involves identifying essential tests, imaging modalities, and clinical scenarios. The response provides a comprehensive roadmap, so the analysis must map its components to clinical utility. ###...
DIAGNOSTIC EVALUATION OF THE KIDNEY & URINARY TRACT (A concise “what-to-order-when” roadmap for clinicians) ──────────────────────────────────────── 1. INITIAL SCREEN – EVERY CASE • Urinalysis (dipstick + microscopy) • Serum creatinine & eGFR • Serum electrolytes (Na, K, Cl, HCO₃) • CBC • Bloo...
other/Qwen/Qwen3-14B
false
japanese encephalitis
``` ### 1. Query Decomposition The query "Japanese encephalitis" requests a comprehensive overview of the disease, including its **etiology, transmission, clinical features, diagnosis, treatment, and prevention**. → Breakdown: - **Definition**: Viral disease caused by JEV (● confirmed in response). - **Transmiss...
### 1. Query Decomposition The query "Japanese encephalitis" requests a comprehensive overview of the disease, including its **etiology, transmission, clinical features, diagnosis, treatment, and prevention**. → Breakdown: - **Definition**: Viral disease caused by JEV (● confirmed in response). - **Transmission*...
**Japanese Encephalitis (JE)** is a serious viral **mosquito-borne** disease that primarily affects the **central nervous system**, causing **inflammation of the brain (encephalitis)**. It is caused by the **Japanese encephalitis virus (JEV)**, a flavivirus related to dengue, Zika, and West Nile viruses. --- ### **Ke...
other/Qwen/Qwen3-14B
false
basics of embryology
### 1. Query Decomposition The query "basics of embryology" requires a structured overview of developmental stages from fertilization to organ formation. ● Embryology focuses on the transformation of a single fertilized cell into a complex organism. → Break down into key stages: gametogenesis, fertilization, cleav...
### 1. Query Decomposition The query "basics of embryology" requires a structured overview of developmental stages from fertilization to organ formation. ● Embryology focuses on the transformation of a single fertilized cell into a complex organism. → Break down into key stages: gametogenesis, fertilization, cleav...
Embryology studies how a single fertilized cell becomes a complex, multicellular organism. Here is a concise, step-by-step overview of the basics. 1. Gametogenesis (formation of the germ cells) • Spermatogenesis – in testes, diploid spermatogonia → haploid spermatozoa (64 days). • Oogenesis – in ovaries, p...
other/Qwen/Qwen3-14B
false
ovarian cysts
### 1. Query Decomposition The input query "ovarian cysts" is a broad topic requiring a structured breakdown. It necessitates addressing: - **Definition and prevalence** (are they common? benign?) - **Types** (functional vs. pathological, subtypes like follicular, dermoid, etc.) - **Symptoms** (asymptomatic vs....
### 1. Query Decomposition The input query "ovarian cysts" is a broad topic requiring a structured breakdown. It necessitates addressing: - **Definition and prevalence** (are they common? benign?) - **Types** (functional vs. pathological, subtypes like follicular, dermoid, etc.) - **Symptoms** (asymptomatic vs....
Ovarian cysts are fluid-filled sacs or pockets that develop on or inside an ovary. They’re extremely common—most women will have one at some point—and the vast majority are harmless, cause no symptoms, and resolve on their own within a few menstrual cycles. Below is a concise overview covering what you need to know. -...
other/Qwen/Qwen3-14B
false
diencephalon
### 1. Query Decomposition The query "diencephalon" requests an explanation of this brain structure, including its **location, major components, functions, connections, and clinical significance**. → This requires organizing information into **anatomical regions**, **physiological roles**, and **pathological implic...
### 1. Query Decomposition The query "diencephalon" requests an explanation of this brain structure, including its **location, major components, functions, connections, and clinical significance**. → This requires organizing information into **anatomical regions**, **physiological roles**, and **pathological implic...
The **diencephalon** is a region of the **forebrain** (prosencephalon) located between the **brainstem** and the **cerebral hemispheres**. It plays a crucial role in **sensory processing, motor control, hormone regulation, and homeostasis**. --- ### **Major Structures of the Diencephalon** 1. **Thalamus** - **Fun...
other/Qwen/Qwen3-14B
false
the leg ankle and foot
### 1. Query Decomposition The query "the leg ankle and foot" → seeks anatomical overview of lower limb regions ※ Constraint: Focus on structural components (bones, muscles, joints) ? Ambiguity: Could imply clinical relevance (e.g., injuries, pathologies) ### 2. Fact Retrieval ● **Leg (Crural Region)**: ...
### 1. Query Decomposition The query "the leg ankle and foot" → seeks anatomical overview of lower limb regions ※ Constraint: Focus on structural components (bones, muscles, joints) ? Ambiguity: Could imply clinical relevance (e.g., injuries, pathologies) ### 2. Fact Retrieval ● **Leg (Crural Region)**: ...
### **Anatomy of the Leg, Ankle, and Foot** #### **1. Leg (Crural Region)** - **Bones**: - **Tibia** (medial, weight-bearing) - **Fibula** (lateral, muscle attachment) - **Joints**: - **Superior tibiofibular joint** (synovial plane joint) - **Inferior tibiofibular joint** (syndesmosis) - **Musc...
other/Qwen/Qwen3-14B
false
kidneys
### 1. Query Decomposition The input query "kidneys" is a broad topic requiring a structured, comprehensive overview. To address this, the response must cover **anatomy**, **functions**, **common issues**, **symptoms**, and **prevention**. ※ Constraint: Information must be accurate, accessible, and organized for ge...
### 1. Query Decomposition The input query "kidneys" is a broad topic requiring a structured, comprehensive overview. To address this, the response must cover **anatomy**, **functions**, **common issues**, **symptoms**, and **prevention**. ※ Constraint: Information must be accurate, accessible, and organized for ge...
**Kidneys: Your Body’s Built-In Filtration System** Kidneys are two bean-shaped organs, each about the size of a fist, located just below your rib cage on either side of your spine. Despite their small size, they play a massive role in keeping your body healthy and balanced. --- ### **What Do Kidneys Do?** 1. **Filt...
other/Qwen/Qwen3-14B
false
laboratory methods
``` ### 1. Query Decomposition The query "laboratory methods" is a high-level, domain-agnostic term (●). It lacks specificity in three critical dimensions: 1. **Scientific discipline** (e.g., biology vs. chemistry) 2. **Technical scope** (e.g., basic techniques vs. advanced instrumentation) 3. **Intended use** (...
### 1. Query Decomposition The query "laboratory methods" is a high-level, domain-agnostic term (●). It lacks specificity in three critical dimensions: 1. **Scientific discipline** (e.g., biology vs. chemistry) 2. **Technical scope** (e.g., basic techniques vs. advanced instrumentation) 3. **Intended use** (e.g....
“Laboratory methods” is a very broad term—every scientific discipline has its own set of techniques, instruments, and protocols. To give you the most useful answer, could you tell me a bit more about what you need? - **Field or application** (e.g., molecular biology, analytical chemistry, microbiology, clinical diagno...
other/Qwen/Qwen3-14B
false
psoriasis
### 1. Query Decomposition The query "psoriasis" seeks a comprehensive overview of the condition. This requires breaking down the topic into core components: definition, clinical presentation, types, associated conditions, triggers, treatment modalities, lifestyle considerations, and urgent referral criteria. ### ...
### 1. Query Decomposition The query "psoriasis" seeks a comprehensive overview of the condition. This requires breaking down the topic into core components: definition, clinical presentation, types, associated conditions, triggers, treatment modalities, lifestyle considerations, and urgent referral criteria. ### ...
Psoriasis is a chronic, immune-mediated skin disorder that causes thick, scaly (often silvery-white) plaques, redness, itching, and sometimes joint or nail involvement. It is **not contagious** and tends to **wax and wane** (flare-ups followed by periods of improvement). Key points at a glance 1. **What it looks lik...
other/Qwen/Qwen3-14B
false
End of preview. Expand in Data Studio

Medical Reasoning SynthLabs I

Medical Reasoning SynthLabs I is a synthetic medical reasoning dataset generated with SynthLabs. It contains medical question-answer examples paired with structured reasoning traces, intended for research on reasoning-style instruction tuning, medical QA, answer synthesis, and reasoning trace analysis.

The dataset is designed for machine learning research and experimentation. It is not intended for clinical decision-making, diagnosis, treatment planning, or use as a source of medical advice.

Dataset Details

  • Repository: mkurman/medical-reasoning-synthlabs-I
  • Rows: 856 examples
  • Format: Parquet
  • Split: train
  • Modality: Text
  • Generation model label: DEEP: qwen3-synth
  • Primary task type: Medical question answering with reasoning

Dataset Structure

The dataset contains one split:

Split Rows
train 856

Columns

Column Type Description
query string The medical question, case prompt, or task instruction.
reasoning string A structured reasoning trace generated for the query.
answer string The final answer or conclusion.
modelUsed string The model/generation pipeline label used to create the example.

Example Usage

from datasets import load_dataset

dataset = load_dataset("mkurman/medical-reasoning-synthlabs-I")

print(dataset)
print(dataset["train"][0])

Intended Uses

This dataset may be useful for:

  • Fine-tuning language models on structured medical reasoning.
  • Training models to produce concise answers from longer reasoning traces.
  • Evaluating reasoning-trace formatting and answer consistency.
  • Building synthetic-data pipelines for domain-specific QA.
  • Researching medical instruction tuning and reasoning supervision.

Out-of-Scope Uses

This dataset should not be used for:

  • Direct clinical diagnosis or treatment recommendations.
  • Replacing medical professionals or validated clinical guidelines.
  • Patient-facing medical advice without expert review.
  • High-stakes medical deployment without rigorous validation.
  • Benchmarking real-world clinical safety without additional expert evaluation.

Data Generation

The examples were generated using SynthLabs in a medical reasoning format. Each sample contains a medical query, a structured reasoning trace, and a final answer.

The visible generation label in the dataset is:

DEEP: qwen3-synth

This dataset was created through a two-stage synthetic generation process.

First, medical topics were provided to moonshotai/kimi-k2, which generated long-form medical text about those topics. These generated texts were then used as source material for a second synthesis step.

In the second step, qwen/Qwen3-14B was used to generate SynthLabs-style reasoning and answer pairs based on the long-form medical texts. Each final example contains a medical query, a reasoning trace, and a final answer.

Finally, the generated examples were evaluated and filtered using deepseek-ai/DeepSeek-V3 to improve overall quality and remove lower-quality generations.

This means the dataset should be understood as synthetic medical reasoning data derived from model-generated medical topic expansions, rather than from human-authored clinical cases or real patient records.

⚠️ Because the source medical text, reasoning traces, and answer pairs were generated through language-model-based synthesis, the dataset may still contain inaccuracies even after final evaluation and filtering. The deepseek-ai/DeepSeek-V3 filtering step is intended to improve quality, but it should not be treated as clinical validation.

Limitations

Because this is a synthetic dataset, it may contain:

  • Medical inaccuracies.
  • Hallucinated facts or unsupported assumptions.
  • Overconfident reasoning.
  • Incomplete differential diagnoses.
  • Reasoning traces that appear plausible but are not clinically validated.
  • Biases inherited from the source prompts, generation model, or synthesis pipeline.

Users should perform medical expert review before using this dataset for any health-related application.

Ethical Considerations

Medical reasoning datasets require extra caution because incorrect outputs can cause harm if used in real-world healthcare settings. This dataset is intended for research, prototyping, and model development only.

Recommended safeguards include:

  • Expert clinical review.
  • Automated and manual quality checks.
  • Filtering for unsafe or misleading generations.
  • Clear downstream disclaimers.
  • Evaluation against trusted medical references and guidelines.

Citation

If you use this dataset, please cite it as:

@misc{kurman2026medical_reasoning_synthlabs_i,
 title = {Medical Reasoning SynthLabs I},
 author = {Mariusz Kurman},
 year = {2026},
 publisher = {Hugging Face},
 howpublished = {\url{https://huggingface.co/datasets/mkurman/medical-reasoning-synthlabs-I}}
}

License

This dataset is released under the Apache 2.0 license.

Disclaimer

This dataset is for research and educational use only. It is not medical advice and should not be used for diagnosis, treatment, triage, or other clinical decision-making.

Downloads last month
56

Models trained or fine-tuned on mkurman/medical-reasoning-synthlabs-I