Text Generation
PEFT
Safetensors
GGUF
English
clinicalthought-ai-8b
ClinicalThought-AI-8B
medical-ai
healthcare-ai
clinical-reasoning
chain-of-thought
diagnostic-support
differential-diagnosis
clinical-decision-making
medical-education
reasoning-model
8b
clinical-ai
medical-diagnosis
healthcare-llm
quantized
fine-tuned
lora
medical-nlp
clinical-support
healthcare-professional
evidence-based-medicine
conversational
Update Training/Training_Documentation.txt
Browse files
Training/Training_Documentation.txt
CHANGED
|
@@ -13,9 +13,12 @@ Training Dataset: Custom curated dataset for medical reasoning
|
|
| 13 |
Dataset Specifications
|
| 14 |
---------------------
|
| 15 |
|
| 16 |
-
Total Token Count:
|
| 17 |
Total Sample Count: 29,500
|
| 18 |
-
Average Tokens/Sample:
|
|
|
|
|
|
|
|
|
|
| 19 |
Dataset Creation: Created from a combination of public medical reasoning datasets from OpenAI o1 and DeepSeek-R1, along with additional reasoning chains created using Claude Sonnet 4 extended thinking
|
| 20 |
|
| 21 |
Training Configuration
|
|
|
|
| 13 |
Dataset Specifications
|
| 14 |
---------------------
|
| 15 |
|
| 16 |
+
Total Token Count: 31,929,580
|
| 17 |
Total Sample Count: 29,500
|
| 18 |
+
Average Tokens/Sample: 1082.36
|
| 19 |
+
Max Token Count: 9,803
|
| 20 |
+
Min Token Count: 237
|
| 21 |
+
Tokens Counted Using: tiktoken (cl100k_base encoding)
|
| 22 |
Dataset Creation: Created from a combination of public medical reasoning datasets from OpenAI o1 and DeepSeek-R1, along with additional reasoning chains created using Claude Sonnet 4 extended thinking
|
| 23 |
|
| 24 |
Training Configuration
|