Update README.md
README.md CHANGED
@@ -102,7 +102,7 @@ A 3B-parameter instruction-tuned language model optimized for reasoning, math, a
 
 ## What is ADS?
 
-**Adaptive Dual-Search Distillation
+**Adaptive Dual-Search Distillation** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
 
 ## Benchmark Results
 
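The added paragraph describes a loss with a stateful dual penalty but gives no formula. As a rough illustration only, the sketch below shows one way such a mechanism could look: a Lagrangian-style penalty on entropy, with the multiplier updated by projected dual ascent. Everything here is an assumption and not taken from the README: the names `DualPenalty` and `penalized_loss`, the `target_entropy` and `step_size` parameters, and the use of output-distribution entropy as a stand-in for the "embedding space entropy" the README mentions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


class DualPenalty:
    """Hypothetical stateful multiplier, updated by projected dual ascent."""

    def __init__(self, target_entropy, step_size=0.1, init=0.0):
        self.target_entropy = target_entropy  # assumed constraint level
        self.step_size = step_size            # assumed dual step size
        self.value = init                     # current penalty factor

    def update(self, observed_entropy):
        # Grow the penalty while entropy exceeds the target, shrink it
        # otherwise, and clip at zero so it never rewards uncertainty.
        violation = observed_entropy - self.target_entropy
        self.value = max(0.0, self.value + self.step_size * violation)
        return self.value


def penalized_loss(logits, target_index, penalty):
    """Cross-entropy plus penalty * (entropy - target); then update penalty."""
    probs = softmax(logits)
    cross_entropy = -math.log(probs[target_index])
    h = entropy(probs)
    loss = cross_entropy + penalty.value * (h - penalty.target_entropy)
    penalty.update(h)  # state carries over to the next training step
    return loss, h


if __name__ == "__main__":
    penalty = DualPenalty(target_entropy=0.5, step_size=0.1)
    for step in range(3):
        # Uniform logits stand in for an uncertain prediction.
        loss, h = penalized_loss([0.0, 0.0, 0.0], 0, penalty)
        print(step, round(loss, 4), round(h, 4), round(penalty.value, 4))
```

In this sketch the penalty factor keeps rising for as long as predictions stay above the entropy target, so uncertain positions are weighted more heavily over time. The model itself is untouched, which matches the README's statement that the architecture is not modified.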