Upload folder using huggingface_hub
- README.md +1 -1
- model-00001-of-00002.safetensors +1 -1
- model-00002-of-00002.safetensors +1 -1
README.md
CHANGED
@@ -102,7 +102,7 @@ A 3B-parameter instruction-tuned language model optimized for reasoning, math, a
 
 ## What is ADS?
 
-**Adaptive Dual-Search Distillation** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
+**Adaptive Dual-Search Distillation (自适应对偶搜索蒸馏)** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
 
 ## Benchmark Results
 
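Note on the README hunk: the text describes a loss with a stateful dual penalty factor that reacts to entropy, but it does not give the formula. The sketch below is only one plausible reading, not the authors' implementation. It assumes a standard Lagrangian form (cross-entropy plus a multiplier times an entropy-constraint violation) with a projected dual-ascent update, and it uses the entropy of the output distribution as a stand-in for the "embedding space entropy" the README mentions. The names `DualPenalty`, `target_entropy`, and `step_size` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


class DualPenalty:
    """Hypothetical stateful multiplier for the constraint entropy <= target_entropy."""

    def __init__(self, target_entropy, step_size=0.1):
        self.target_entropy = target_entropy  # assumed constraint level
        self.step_size = step_size            # assumed dual step size
        self.value = 0.0                      # multiplier starts inactive

    def loss(self, logits, label):
        """Return (cross-entropy + multiplier * violation, violation)."""
        probs = softmax(logits)
        cross_entropy = -math.log(probs[label])
        violation = entropy(probs) - self.target_entropy
        return cross_entropy + self.value * violation, violation

    def update(self, violation):
        """Projected dual ascent: grow on violation, shrink otherwise, never below 0."""
        self.value = max(0.0, self.value + self.step_size * violation)
        return self.value
```

Under these assumptions the multiplier rises while predictions stay more uncertain than the target and decays back to zero once they become confident, so the extra pressure applies only where the constraint is violated and the model architecture is untouched.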
model-00001-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:e5a15717e85d23851ded51752efb28ce8296403925f91d01005bfff4c587b308
 size 4966315264
model-00002-of-00002.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:30034dea2d5a297abd0648af528b0c1755e8aaa7085d63965dfa5e0ca3759976
 size 1183919744
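Note on the two `.safetensors` entries: what changed in the repository are Git LFS pointer files, each recording a spec version, a SHA-256 `oid`, and a byte `size`, rather than the weights themselves. A downloaded shard can be checked against its pointer with a short script. The sketch below is a minimal illustration, not part of this commit; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algorithm, _, digest = fields.get("oid", "").partition(":")
    return {
        "version": fields.get("version"),
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 digest in `pointer`."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte shards do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]
```

For example, parsing the new pointer text for `model-00002-of-00002.safetensors` and passing the result to `matches_pointer` together with the local path of that shard would confirm the download is complete and uncorrupted.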