OzTianlu commited on
Commit
eede50b
·
verified ·
1 Parent(s): 800f1ab

Upload folder using huggingface_hub

Browse files
README.md CHANGED
@@ -102,7 +102,7 @@ A 3B-parameter instruction-tuned language model optimized for reasoning, math, a
102
 
103
  ## What is ADS?
104
 
105
- **Adaptive Dual-Search Distillation** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
106
 
107
  ## Benchmark Results
108
 
 
102
 
103
  ## What is ADS?
104
 
105
+ **Adaptive Dual-Search Distillation (自适应对偶搜索蒸馏)** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
106
 
107
  ## Benchmark Results
108
 
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e98ab017eab910e19466dfa5ce298e1720470c563efb1efc686b314239257a34
3
  size 4966315264
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e5a15717e85d23851ded51752efb28ce8296403925f91d01005bfff4c587b308
3
  size 4966315264
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:abc6d445d8b0a2bf107b4502636b27da5c023ce89d41ca4399eb6ace288efe86
3
  size 1183919744
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:30034dea2d5a297abd0648af528b0c1755e8aaa7085d63965dfa5e0ca3759976
3
  size 1183919744