OzTianlu committed on
Commit 800f1ab · verified · 1 Parent(s): 3ac4e23

Update README.md

  1. README.md +1 -1
README.md CHANGED
@@ -102,7 +102,7 @@ A 3B-parameter instruction-tuned language model optimized for reasoning, math, a
  ## What is ADS?
- **Adaptive Dual-Search Distillation (自适应对偶搜索蒸馏)** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
+ **Adaptive Dual-Search Distillation** treats model fine-tuning as a constrained optimization problem inspired by Operations Research. The core mechanism is a dynamic loss function with a stateful dual penalty factor that adapts based on embedding space entropy — forcing the model to converge to high-confidence predictions at difficult reasoning points, without modifying the model architecture.
  ## Benchmark Results
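
The mechanism described in the changed README line (a loss with a stateful dual penalty factor that adapts to entropy) can be illustrated with a minimal sketch. This is not the author's implementation. It assumes "entropy" means the Shannon entropy of the softmax output, since the README does not define embedding space entropy, and the class name `DualPenaltyLoss`, the parameters `target_entropy` and `dual_lr`, and the projected dual-ascent update are all hypothetical choices made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


class DualPenaltyLoss:
    """Hypothetical sketch: cross-entropy plus an entropy penalty whose
    weight `lam` is a dual variable carried across calls."""

    def __init__(self, target_entropy=0.5, dual_lr=0.1, lam_init=0.0):
        self.target_entropy = target_entropy  # assumed constraint: H <= target
        self.dual_lr = dual_lr                # assumed step size for the dual update
        self.lam = lam_init                   # stateful dual penalty factor

    def __call__(self, logits, label):
        probs = softmax(logits)
        ce = -math.log(probs[label])
        violation = entropy(probs) - self.target_entropy
        # Penalize only when the prediction is less confident than the target.
        loss = ce + self.lam * max(0.0, violation)
        # Projected dual ascent: lam grows while the constraint is violated,
        # shrinks when it is satisfied, and never drops below zero.
        self.lam = max(0.0, self.lam + self.dual_lr * violation)
        return loss


loss_fn = DualPenaltyLoss()
loss_fn([0.0, 0.0, 0.0, 0.0], label=0)   # uniform prediction: lam increases
loss_fn([10.0, 0.0, 0.0, 0.0], label=0)  # confident prediction: lam decreases
```

In this sketch the penalty weight is state rather than a fixed hyperparameter, so positions where the model stays uncertain accumulate a larger penalty over time. Nothing in the model architecture changes, only the loss.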