01RAI commited on
Commit
0a6920a
·
verified ·
1 Parent(s): d303b8b

PredictLM v11.0 + Mini ship-bundle

Browse files
Files changed (1) hide show
  1. README.md +0 -13
README.md CHANGED
@@ -82,23 +82,10 @@ That's it. On the first `.predict()` call the package silently downloads its par
82
  **Edge cases:**
83
 
84
  - **No internet / air-gapped.** Pass `auto_duo=False` at load to disable partner download — `.predict()` returns the single-model in-context result.
85
- - **Want explicit Duo control** (custom `w`, `n_inner`, manual orchestration)? Use the explicit `duo_ttt_predict(mini, base, ...)` helper documented below.
86
  - **Real-time inference** (<10 ms latency)? Use `auto_duo=False` zero-tuning. Duo + TTT adds ~1-60 s per query depending on table size.
87
 
88
  **TTT** ([Test-Time Training](https://arxiv.org/abs/2503.11842), grounded in TabPFN-2.5's [recipe](https://arxiv.org/abs/2511.08667)) does ~15 inner Adam steps of self-supervised fine-tuning on the user's in-context examples before predicting. Per-task specialization on top of a generic ICL prior. 19 / 20 datasets improved vs zero-tuning; no dataset regressed by more than 0.006.
89
 
90
- ### Advanced — explicit Duo + TTT (manual orchestration)
91
-
92
- ```python
93
- from predictlm import PredictLM, duo_ttt_predict
94
-
95
- mini = PredictLM.from_pretrained("zerooneresearch/predictlm-mini-13m", auto_duo=False)
96
- base = PredictLM.from_pretrained("zerooneresearch/predictlm-base-26m", auto_duo=False)
97
- preds = duo_ttt_predict(mini, base, X_train, y_train, X_test, w=0.40, n_inner=15)
98
- ```
99
-
100
- Same numerical result as the default `.predict()`, but you control `w` (mini logit weight), `n_inner`, `lr`, etc.
101
-
102
  ## Developers and affiliations
103
 
104
  - **Developed by**: ZeroOne Research
 
82
  **Edge cases:**
83
 
84
  - **No internet / air-gapped.** Pass `auto_duo=False` at load to disable partner download — `.predict()` returns the single-model in-context result.
 
85
  - **Real-time inference** (<10 ms latency)? Use `auto_duo=False` zero-tuning. Duo + TTT adds ~1-60 s per query depending on table size.
86
 
87
  **TTT** ([Test-Time Training](https://arxiv.org/abs/2503.11842), grounded in TabPFN-2.5's [recipe](https://arxiv.org/abs/2511.08667)) does ~15 inner Adam steps of self-supervised fine-tuning on the user's in-context examples before predicting. Per-task specialization on top of a generic ICL prior. 19 / 20 datasets improved vs zero-tuning; no dataset regressed by more than 0.006.
88
 
 
 
 
 
 
 
 
 
 
 
 
 
89
  ## Developers and affiliations
90
 
91
  - **Developed by**: ZeroOne Research