PredictLM v11.0 + Mini ship-bundle
Browse files
README.md
CHANGED
|
@@ -82,23 +82,10 @@ That's it. On the first `.predict()` call the package silently downloads its par
|
|
| 82 |
**Edge cases:**
|
| 83 |
|
| 84 |
- **No internet / air-gapped.** Pass `auto_duo=False` at load to disable partner download — `.predict()` returns the single-model in-context result.
|
| 85 |
-
- **Want explicit Duo control** (custom `w`, `n_inner`, manual orchestration)? Use the explicit `duo_ttt_predict(mini, base, ...)` helper documented below.
|
| 86 |
- **Real-time inference** (<10 ms latency)? Use `auto_duo=False` zero-tuning. Duo + TTT adds ~1-60 s per query depending on table size.
|
| 87 |
|
| 88 |
**TTT** ([Test-Time Training](https://arxiv.org/abs/2503.11842), grounded in TabPFN-2.5's [recipe](https://arxiv.org/abs/2511.08667)) does ~15 inner Adam steps of self-supervised fine-tuning on the user's in-context examples before predicting. Per-task specialization on top of a generic ICL prior. 19 / 20 datasets improved vs zero-tuning; no dataset regressed by more than 0.006.
|
| 89 |
|
| 90 |
-
### Advanced — explicit Duo + TTT (manual orchestration)
|
| 91 |
-
|
| 92 |
-
```python
|
| 93 |
-
from predictlm import PredictLM, duo_ttt_predict
|
| 94 |
-
|
| 95 |
-
mini = PredictLM.from_pretrained("zerooneresearch/predictlm-mini-13m", auto_duo=False)
|
| 96 |
-
base = PredictLM.from_pretrained("zerooneresearch/predictlm-base-26m", auto_duo=False)
|
| 97 |
-
preds = duo_ttt_predict(mini, base, X_train, y_train, X_test, w=0.40, n_inner=15)
|
| 98 |
-
```
|
| 99 |
-
|
| 100 |
-
Same numerical result as the default `.predict()`, but you control `w` (mini logit weight), `n_inner`, `lr`, etc.
|
| 101 |
-
|
| 102 |
## Developers and affiliations
|
| 103 |
|
| 104 |
- **Developed by**: ZeroOne Research
|
|
|
|
| 82 |
**Edge cases:**
|
| 83 |
|
| 84 |
- **No internet / air-gapped.** Pass `auto_duo=False` at load to disable partner download — `.predict()` returns the single-model in-context result.
|
|
|
|
| 85 |
- **Real-time inference** (<10 ms latency)? Use `auto_duo=False` zero-tuning. Duo + TTT adds ~1-60 s per query depending on table size.
|
| 86 |
|
| 87 |
**TTT** ([Test-Time Training](https://arxiv.org/abs/2503.11842), grounded in TabPFN-2.5's [recipe](https://arxiv.org/abs/2511.08667)) does ~15 inner Adam steps of self-supervised fine-tuning on the user's in-context examples before predicting. Per-task specialization on top of a generic ICL prior. 19 / 20 datasets improved vs zero-tuning; no dataset regressed by more than 0.006.
|
| 88 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 89 |
## Developers and affiliations
|
| 90 |
|
| 91 |
- **Developed by**: ZeroOne Research
|