Document AGILLM3.5 runtime code
Browse files
README.md
CHANGED
|
@@ -20,6 +20,15 @@ Single full file per snapshot (each round is a block merge, not a delta).
|
|
| 20 |
Checkpoint dict keys: `core` (backbone), `ar`, `sat` (heads), `cfg`, embedded
|
| 21 |
`tokenizer_json`, plus `disagg_updates` (merge provenance) on the distributed master.
|
| 22 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
## Inference
|
| 24 |
Load with the AGILLM nB300 code (`infer --mode ar|sat`); the tokenizer round-trips from the
|
| 25 |
embedded `tokenizer_json`.
|
|
|
|
|
|
|
|
|
| 20 |
Checkpoint dict keys: `core` (backbone), `ar`, `sat` (heads), `cfg`, embedded
|
| 21 |
`tokenizer_json`, plus `disagg_updates` (merge provenance) on the distributed master.
|
| 22 |
|
| 23 |
+
|
| 24 |
+
## Code
|
| 25 |
+
|
| 26 |
+
- `agillm35.py` - single-file AGILLM3.5 runtime for training/status/inference.
|
| 27 |
+
- `distributed/public_join/` - public signed-lease host and outbound worker scripts for untrusted joiners.
|
| 28 |
+
- `distributed/inference/agillm35_distributed_infer.py` - phase-1 distributed AR inference harness for transformer/MoE/DiffusionBlock layer stages.
|
| 29 |
+
|
| 30 |
## Inference
|
| 31 |
Load with the AGILLM nB300 code (`infer --mode ar|sat`); the tokenizer round-trips from the
|
| 32 |
embedded `tokenizer_json`.
|
| 33 |
+
Distributed AR inference can split contiguous transformer/DiffusionBlock layer ranges across local and HTTP worker stages. The network payload path uses a raw tensor wire format rather than unpickling remote worker responses; use TLS and a bearer token outside localhost.
|
| 34 |
+
|