OpenTransformer commited on
Commit
f8bd3e3
·
verified ·
1 Parent(s): bafb727

Document AGILLM3.5 runtime code

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -20,6 +20,15 @@ Single full file per snapshot (each round is a block merge, not a delta).
20
  Checkpoint dict keys: `core` (backbone), `ar`, `sat` (heads), `cfg`, embedded
21
  `tokenizer_json`, plus `disagg_updates` (merge provenance) on the distributed master.
22
 
 
 
 
 
 
 
 
23
  ## Inference
24
  Load with the AGILLM nB300 code (`infer --mode ar|sat`); the tokenizer round-trips from the
25
  embedded `tokenizer_json`.
 
 
 
20
  Checkpoint dict keys: `core` (backbone), `ar`, `sat` (heads), `cfg`, embedded
21
  `tokenizer_json`, plus `disagg_updates` (merge provenance) on the distributed master.
22
 
23
+
24
+ ## Code
25
+
26
+ - `agillm35.py` - single-file AGILLM3.5 runtime for training/status/inference.
27
+ - `distributed/public_join/` - public signed-lease host and outbound worker scripts for untrusted joiners.
28
+ - `distributed/inference/agillm35_distributed_infer.py` - phase-1 distributed AR inference harness for transformer/MoE/DiffusionBlock layer stages.
29
+
30
  ## Inference
31
  Load with the AGILLM nB300 code (`infer --mode ar|sat`); the tokenizer round-trips from the
32
  embedded `tokenizer_json`.
33
+ Distributed AR inference can split contiguous transformer/DiffusionBlock layer ranges across local and HTTP worker stages. The network payload path uses a raw tensor wire format rather than unpickling remote worker responses; use TLS and a bearer token outside localhost.
34
+