Upload folder using huggingface_hub

Files changed (4)

README.md +39 -0
agent_heads.bin +3 -0
config.json +18 -0
model.safetensors +3 -0

README.md ADDED

	@@ -0,0 +1,39 @@

+---
+license: mit
+library_name: pytorch
+tags: [tool-calling, agent, tiny-llm, byte-level, on-device, from-scratch]
+pipeline_tag: text-generation
+---
+# ultra-tiny-1m — LocalAgent (0.98M params)
+A **from-scratch, byte-level** tool-calling agent model from
+[LocalAgent](https://github.com/sangbumchoi/localagent). Pure PyTorch, **0.98M params**,
+trained on CPU. It pairs a tiny decoder (GQA + RoPE + SwiGLU + depth-recurrence) with a **dual head**
+(tool-selection classifier + pointer/copy argument head) and **prompt-grounded constrained
+decoding** for reliable tool calls across 21 tools (general assistant, the Claude Code /
+Codex coding surface, and computer-use / productivity tools), including parallel two-call turns.
+## Architecture
+- vocab 256 (byte-level), d_model 192, 2 layers x 6 loops (depth recurrence), 6 query heads / 2 KV heads (GQA), ffn hidden 640
+- factorized embeddings: True (embed_dim 64)
+## Files
+- `config.json` — `ModelConfig`
+- `model.safetensors` / `pytorch_model.bin` — decoder weights
+- `agent_heads.bin` — trained tool-selection + pointer heads (optional)
+## Load (pure PyTorch, no transformers)
+```python
+import json
+import torch
+from huggingface_hub import hf_hub_download
+from safetensors.torch import load_file
+from localagent.model import LocalAgentLM, ModelConfig
+repo_id = "danelcsb/localagent-ultra-tiny-1m"
+# Read config.json and keep only the keys that are ModelConfig dataclass fields.
+with open(hf_hub_download(repo_id, "config.json")) as f:
+    cfg_d = json.load(f)
+cfg = ModelConfig(**{k: v for k, v in cfg_d.items() if k in ModelConfig.__dataclass_fields__})
+model = LocalAgentLM(cfg)
+# Load the decoder weights, then switch to inference mode.
+model.load_state_dict(load_file(hf_hub_download(repo_id, "model.safetensors")))
+model.eval()
+```
+See the LocalAgent repo for the grounded decoder / agent runtime.
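
The README describes the model as byte-level with a vocabulary of 256. A minimal sketch of what that usually implies for tokenization is below. It assumes each UTF-8 byte maps directly to one token id; the helper names `encode_bytes` and `decode_bytes` are hypothetical, and the actual LocalAgent tokenizer may treat some byte values specially.

```python
def encode_bytes(text: str) -> list[int]:
    """Map text to token ids in [0, 255], one id per UTF-8 byte (assumed scheme)."""
    return list(text.encode("utf-8"))


def decode_bytes(ids: list[int]) -> str:
    """Inverse of encode_bytes; invalid byte sequences are replaced rather than raised."""
    return bytes(ids).decode("utf-8", errors="replace")


ids = encode_bytes("call get_weather")
# Every id fits the 256-entry vocabulary, so no tokenizer file is needed.
assert all(0 <= i < 256 for i in ids)
```

Under this scheme, non-ASCII characters take more than one token, so sequence length counts bytes rather than characters.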

agent_heads.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fcbe81933ca0de9c0c543a31d14c631b0f07129ccfb8046e04e0a94a50a0a66f
+size 327813

config.json ADDED

	@@ -0,0 +1,18 @@

+{
+  "model_type": "localagent",
+  "architecture": "LocalAgentLM (byte-level GQA+RoPE+SwiGLU)",
+  "name": "ultra-tiny-1m",
+  "vocab_size": 256,
+  "d_model": 192,
+  "embed_dim": 64,
+  "n_layers": 2,
+  "n_loops": 6,
+  "n_heads": 6,
+  "n_kv_heads": 2,
+  "ffn_hidden": 640,
+  "max_seq_len": 1024,
+  "rope_theta": 10000.0,
+  "norm_eps": 1e-05,
+  "tie_embeddings": true,
+  "dropout": 0.0
+}
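
The attention and depth settings in this config imply a few derived sizes, sketched below. The sketch assumes the common conventions `head_dim = d_model / n_heads` and "effective depth = `n_layers` times `n_loops`" (reading `n_loops` as depth recurrence over the same layers). The function `derived_shapes` is hypothetical and not part of LocalAgent.

```python
import json

# Subset of the config.json values shown above.
CONFIG_JSON = """
{"d_model": 192, "n_layers": 2, "n_loops": 6, "n_heads": 6, "n_kv_heads": 2}
"""


def derived_shapes(cfg: dict) -> dict:
    """Compute sizes implied by the config under the assumptions stated in the lead-in."""
    if cfg["d_model"] % cfg["n_heads"] != 0:
        raise ValueError("d_model must be divisible by n_heads")
    if cfg["n_heads"] % cfg["n_kv_heads"] != 0:
        raise ValueError("n_heads must be divisible by n_kv_heads")
    head_dim = cfg["d_model"] // cfg["n_heads"]
    return {
        "head_dim": head_dim,
        # GQA: this many query heads share each key/value head.
        "queries_per_kv_head": cfg["n_heads"] // cfg["n_kv_heads"],
        # Width of the key (or value) projection output.
        "kv_dim": cfg["n_kv_heads"] * head_dim,
        # Layer applications per forward pass if the layer stack is looped.
        "effective_depth": cfg["n_layers"] * cfg["n_loops"],
    }


shapes = derived_shapes(json.loads(CONFIG_JSON))
```

With these values, the key and value projections are a third of the width of the query projection, and looping reuses the same two layers' weights instead of adding parameters.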

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:042e13cf2f5e878b0cdd05840f20fffeefee0d09794ad7663d6d91196c7e2add
+size 3909888
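
Both binary files in this commit are stored as Git LFS pointers, which record only a SHA-256 digest and a byte size. A downloaded file can be checked against its pointer as sketched below, using only the standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is written without the diff's leading `+`.

```python
import hashlib
from pathlib import Path

# Pointer contents for model.safetensors, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:042e13cf2f5e878b0cdd05840f20fffeefee0d09794ad7663d6d91196c7e2add
size 3909888
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into hash algorithm, hex digest, and size in bytes."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so large files are not read into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
```

A typical use would be `matches_pointer(local_path, pointer)` on the path returned by `hf_hub_download`. As a rough cross-check, 3909888 bytes divided by 4 is about 977k values, which is consistent with the README's 0.98M parameters if the weights are stored as float32; that dtype is an assumption, and the file also contains a small header.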