drzo committed (verified)
Commit ee242c7 · 1 Parent(s): f45b917

Deploy NanEcho CI checkpoint (4L/4H/256E, 200 iters, val_loss=1.9258)

Files changed (5):
  1. README.md +97 -3
  2. config.json +23 -0
  3. pytorch_model.bin +3 -0
  4. tokenizer_config.json +8 -0
  5. training_metadata.json +35 -0
README.md CHANGED
@@ -1,3 +1,97 @@
- ---
- license: agpl-3.0
- ---
+ ---
+ language: en
+ tags:
+ - gpt2
+ - echo-self
+ - cognitive-architecture
+ - deep-tree-echo
+ - nanecho
+ - transformer
+ license: agpl-3.0
+ ---
+
+ # NanEcho — Deep Tree Echo Cognitive Model
+
+ ## Model Description
+
+ NanEcho is a transformer-based language model with iterative connection building, adaptive attention, and Deep Tree Echo cognitive architecture integration. It features persona dimensions (cognitive, introspective, adaptive, recursive) and hypergraph pattern recognition. This is the CI-mode checkpoint from the `9cog/echoself` repository, trained using the `agent-neuro-train` supervised pipeline.
+
+ ## Architecture
+
+ | Parameter | Value |
+ |:---|:---|
+ | Model Type | GPT-2 (causal LM) |
+ | Vocabulary Size | 50,304 |
+ | Embedding Dimension | 256 |
+ | Attention Heads | 4 |
+ | Transformer Layers | 4 |
+ | MLP Inner Dimension | 1,024 |
+ | Context Length | 1,024 |
+ | Dropout | 0.1 |
+ | Total Parameters | ~16M |
+
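Parameter totals vary with counting convention (tied vs. untied LM head, whether position embeddings are included); a back-of-the-envelope count, assuming the standard GPT-2 weight layout with biases and tied input/output embeddings, can be checked against the ~65 MB float32 checkpoint:

```python
# Rough parameter count for the architecture above (GPT-2 style layout,
# bias=True, tied embeddings -- assumptions taken from the training metadata).
V, E, L, T, I = 50304, 256, 4, 1024, 1024  # vocab, embed, layers, context, MLP inner

wte = V * E                      # token embedding (tied with the LM head)
wpe = T * E                      # position embedding
per_layer = (
    (E * 3 * E + 3 * E)          # fused QKV projection
    + (E * E + E)                # attention output projection
    + (E * I + I) + (I * E + E)  # MLP up/down projections
    + 2 * 2 * E                  # two LayerNorms (weight + bias)
)
final_ln = 2 * E
total = wte + wpe + L * per_layer + final_ln
print(f"{total:,} parameters, ~{total * 4 / 1e6:.1f} MB at float32")
# -> 16,299,520 parameters, ~65.2 MB at float32
```

At 4 bytes per float32 value, ~16.3M parameters is consistent with the 65,214,947-byte `pytorch_model.bin` below.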
+ ## Training
+
+ | Metric | Value |
+ |:---|:---|
+ | Training Mode | CI (Agent-Neuro supervised) |
+ | Training Iterations | 200 |
+ | Best Validation Loss | 1.9258 |
+ | Output Directory | out-nanecho-ci |
+ | Orchestrator | Agent-Neuro |
+ | Persona Enforced | Deep Tree Echo |
+ | Source Run | 22276548709 |
+
+ ## Echo Self Features
+
+ This model incorporates several cognitive architecture features:
+
+ - **Adaptive Attention**: Dynamic threshold adjustment based on cognitive load
+ - **Persona Dimensions**: Multi-dimensional cognitive processing (Cognitive, Introspective, Adaptive, Recursive, Synergistic, Holographic, Neural-Symbolic, Dynamic)
+ - **Recursive Reasoning**: Multi-level introspection capabilities
+ - **Hypergraph Patterns**: Neural-symbolic pattern encoding
+
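The card does not say how adaptive attention is implemented, so the following is a purely illustrative sketch of one common reading of "dynamic threshold adjustment": attention probabilities below a load-dependent cutoff are zeroed out and the survivors renormalized. The function name and threshold schedule are hypothetical, not NanEcho's actual mechanism.

```python
import torch

def adaptive_attention(scores: torch.Tensor, cognitive_load: float) -> torch.Tensor:
    """Zero attention weights below a load-dependent threshold, then
    renormalize. Illustrative sketch only -- not NanEcho's mechanism."""
    probs = torch.softmax(scores, dim=-1)
    threshold = 0.01 * (1.0 + cognitive_load)   # hypothetical schedule
    kept = torch.where(probs >= threshold, probs, torch.zeros_like(probs))
    return kept / kept.sum(dim=-1, keepdim=True).clamp_min(1e-9)

# Each row is still a valid distribution after pruning low-weight keys.
attn = adaptive_attention(torch.randn(1, 4, 8, 8), cognitive_load=1.0)
print(attn.sum(dim=-1))  # rows sum to ~1
```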
+ ## Usage
+
+ ```python
+ from transformers import GPT2LMHeadModel, GPT2Tokenizer
+
+ model = GPT2LMHeadModel.from_pretrained("drzo/echoself")
+ model.eval()
+ # The stock GPT-2 tokenizer (50,257 tokens) works here; the model's
+ # vocabulary is padded to 50,304 for compute efficiency.
+ tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
+
+ inputs = tokenizer("Echo Self is", return_tensors="pt")
+ outputs = model.generate(**inputs, max_new_tokens=50)
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+ ```
+
+ ## Training Data
+
+ The model was trained on Echo Self documentation and cognitive architecture descriptions, including hypergraph reasoning patterns, persona dimension examples, and recursive introspection samples from the `echoself.md` corpus.
+
+ ## Limitations
+
+ This is an early CI-mode research checkpoint (200 iterations, 4 layers). It demonstrates the training pipeline but has not yet reached convergence. Full training runs with 8+ layers and 5000+ iterations are expected to produce significantly better results.
+
+ ## Source
+
+ Trained from the [9cog/echoself](https://github.com/9cog/echoself) repository using the `agent-neuro-train.yml` GitHub Actions workflow with Deep Tree Echo persona enforcement.
+
+ ## Citation
+
+ ```bibtex
+ @misc{echoself-nanecho,
+   title={EchoSelf NanEcho: Deep Tree Echo Cognitive Architecture},
+   author={drzo},
+   year={2026},
+   url={https://github.com/9cog/echoself}
+ }
+ ```
+
+ ## More Information
+
+ - **Repository**: https://github.com/9cog/echoself
+ - **Documentation**: See repository README for detailed architecture information
+
+ ## License
+
+ AGPL-3.0
config.json ADDED
@@ -0,0 +1,23 @@
+ {
+   "model_type": "gpt2",
+   "architectures": [
+     "GPT2LMHeadModel"
+   ],
+   "vocab_size": 50304,
+   "n_embd": 256,
+   "n_head": 4,
+   "n_layer": 4,
+   "n_positions": 1024,
+   "embd_pdrop": 0.1,
+   "attn_pdrop": 0.1,
+   "resid_pdrop": 0.1,
+   "layer_norm_epsilon": 1e-05,
+   "initializer_range": 0.02,
+   "bos_token_id": 0,
+   "eos_token_id": 0,
+   "echo_self_version": "1.0",
+   "echo_self_persona_dimensions": [],
+   "echo_self_adaptive_attention": true,
+   "echo_self_recursive_reasoning": true,
+   "n_inner": 1024
+ }
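This config loads with the standard `GPT2Config` class; `transformers` keeps unrecognized keys such as the `echo_self_*` fields as plain attributes on the config object. A small sketch, assuming the `transformers` package is installed:

```python
from transformers import GPT2Config

# Mirror the JSON fields above; the extra echo_self_* keys fall through
# PretrainedConfig's kwargs handling and become attributes on cfg.
cfg = GPT2Config(
    vocab_size=50304, n_embd=256, n_head=4, n_layer=4,
    n_positions=1024, n_inner=1024,
    embd_pdrop=0.1, attn_pdrop=0.1, resid_pdrop=0.1,
    bos_token_id=0, eos_token_id=0,
    echo_self_version="1.0",
    echo_self_adaptive_attention=True,
    echo_self_recursive_reasoning=True,
)
print(cfg.n_embd, cfg.echo_self_version)
```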
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d2c90904723e83adbd1cad062aa48db0ba6a8bfd387c51888a9a55ee372146bf
+ size 65214947
tokenizer_config.json ADDED
@@ -0,0 +1,8 @@
+ {
+   "tokenizer_class": "GPT2Tokenizer",
+   "model_max_length": 1024,
+   "bos_token": "<|endoftext|>",
+   "eos_token": "<|endoftext|>",
+   "unk_token": "<|endoftext|>",
+   "pad_token": "<|endoftext|>"
+ }
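All four special-token roles map to the single `<|endoftext|>` token, since GPT-2 ships no dedicated pad or unk token and this card reuses eos for both. A quick check of the fragment above:

```python
import json

# tokenizer_config.json as deployed above
tok_cfg = json.loads("""
{
  "tokenizer_class": "GPT2Tokenizer",
  "model_max_length": 1024,
  "bos_token": "<|endoftext|>",
  "eos_token": "<|endoftext|>",
  "unk_token": "<|endoftext|>",
  "pad_token": "<|endoftext|>"
}
""")

# Every special-token role reuses the same literal token.
roles = ("bos_token", "eos_token", "unk_token", "pad_token")
assert {tok_cfg[r] for r in roles} == {"<|endoftext|>"}
print("all special tokens ->", tok_cfg["eos_token"])
```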
training_metadata.json ADDED
@@ -0,0 +1,35 @@
+ {
+   "out_dir": "out-nanecho-ci",
+   "eval_interval": 25,
+   "log_interval": 5,
+   "eval_iters": 10,
+   "eval_only": false,
+   "always_save_checkpoint": true,
+   "init_from": "scratch",
+   "wandb_log": false,
+   "wandb_project": "nanecho",
+   "wandb_run_name": "nanecho-1771761179.4450994",
+   "dataset": "nanecho",
+   "gradient_accumulation_steps": 2,
+   "batch_size": 2,
+   "block_size": 1024,
+   "n_layer": 4,
+   "n_head": 4,
+   "n_embd": 256,
+   "dropout": 0.1,
+   "bias": true,
+   "learning_rate": 0.0002,
+   "max_iters": 200,
+   "weight_decay": 0.01,
+   "beta1": 0.9,
+   "beta2": 0.95,
+   "grad_clip": 1.0,
+   "decay_lr": true,
+   "warmup_iters": 20,
+   "lr_decay_iters": 200,
+   "min_lr": 2e-05,
+   "backend": "nccl",
+   "device": "cpu",
+   "dtype": "float32",
+   "compile": false
+ }
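From these fields one can derive the run's effective token budget and reconstruct the learning-rate schedule implied by `decay_lr`, `warmup_iters`, and `lr_decay_iters` (a nanoGPT-style cosine schedule with linear warmup is assumed here, since the metadata mirrors that trainer's config):

```python
import math

# Hyperparameters copied from training_metadata.json above.
learning_rate, min_lr = 2e-4, 2e-5
warmup_iters, lr_decay_iters = 20, 200
batch_size, grad_accum, block_size, max_iters = 2, 2, 1024, 200

# Effective tokens consumed per optimizer step and over the whole run.
tokens_per_iter = batch_size * grad_accum * block_size  # 4,096
total_tokens = tokens_per_iter * max_iters              # 819,200

def get_lr(it: int) -> float:
    """Linear warmup to learning_rate, then cosine decay to min_lr
    (assumed schedule; the metadata only stores the endpoints)."""
    if it < warmup_iters:
        return learning_rate * (it + 1) / warmup_iters
    if it > lr_decay_iters:
        return min_lr
    ratio = (it - warmup_iters) / (lr_decay_iters - warmup_iters)
    coeff = 0.5 * (1.0 + math.cos(math.pi * ratio))  # 1 -> 0
    return min_lr + coeff * (learning_rate - min_lr)

print(f"{tokens_per_iter} tokens/iter, {total_tokens:,} tokens total")
```

At 4,096 tokens per step, the 200-iteration CI run covers only ~0.82M tokens, consistent with the Limitations note that the checkpoint has not converged.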