| --- |
| language: |
| - en |
| license: apache-2.0 |
| tags: |
| - recursive-language-model |
| - multimodal |
| - self-automated |
| - pytorch |
| - safetensors |
| - ancient-ai |
| model_type: ancient_ai |
| pipeline_tag: text-generation |
| --- |
| |
| # Ancient.AI.V β Recursive Language Model |
|
|
| **Architecture:** Recursive Language Model (RLM) |
| **Not a Large Language Model** β a fundamentally different architecture built from scratch. |
|
|
| | Property | Value | |
| |---|---| |
| | Parameters | 1.147B | |
| | Context Window | 64,000 tokens | |
| | Layers | 24 | |
| | Hidden Size | 2,048 | |
| | Attention Heads | 16 (GQA, 8 KV heads) | |
| | FFN Dimension | 8,192 | |
| | Vocab Size | 64,000 | |
| | Activation | SwiGLU | |
| | Position Encoding | YaRN-extended RoPE (base 500k, scale 8Γ) | |
| | Weight Format | safetensors | |
| | Precision | bfloat16 (fine-tune target) | |
|
|
| --- |
|
|
| ## What Makes It Different From an LLM |
|
|
| Standard LLMs run one forward pass: input β output. |
|
|
| Ancient.AI.V runs a **Recursive Outer Loop**: the model refines its own |
| output `recursion_depth` times per call, with a learned halting gate that |
| stops early when confident. This is the core of the Recursive Language Model paradigm. |
|
|
| --- |
|
|
| ## Integrated Self-Automated (SA) Modules |
|
|
| All 17 SA modules operate **simultaneously** within each decoder layer as |
| parallel residual paths β not sequential post-processing steps. |
|
|
| | Module | Implementation | |
| |---|---| |
| | SA Meta-Learning | Per-sample fast-weight delta generation (learned MAML inner loop) | |
| | SA Reinforcement Learning | Per-token value estimation + policy gate (actor-critic in forward pass) | |
| | SA Continual Learning | EWC-inspired importance weighting from initial representations | |
| | SA Adaptive Learning | Learned depth-gating; tokens can exit processing early | |
| | SA Rewriting | Cross-attention from current β earlier hidden states (in-context revision) | |
| | SA NLP | Bigram/trigram convolutions + semantic role projection | |
| | SA Problem Solving | Multi-step latent chain-of-thought scratchpad (3 internal steps) | |
| | SA Innovation | Novelty-promoting repulsion in embedding space | |
| | SA Debugging | Anomaly detection + learned correction on hidden state norms | |
| | SA Long/Short-Term Memory | 512 persistent learnable memory slots with read/write gating | |
| | SA Recursive Seed Learning | Compress β refine β expand self-representation cycle | |
| | SA Self-Evaluation & Reward | Per-token reward MLP; plugs directly into PPO/GRPO fine-tuning | |
| | SA Goal & Constraint Engine | Learned goal embedding cross-attends to steer generation | |
| | SA Memory Consolidation | Bidirectional GRU trace encoder with hippocampal replay | |
| | SA Introspection Interface | Uncertainty + confidence mapping over hidden states | |
| | SA Recursive Outer Loop | Post-stack self-refinement with learned halting | |
| | SA Conversational Intelligence | Dialogue state tracker (turn, topic shift, emotion, formality) | |
|
|
| --- |
|
|
| ## Multimodal Support |
|
|
| Native encoders for all four modalities, fused before the decoder stack: |
|
|
| - **Text** β BPE tokenizer, 64k vocab |
| - **Image** β ViT-style patch encoder (16Γ16 patches, up to 224Γ224) |
| - **Audio** β Whisper-style mel-spectrogram encoder (80 mel bins) |
| - **Video** β Frame-by-frame ViT + temporal self-attention |
|
|
| --- |
|
|
| ## Training / Fine-Tuning |
|
|
| This checkpoint contains **randomly initialized weights** β it is an |
| architecture shell ready for fine-tuning. |
|
|
| Recommended fine-tuning approaches: |
| - **SFT** (Supervised Fine-Tuning) with causal LM loss |
| - **RLHF/PPO** β plug training reward into the `SASelfEvaluation` reward head |
| - **GRPO** β the `sa_eval` reward signal is already shaped for group-relative optimization |
| - **LoRA / QLoRA** β compatible with standard PEFT adapters |
|
|
| Training the self-reward head jointly with SFT gives Ancient.AI.V |
| self-improvement capability without a separate reward model. |
|
|
| --- |
|
|
| ## Usage |
|
|
| ```python |
| # AutoTokenizer available after fine-tuning with a trained tokenizer |
| from ancient_ai import AncientConfig, AncientAIV # after registering custom class |
| import torch |
| |
| cfg = AncientConfig() |
| model = AncientAIV(cfg) |
| # Load weights: |
| # model = AncientAIV.from_pretrained("GODsStrongestSoldier/Ancient.AI.V") |
| |
| tokenizer = AutoTokenizer.from_pretrained("GODsStrongestSoldier/Ancient.AI.V") |
| input_ids = tokenizer("Hello Ancient.AI", return_tensors="pt").input_ids |
| |
| generated = model.generate_text(input_ids, max_new=200, temperature=0.8) |
| print(tokenizer.decode(generated[0])) |
| ``` |
|
|
| --- |
|
|
| ## Architecture Citation |
|
|
| ``` |
| Ancient.AI.V β Recursive Language Model (RLM) |
| Author: GODsStrongestSoldier |
| Year: 2025 |
| Architecture: Custom RLM with 17 integrated SA modules |
| Repo: https://huggingface.co/GODsStrongestSoldier/Ancient.AI.V |
| ``` |
|
|
| --- |
|
|
| ## License |
|
|
| Apache 2.0 β free for research and commercial fine-tuning. |
|
|