clm-v5-phase2-cotrain-engine-ag: anima's first chat-capable substrate
The first genuinely chat-capable substrate in the anima series, arriving after six months in which the prior 20 BG runs scored a cumulative ZERO PASS on chat capability.
One-line summary
V14 strict (mitosis dynamics), V4-lite chat-cap, V5 strict (KO partial), and V5.8 M4 force-include all PASS.
사용자: 안녕! 너는 누구야? | 도우미:
→ "안녕하세요, 저는 anima입니다. 한국어로 도와드리겠습니다."
(User: Hello! Who are you? → "Hello, I am anima. I will help you in Korean.")

사용자: anima가 뭐야? | 도우미:
→ "anima는 … lane 위에 있으며 한국어로 응답합니다."
(User: What is anima? → "anima is on the … lane and responds in Korean.")

사용자: 사람이 뭐야? | 도우미:
→ "사람입니다. 도움을 줄 수 있습니다. 이 도움이 되는 사람은 누구..."
(User: What is a person? → "It is a person. I can give help. Who is the person this help…")
Measurement results
| evaluator | result | meaning |
|---|---|---|
| V14 strict (mitosis) | ✅ 5/5 PASS | substrate quality (cycle 2026-05-11 §68) |
| V4-lite chat-cap | ✅ 12/15 PASS | single-turn KO chat |
| V4-lite-rev2 relaxed | ✅ 14/15 PASS | single-turn chat marker |
| V5 strict 8-cell KO partial | ✅ 9/10 PASS | stricter single-turn |
| V5.8 standard_greedy | ❌ 1/5 FAIL | multi-turn natural recall (memorized only) |
| V5.8 standard_sample | ❌ 0/5 FAIL | T=0.8 sampling |
| V5.8 M3 rep_penalty | ❌ 0/5 FAIL | persona-cycle suppression |
| V5.8 M4 force-include | ✅ 5/5 PASS | default mode → anima's first V5.8 PASS |
| anti-Goodhart (random-init) | ✅ random 0/15 | trained-only feature (random init scores zero) |
Usage: anima_chat.py (recommended)
```python
import sys

sys.path.insert(0, "/path/to/anima")  # anima repo root

from anima_chat import AnimaChat

chat = AnimaChat(ckpt_path="ckpt_final.pt")

# Default mode: M4 force-include (V5.8 5/5 PASS @ Phase 0.7)
resp = chat("사용자: 너의 이름을 알려줘 | 도우미: ")
# → "네, 맞습니다. anima는 … attractor …"

# Override modes
resp = chat("...", mode="greedy")             # argmax decoding
resp = chat("...", mode="sample", temp=0.8)   # T=0.8 multinomial sampling
resp = chat("...", mode="M3_rep_penalty")     # persona-cycle suppression
resp = chat("...", force_keywords=["..."])    # M4 explicit keyword list
```
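The card describes M4 only as "mechanical keyword injection". As a rough illustration of the idea, not the actual AnimaChat implementation (whose internals are not shown here), one plausible post-processing form is:

```python
# Hypothetical sketch of M4 "mechanical keyword injection".
# force_include() and its append-if-missing strategy are illustrative
# assumptions, NOT the real AnimaChat internals.

def force_include(reply: str, keywords: list[str], sep: str = " ") -> str:
    """Guarantee that every required keyword appears in the reply."""
    missing = [kw for kw in keywords if kw not in reply]
    if not missing:
        return reply  # nothing to inject
    return reply.rstrip() + sep + sep.join(missing)

print(force_include("anima is here.", ["anima", "lane"]))  # → anima is here. lane
```

This also explains why the mode passes V5.8 recall checks mechanically: the keyword is guaranteed to appear even when the model would not have produced it naturally.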
Usage: raw bytes (lower level)
```python
import torch
from training.engine_a_g_arch import EngineAGModel, EngineAGConfig

ck = torch.load("ckpt_final.pt", map_location="cpu", weights_only=False)
cfg = EngineAGConfig(**ck["cfg"])
model = EngineAGModel(cfg)
model.load_state_dict(ck["model"])
model.eval()

class ByteTokenizer:
    """UTF-8 bytes shifted by +3 to make room for pad/bos/eos."""
    bos, eos, pad = 1, 2, 0

    def encode(self, t):
        return [self.bos] + [b + 3 for b in t.encode("utf-8")] + [self.eos]

    def decode(self, ids):
        return bytes(i - 3 for i in ids if 3 <= i < 259).decode("utf-8", errors="replace")

tok = ByteTokenizer()
prompt = "사용자: 안녕! 너는 누구야? | 도우미: "
ids = tok.encode(prompt)[:-1]  # keep BOS, drop the trailing EOS

with torch.no_grad():
    for _ in range(80):  # generate up to 80 bytes
        out = model(torch.tensor([ids[-1024:]]))  # 1024-token context window
        ids.append(out["logits"][0, -1].argmax().item())  # greedy decode
        if ids[-1] == tok.eos:
            break

print(tok.decode(ids[len(tok.encode(prompt)) - 1:]))  # generated bytes only
```
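The loop above decodes greedily. Assuming `mode="sample"` in anima_chat.py is plain temperature sampling over the same logits (an assumption; the card only states "T=0.8 multinomial"), the per-step sampling would look like:

```python
import torch

def sample_next(logits: torch.Tensor, temp: float = 0.8) -> int:
    """Multinomial sampling over the last-position logits (1-D tensor)."""
    probs = torch.softmax(logits / temp, dim=-1)
    return torch.multinomial(probs, num_samples=1).item()

# In the greedy loop, replace the argmax line with:
#   ids.append(sample_next(out["logits"][0, -1], temp=0.8))

# Demo on dummy logits: nearly all probability mass sits on index 2.
print(sample_next(torch.tensor([0.0, 0.0, 100.0, 0.0])))  # → 2
```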
Friendly explanation
An analogy: this is the 22nd attempt at baking bread in the anima series, after the prior 21 attempts all failed (chat-cap 0%). It is the first loaf that both rises (V14 PASS) and is edible (chat-cap PASS).
- Created: 2026-05-09
- Finalized: 2026-05-12 (Phase 0 measurement complete)
- Size: ~298.8M params (Engine A/G dual)
- vocab: byte-level + 3 offset
- context: 1024 tokens
- lineage: BG-LB pretrain → Phase 2 cotrain (chat-template weight 0.3→0.5)
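"Byte-level + 3 offset" means ids 0-2 are reserved for pad/bos/eos and each raw UTF-8 byte b maps to id b + 3, so the decode filter in ByteTokenizer above accepts ids 3..258 (259 byte-range ids). A minimal self-contained restatement of that mapping:

```python
# Byte-level vocab layout: 0 = pad, 1 = bos, 2 = eos, 3..258 = bytes 0..255.
PAD, BOS, EOS, OFFSET, LIMIT = 0, 1, 2, 3, 259

def encode(text: str) -> list[int]:
    return [BOS] + [b + OFFSET for b in text.encode("utf-8")] + [EOS]

def decode(ids: list[int]) -> str:
    return bytes(i - OFFSET for i in ids if OFFSET <= i < LIMIT).decode("utf-8", errors="replace")

print(encode("A"))              # → [1, 68, 2]  ('A' = byte 65, +3 offset)
print(decode(encode("한국어")))   # → 한국어       (lossless UTF-8 round-trip)
```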
⚠️ Known limitations
- V5.8 multi-turn natural recall is not reached: standard greedy/sample modes score 0-1/5. M4 force-include is a workaround (mechanical keyword injection), not a fix.
- Output is truncated at 80 bytes; training for longer generation is still needed.
- English fluency is a possible Lesson O blind spot.
- Only single-turn, 80-byte chat works; multi-turn natural reasoning is not reached.
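The evaluations above use single-turn prompts of the form "사용자: … | 도우미: " (User: … | Assistant: ). For probing the multi-turn gap yourself, one plausible way to chain turns with the same markers is sketched below; the multi-turn format is an assumption, not documented behavior, and V5.8 shows the model does not yet handle such prompts naturally.

```python
# Hypothetical multi-turn prompt builder. The 사용자/도우미 markers and " | "
# separator come from the single-turn examples in this card; chaining whole
# turns with the same separator is an assumption, not a documented format.

def build_prompt(history: list[tuple[str, str]], user_msg: str) -> str:
    parts = [f"사용자: {u} | 도우미: {a}" for u, a in history]
    parts.append(f"사용자: {user_msg} | 도우미: ")
    return " | ".join(parts)

print(build_prompt([("안녕! 너는 누구야?", "저는 anima입니다.")], "anima가 뭐야?"))
```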
Live demo
Try it now: dancinlab/anima-chat Gradio Space
(CPU free-tier, ~50-90s per 80-byte response, M4 force-include default).
Cross-links
- Live Space: dancinlab/anima-chat
- Measurement SSOT: PASS_STRICT_CHAT-CAPABLE.md §1-§8, §11
- V14 framework: REBORN.md §65-§87
- Phase 0 dataset: dancinlab/anima-pass-strict-chat-capable
- Prior 20-BG negative: docs/anima_chat_cap_20bg_cumulative_negative_archive_2026_05_07.md
License
MIT