oscarz511/NanoSOTA-v2-General

Mixture of Experts

custom-architecture


oscarz511 committed on Dec 19, 2025

Commit 04da9c6 · verified · 1 Parent(s): 185ca01

Final Merged Upload

Files changed (2)

README.md +7 -16
model.safetensors +3 -0

README.md CHANGED

@@ -7,31 +7,22 @@ tags:
 - nanosota
 - reasoning
 ---
 # NanoSOTA-v2-General
-**Description:** Final version. An 8-Expert MoE balancing Parallel Logic, Sequential Logic, and Algebra (Bat & Ball).
 ## How to Load (Required)
-This model uses a custom 8-Expert MoE architecture.
-You **must** use the provided loader.
 ```python
 from nanosota_moe import load_nanosota
 model, tokenizer = load_nanosota("oscarz511/NanoSOTA-v2-General")
 prompt = "If it takes 3 hours to dry 3 shirts, how long for 30 shirts?"
-inputs = tokenizer.apply_chat_template(
-    [
-        {"role": "system", "content": "You are NanoSOTA. Think step-by-step."},
-        {"role": "user", "content": prompt},
-    ],
-    return_tensors="pt",
-    add_generation_prompt=True,
-).to("cuda")
 out = model.generate(**inputs, max_new_tokens=256)
 print(tokenizer.decode(out[0]))

 - nanosota
 - reasoning
 ---
 # NanoSOTA-v2-General
+**Description:** Final Generalist (Logic + Algebra).
 ## How to Load (Required)
+This model uses a custom 8-Expert MoE architecture. You **must** use the provided loader script.
 ```python
 from nanosota_moe import load_nanosota
 model, tokenizer = load_nanosota("oscarz511/NanoSOTA-v2-General")
 prompt = "If it takes 3 hours to dry 3 shirts, how long for 30 shirts?"
+inputs = tokenizer.apply_chat_template([
+    {"role": "system", "content": "You are NanoSOTA. Think step-by-step."},
+    {"role": "user", "content": prompt}
+], return_tensors="pt", add_generation_prompt=True).to("cuda")
 out = model.generate(**inputs, max_new_tokens=256)
 print(tokenizer.decode(out[0]))
+```
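
A note on the README snippet above: it unpacks `inputs` with `model.generate(**inputs, ...)`, which only works when `apply_chat_template` returns a mapping. Depending on the installed `transformers` version and the `return_dict` argument, the call may instead return a bare tensor of token ids. The sketch below is one way to handle both cases. `as_generate_kwargs` and `build_messages` are hypothetical helpers, not part of `nanosota_moe`, and the tokenizer returned by `load_nanosota` is assumed to behave like a standard Hugging Face tokenizer.

```python
from collections.abc import Mapping
from typing import Any


def build_messages(prompt: str) -> list[dict[str, str]]:
    """Build chat messages in the same shape as the README snippet."""
    return [
        {"role": "system", "content": "You are NanoSOTA. Think step-by-step."},
        {"role": "user", "content": prompt},
    ]


def as_generate_kwargs(encoded: Any) -> dict[str, Any]:
    """Hypothetical helper: normalize chat-template output for generate(**kwargs).

    A mapping (for example a BatchEncoding) is passed through as a plain dict.
    Anything else is assumed to be a tensor of token ids and is wrapped
    under the "input_ids" key.
    """
    if isinstance(encoded, Mapping):
        return dict(encoded)
    return {"input_ids": encoded}
```

With these helpers, the generation call from the README could be written as `model.generate(**as_generate_kwargs(inputs), max_new_tokens=256)`, where `inputs` is whatever `tokenizer.apply_chat_template(build_messages(prompt), return_tensors="pt", add_generation_prompt=True)` returned.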

model.safetensors ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3ec8eb4dcdafe7851ddc59e6db6b891ef77f72521a1be53c29a6a51669c8db82
+size 5381516616
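
The three added lines are a Git LFS pointer, not the weights themselves: they record the spec version, the SHA-256 of the real file, and its size in bytes (5381516616, roughly 5.4 GB). A minimal sketch of reading such a pointer and checking a downloaded file against it follows. The function names are illustrative assumptions, and the local file path passed to `matches_pointer` is whatever location the weights were downloaded to.

```python
import hashlib
from pathlib import Path

# Pointer text copied from the diff above (without the leading "+").
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3ec8eb4dcdafe7851ddc59e6db6b891ef77f72521a1be53c29a6a51669c8db82
size 5381516616
"""


def parse_lfs_pointer(text: str) -> dict[str, str]:
    """Split each 'key value' line of an LFS pointer into a dict entry."""
    fields: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: str, pointer: dict[str, str]) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    algo, _, expected = pointer["oid"].partition(":")
    if Path(path).stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.new(algo)
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so a multi-gigabyte file is never held in memory.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

Checking the size first is a cheap early exit: hashing several gigabytes is only worth doing once the byte count already matches.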