oscarz511 commited on
Commit
04da9c6
·
verified ·
1 Parent(s): 185ca01

Final Merged Upload

Browse files
Files changed (2) hide show
  1. README.md +7 -16
  2. model.safetensors +3 -0
README.md CHANGED
@@ -7,31 +7,22 @@ tags:
7
  - nanosota
8
  - reasoning
9
  ---
10
-
11
  # NanoSOTA-v2-General
12
-
13
- **Description:** Final version. An 8-Expert MoE balancing Parallel Logic, Sequential Logic, and Algebra (Bat & Ball).
14
 
15
  ## How to Load (Required)
16
-
17
- This model uses a custom 8-Expert MoE architecture.
18
- You **must** use the provided loader.
19
 
20
  ```python
21
  from nanosota_moe import load_nanosota
22
-
23
  model, tokenizer = load_nanosota("oscarz511/NanoSOTA-v2-General")
24
 
25
  prompt = "If it takes 3 hours to dry 3 shirts, how long for 30 shirts?"
26
-
27
- inputs = tokenizer.apply_chat_template(
28
- [
29
- {"role": "system", "content": "You are NanoSOTA. Think step-by-step."},
30
- {"role": "user", "content": prompt},
31
- ],
32
- return_tensors="pt",
33
- add_generation_prompt=True,
34
- ).to("cuda")
35
 
36
  out = model.generate(**inputs, max_new_tokens=256)
37
  print(tokenizer.decode(out[0]))
 
 
7
  - nanosota
8
  - reasoning
9
  ---
 
10
  # NanoSOTA-v2-General
11
+ **Description:** Final Generalist (Logic + Algebra).
 
12
 
13
  ## How to Load (Required)
14
+ This model uses a custom 8-Expert MoE architecture. You **must** use the provided loader script.
 
 
15
 
16
  ```python
17
  from nanosota_moe import load_nanosota
 
18
  model, tokenizer = load_nanosota("oscarz511/NanoSOTA-v2-General")
19
 
20
  prompt = "If it takes 3 hours to dry 3 shirts, how long for 30 shirts?"
21
+ inputs = tokenizer.apply_chat_template([
22
+ {"role": "system", "content": "You are NanoSOTA. Think step-by-step."},
23
+ {"role": "user", "content": prompt}
24
+ ], return_tensors="pt", add_generation_prompt=True).to("cuda")
 
 
 
 
 
25
 
26
  out = model.generate(**inputs, max_new_tokens=256)
27
  print(tokenizer.decode(out[0]))
28
+ ```
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ec8eb4dcdafe7851ddc59e6db6b891ef77f72521a1be53c29a6a51669c8db82
3
+ size 5381516616