Update README.md
README.md
CHANGED
@@ -120,6 +120,29 @@ MiniAxion1 is not intended to compete with large-scale models. Instead, it is bu
 
 ---
 
+### Quick start
+
+```python
+
+import torch
+from model import NRMModel
+from tokenizer import Tokenizer
+
+# load
+model = NRMModel.from_config("config.json")
+model.load_state_dict(torch.load("model.pt"))
+model.eval()
+
+tokenizer = Tokenizer.load("tokenizer.json")
+
+def generate(prompt):
+    tokens = tokenizer.encode(prompt)
+    output = model.generate(tokens)
+    return tokenizer.decode(output)
+
+print(generate("<INST>What is 2 + 2?</INST>"))
+```
+
 ## 🧠 Philosophy
 
 MiniAxion1 explores a key question:
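
The Quick start added in this change passes a prompt wrapped in `<INST>` and `</INST>` tags. A minimal sketch of a helper that builds such prompts follows. It assumes, based only on that single example, that every instruction should be wrapped this way; `format_prompt`, `INST_OPEN`, and `INST_CLOSE` are hypothetical names that do not appear in the repository.

```python
# Hypothetical helper: the tag strings are copied from the Quick start example,
# but whether the model requires them for every prompt is an assumption.
INST_OPEN = "<INST>"
INST_CLOSE = "</INST>"


def format_prompt(instruction: str) -> str:
    """Wrap an instruction in the tags used by the Quick start example."""
    text = instruction.strip()
    if not text:
        raise ValueError("instruction must not be empty")
    return f"{INST_OPEN}{text}{INST_CLOSE}"


if __name__ == "__main__":
    print(format_prompt("What is 2 + 2?"))
```

With a helper like this, the Quick start call could be written as `generate(format_prompt("What is 2 + 2?"))`, which keeps the tag strings in one place.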