Update README.md
README.md
CHANGED
@@ -134,6 +134,7 @@ input_ids = tokenizer("The capital of France is", return_tensors="pt").input_ids
 output = model.generate(input_ids.to(model.device), max_new_tokens=32)
 print(tokenizer.decode(output[0]))
 ```
+Note that `quartet_2_impl="quartet2"` only supports inputs with `(micro_batch_size * seq_length) % 128 == 0`.
 
 ### Running Evaluations
 
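
The constraint in the added note can be checked before inputs reach the model. The following is a minimal sketch, assuming `micro_batch_size` and `seq_length` are the two dimensions of the input batch. The helper names `is_quartet2_compatible` and `padded_seq_length` are hypothetical and are not part of the library.

```python
import math

# Taken from the note: (micro_batch_size * seq_length) % 128 == 0
QUARTET2_TOKEN_MULTIPLE = 128


def is_quartet2_compatible(micro_batch_size: int, seq_length: int) -> bool:
    """Return True if the total token count is a multiple of 128.

    Hypothetical helper, not part of the library.
    """
    return (micro_batch_size * seq_length) % QUARTET2_TOKEN_MULTIPLE == 0


def padded_seq_length(micro_batch_size: int, seq_length: int) -> int:
    """Return the smallest length >= seq_length that satisfies the constraint.

    Hypothetical helper, not part of the library.
    """
    # micro_batch_size * L is divisible by 128 exactly when L is a
    # multiple of 128 / gcd(128, micro_batch_size).
    step = QUARTET2_TOKEN_MULTIPLE // math.gcd(QUARTET2_TOKEN_MULTIPLE, micro_batch_size)
    # Round seq_length up to the next multiple of step (ceiling division).
    return -(-seq_length // step) * step


if __name__ == "__main__":
    print(is_quartet2_compatible(1, 6))   # a single short prompt
    print(padded_seq_length(1, 6))        # length to pad to for batch size 1
    print(padded_seq_length(4, 6))        # a larger batch needs less padding
```

With a batch of one sequence, the condition reduces to the sequence length itself being a multiple of 128. Whether padding is an acceptable way to reach that length for a given workload is an assumption of this sketch and is not stated in the README.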