Update README.md
README.md
CHANGED
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
 ---
 
 
-# Wayfarer-2-12B
+# Wayfarer-2-12B-NVFP4
 
 Quantized NVFP4 weights of the [Wayfarer-2-12B](https://huggingface.co/LatitudeGames/Wayfarer-2-12B) model, for use with nVidia Blackwell GPUs.
 
@@ -43,7 +43,7 @@ dataset_utils.SUPPORTED_DATASET_CONFIG["distilled-roleplay"] = {
 
 ## Inference
 
-Tested on a RTX 5060 Ti 16GB with TensorRT-LLM, vLLM, and
+Tested on a RTX 5060 Ti 16GB with TensorRT-LLM, vLLM, SGLang, and Aphrodite Engine.
 
 Recommended generation settings (a mix of what it says on the Wayfarer-2-12B model card and the [AI Dungeon Model Guide](https://help.aidungeon.com/ai-models-and-their-differences) entry for Wayfarer 2):
 - Temperature: 1.1