Update README.md
# Muse-12B-NVFP4

Quantized NVFP4 weights of the [Muse-12B](https://huggingface.co/LatitudeGames/Muse-12B) model, for use with NVIDIA Blackwell GPUs.
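NVFP4 stores weights as 4-bit floats (E2M1: one sign, two exponent, and one mantissa bit) plus a scale factor per small block of elements. The sketch below is illustrative only — it uses a plain per-block float scale rather than NVFP4's packed on-disk scale encoding, and the 16-element block size is the commonly described micro-block width — but it shows the round-to-nearest idea:

```python
import numpy as np

# Representable magnitudes of FP4 E2M1: 0, 0.5, 1, 1.5, 2, 3, 4, 6.
E2M1 = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])
E2M1_GRID = np.concatenate([-E2M1[:0:-1], E2M1])  # signed grid, single zero

def quantize_block(w):
    """Quantize one 16-element block: choose a scale so the largest |w|
    maps to 6.0 (the E2M1 maximum), then snap each scaled value to the
    nearest grid point. Returns (quantized values, scale)."""
    w = np.asarray(w, dtype=np.float64)
    amax = np.abs(w).max()
    scale = amax / 6.0 if amax > 0 else 1.0
    idx = np.abs(w[:, None] / scale - E2M1_GRID).argmin(axis=1)
    return E2M1_GRID[idx], scale

block = np.random.default_rng(0).normal(size=16)
q, scale = quantize_block(block)
dequant = q * scale  # approximate reconstruction of the original block
```

Dequantization is just `q * scale`, which is why the format needs hardware FP4 support (as on Blackwell) to pay off at inference time.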
## Inference

Tested on an RTX 5060 Ti 16GB with TensorRT-LLM, vLLM, SGLang, and Aphrodite Engine.
Recommended generation settings (a mix of the recommendations on the Muse-12B model card and the [AI Dungeon Model Guide](https://help.aidungeon.com/ai-models-and-their-differences)):

- Temperature: 1.0
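Temperature rescales the model's logits before sampling: 1.0 uses the model's distribution as-is, lower values sharpen it, higher values flatten it. A minimal numpy sketch (the logits here are hypothetical, not tied to this model):

```python
import numpy as np

def sample(logits, temperature=1.0, rng=None):
    """Sample a token id after temperature scaling of the logits."""
    if rng is None:
        rng = np.random.default_rng()
    scaled = np.asarray(logits, dtype=np.float64) / temperature
    probs = np.exp(scaled - scaled.max())  # subtract max for stability
    probs /= probs.sum()
    return rng.choice(len(probs), p=probs)

logits = np.array([2.0, 1.0, 0.1])  # hypothetical next-token scores
token = sample(logits, temperature=1.0)
```

In practice you would pass the temperature to your inference engine's sampling parameters rather than sampling by hand; this just shows what the knob does.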