Add note clarifying HF param count vs actual 196.81B
README.md
CHANGED
@@ -12,17 +12,14 @@ tags:
 library_name: transformers
 quantized_by: tacos4me
 pipeline_tag: text-generation
-model-index:
-- name: Step-3.5-Flash-NVFP4
-results: []
-extra:
-num_params: 196810000000
 ---
 
 # Step-3.5-Flash-NVFP4
 
 NVFP4-quantized version of [stepfun-ai/Step-3.5-Flash](https://huggingface.co/stepfun-ai/Step-3.5-Flash), an open-source frontier-level reasoning model by StepFun with 196.81B total parameters and ~11B active parameters per token.
 
+> **Note:** HuggingFace displays ~111B params because it counts packed FP4 tensor elements (2 weights per byte). The actual model has **196.81B parameters**, identical to the base model.
+
 ## Model Description
 
 [Step 3.5 Flash](https://huggingface.co/stepfun-ai/Step-3.5-Flash) is an open-source foundation model designed for frontier-level reasoning and agentic capabilities with exceptional efficiency. Key highlights from the base model:
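The note added in this commit attributes the lower displayed figure to packed FP4 storage, where each stored byte holds two weights. A minimal sketch of that arithmetic follows, assuming the displayed count is a plain sum of stored tensor elements. The tensor names, shapes, the `_scale` suffix, and the 16-weight scale block are hypothetical illustrations, not values read from the actual checkpoint, so the totals are not meant to reproduce the ~111B or 196.81B figures.

```python
from math import prod

# Hypothetical checkpoint entries: name -> (stored shape, dtype, is_packed_fp4).
# These names and shapes are illustrative assumptions, not the real layout.
TENSORS = {
    # 4096 x 4096 FP4 weights stored two per byte -> 4096 x 2048 uint8 elements
    "layer.weight_packed": ((4096, 2048), "uint8", True),
    # assumed: one scale per block of 16 weights -> 4096 x 256 scale elements
    "layer.weight_scale": ((4096, 256), "float8", False),
    # assumed: a small tensor left unquantized
    "norm.weight": ((4096,), "bfloat16", False),
}


def stored_elements(tensors):
    """Count elements as stored on disk (what a naive element count would report)."""
    return sum(prod(shape) for shape, _dtype, _packed in tensors.values())


def logical_weights(tensors):
    """Count model weights, unpacking FP4 bytes and skipping quantization scales."""
    total = 0
    for name, (shape, _dtype, packed) in tensors.items():
        if name.endswith("_scale"):
            continue  # quantization metadata, not a model parameter
        count = prod(shape)
        total += 2 * count if packed else count
    return total


if __name__ == "__main__":
    print("stored elements:", stored_elements(TENSORS))
    print("logical weights:", logical_weights(TENSORS))
```

In this toy layout the stored count comes out smaller than the logical count, which is the direction of the discrepancy the note describes. The stored count lands above half of the logical count because scale tensors and unquantized tensors are counted one-for-one.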