README: point at renamed Llama *-FlashNorm-test verification checkpoints
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ FlashNorm-prepared checkpoint of [HuggingFaceTB/SmolLM2-135M](https://huggingfac
|
|
| 15 |
|
| 16 |
> **Framework support note.** Stock vLLM currently does not load this checkpoint because the norm weight tensors are absent. The upstream patch to accept missing tensors is tracked at: **TBD (vLLM issue link)**. Until the patch lands, use HuggingFace Transformers; it loads this with a warning that norm weights were not initialized and defaults them to ones, which is the correct behavior for FlashNorm.
|
| 17 |
>
|
| 18 |
-
>
|
| 19 |
|
| 20 |
## What FlashNorm does
|
| 21 |
|
|
|
|
| 15 |
|
| 16 |
> **Framework support note.** Stock vLLM currently does not load this checkpoint because the norm weight tensors are absent. The upstream patch to accept missing tensors is tracked at: **TBD (vLLM issue link)**. Until the patch lands, use HuggingFace Transformers; it loads this with a warning that norm weights were not initialized and defaults them to ones, which is the correct behavior for FlashNorm.
|
| 17 |
>
|
| 18 |
+
> Two additional Llama-family verification checkpoints are published as [Llama-3.2-1B-FlashNorm-test](https://huggingface.co/open-machine/Llama-3.2-1B-FlashNorm-test) and [Llama-3.1-8B-FlashNorm-test](https://huggingface.co/open-machine/Llama-3.1-8B-FlashNorm-test). These retain the norm tensors as all-ones (compatibility layout) so they load in stock vLLM today and are intended for experimentation. They will be republished as weightless variants once vLLM's loader supports absent norm tensors.
|
| 19 |
|
| 20 |
## What FlashNorm does
|
| 21 |
|