fm1320 commited on
Commit
66e4cd7
·
verified ·
1 Parent(s): 3b2c714

README: point at renamed Llama *-FlashNorm-test verification checkpoints

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ FlashNorm-prepared checkpoint of [HuggingFaceTB/SmolLM2-135M](https://huggingfac
15
 
16
  > **Framework support note.** Stock vLLM currently does not load this checkpoint because the norm weight tensors are absent. The upstream patch to accept missing tensors is tracked at: **TBD (vLLM issue link)**. Until the patch lands, use HuggingFace Transformers; it loads this with a warning that norm weights were not initialized and defaults them to ones, which is the correct behavior for FlashNorm.
17
  >
18
- > The other two public FlashNorm checkpoints in this org, [Llama-3.2-1B-FlashNorm](https://huggingface.co/open-machine/Llama-3.2-1B-FlashNorm) and [Llama-3.1-8B-FlashNorm](https://huggingface.co/open-machine/Llama-3.1-8B-FlashNorm), are currently still in a compatibility layout where the norm tensors are retained as all-ones. They will be flipped to the same weightless layout as this checkpoint once vLLM's loader supports it.
19
 
20
  ## What FlashNorm does
21
 
 
15
 
16
  > **Framework support note.** Stock vLLM currently does not load this checkpoint because the norm weight tensors are absent. The upstream patch to accept missing tensors is tracked at: **TBD (vLLM issue link)**. Until the patch lands, use HuggingFace Transformers; it loads this with a warning that norm weights were not initialized and defaults them to ones, which is the correct behavior for FlashNorm.
17
  >
18
+ > Two additional Llama-family verification checkpoints are published as [Llama-3.2-1B-FlashNorm-test](https://huggingface.co/open-machine/Llama-3.2-1B-FlashNorm-test) and [Llama-3.1-8B-FlashNorm-test](https://huggingface.co/open-machine/Llama-3.1-8B-FlashNorm-test). These retain the norm tensors as all-ones (compatibility layout) so they load in stock vLLM today and are intended for experimentation. They will be republished as weightless variants once vLLM's loader supports absent norm tensors.
19
 
20
  ## What FlashNorm does
21