CofeAI
/

FLM-2-52B-Instruct-2407

Feature Extraction

Model card Files Files and versions

sleepylx commited on Jul 22, 2024

Commit

2e293b9

·

verified ·

1 Parent(s): 8498f6d

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ FLM-2-52B-Instruct utilizes the standard GPT-style decoder-only transformer arch
 * Embedding and language model head untied
 * Input and output multiplier
-| Models        | layer<br>number | attention<br>heads | hidden<br>size | ffn hidden<br>size | vocab<br>size |  params<br>count |
-| ------------- | --------------- | ------------------ | -------------- | ------------------ | ------------- |  --------------- |
-| FLM-2-52B-Instruct-2407  | 64              | 64                 | 8,192          | 21,824             | 80,000        |  52.85 B         |
 # Training details

 * Embedding and language model head untied
 * Input and output multiplier
+| Models                    | layer<br>number | attention<br>heads | hidden<br>size | ffn hidden<br>size | vocab<br>size |  params<br>count |
+| -------------             | :-------------: | :----------------: | :------------: | :----------------: | :-----------: | :--------------: |
+| FLM-2-52B-Instruct-2407   | 64              | 64                 | 8,192          | 21,824             | 80,000        |  52.85 B         |
 # Training details