InfCapital LLama2-7b is a clone of [Meta's Llama 2 7B Chat](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf).
It is adapted for Vietnamese continued pretraining or fine-tuning by extending the vocabulary from 32,000 to 44,800 tokens. The added tokens were produced by training a SentencePiece model on the vnnews-corpus dataset.
## Model Architecture
```
LlamaForCausalLM(
  (model): LlamaModel(
    (embed_tokens): Embedding(44800, 4096)
    (layers): ModuleList(
      (0-31): 32 x LlamaDecoderLayer(
        (self_attn): LlamaAttention(
          ...
        )
      )
    )
    (norm): LlamaRMSNorm()
  )
  (lm_head): Linear(in_features=4096, out_features=44800, bias=False)
)
```
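Growing `embed_tokens` and `lm_head` from 32,000 to 44,800 rows amounts to copying the existing weight rows and initializing the new ones. A minimal sketch with small stand-in sizes follows; mean-initialization of the added rows is a common choice but an assumption here, not something this README specifies. In a `transformers` workflow, `model.resize_token_embeddings(len(tokenizer))` performs this resize for both matrices.

```python
import torch
import torch.nn as nn

def extend_embedding(old_emb: nn.Embedding, new_vocab_size: int) -> nn.Embedding:
    """Return a larger embedding: old rows copied, new rows mean-initialized."""
    old_vocab_size, dim = old_emb.weight.shape
    new_emb = nn.Embedding(new_vocab_size, dim)
    with torch.no_grad():
        # Keep the pretrained rows exactly as they were.
        new_emb.weight[:old_vocab_size] = old_emb.weight
        # Assumption: initialize added rows with the mean of the old rows.
        new_emb.weight[old_vocab_size:] = old_emb.weight.mean(dim=0)
    return new_emb

# Demo with small sizes standing in for 32,000 -> 44,800.
emb = nn.Embedding(32, 8)
bigger = extend_embedding(emb, 44)
```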