Update README.md

<!-- This section is meant to convey both technical and sociotechnical limitations. -->

This model has not been trained with RLHF, so no safety alignment has been applied.

## How to Get Started with the Model

```python
outputs = model.generate(**inputs, tokenizer=tokenizer, max_new_tokens=256, do_s...
print(tokenizer.decode(outputs[0]))
```
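The snippet above leaves `model`, `tokenizer`, and `inputs` undefined, and the `generate` call is truncated in the diff. Since OpenHermes-style models commonly use the ChatML prompt format, here is a hedged sketch of how such a prompt could be built by hand (the format is an assumption — check the tokenizer's `chat_template` for the authoritative version):

```python
# Hedged sketch: hand-build a ChatML-style prompt. The ChatML format is
# an assumption based on the OpenHermes lineage, not stated on this card.
def build_chatml_prompt(messages):
    parts = []
    for msg in messages:
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>")
    parts.append("<|im_start|>assistant\n")  # open the assistant turn for generation
    return "\n".join(parts)

prompt = build_chatml_prompt([
    {"role": "user", "content": "Xin chào! Bạn là ai?"},
])
print(prompt)
```

With `transformers`, the same string is normally produced by `tokenizer.apply_chat_template(messages, add_generation_prompt=True)`, which reads the template shipped with the checkpoint.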

## Training Details

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

Trained on the OpenHermes dataset (translated to Vietnamese), with more than 600k samples.

#### Training Hyperparameters

<!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->

- **target_modules:** q_proj, k_proj, v_proj, o_proj, up_proj, down_proj, gate_proj
- **batch_size:** 2048
- **epochs:** 1