Update README.md
README.md (changed)

@@ -8,22 +8,6 @@ tags:
 - llama3
 ---
 
----
-frameworks:
-- Pytorch
-license: Apache License 2.0
-tasks:
-- chatbot
-
-language:
-- cn
-
-tags:
-- RL-tuned
-
-tools:
-- vllm
----
 Github:https://github.com/CrazyBoyM/llama3-Chinese-chat
 Releasing the training recipe details for the community to reference:
 DPO (beta 0.5) + LoRA rank 128, alpha 256, with the "lm_head", "input_layernorm", "post_attention_layernorm", "norm" layers unfrozen for training.
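The recipe line above (DPO with beta 0.5; LoRA rank 128, alpha 256; the lm_head and normalization layers additionally trained) could be sketched as peft/trl configuration objects roughly as below. This is an assumption-laden illustration, not the author's actual training script: the `target_modules` list and the exact trl/peft argument names are guesses based on common usage and may differ across library versions.

```python
# Hypothetical sketch of the stated recipe (not from the original card).
from peft import LoraConfig
from trl import DPOConfig

# LoRA rank 128, alpha 256. The layers named in the recipe are not
# LoRA-adapted; modules_to_save makes them fully trainable instead.
peft_config = LoraConfig(
    r=128,
    lora_alpha=256,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed targets
    modules_to_save=["lm_head", "input_layernorm",
                     "post_attention_layernorm", "norm"],
)

# DPO with beta = 0.5, as stated in the recipe.
dpo_args = DPOConfig(beta=0.5, output_dir="llama3-chinese-dpo")
```

In practice these objects would be handed to `trl.DPOTrainer` (as `peft_config=` and `args=`) together with the base Llama-3 model and a preference dataset of chosen/rejected response pairs.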