DevQuasar/vintage-nextstep_os_systemadmin-ft-phi2


csabakecskemeti committed on Mar 13, 2024

Commit fd6bcfd · verified · 1 Parent(s): 95c0f7e

Update README.md

Files changed (1)

README.md +1 -1


@@ -16,7 +16,7 @@ contains less than 100 tokens). The maximum token size for Orca2 is 4096 so a si
 Evaluation set has been generated similar method on 1% of the raw data with LLama2 chat (https://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF).
 Trained locally on 2x3090 GPU with vanila DDP with HuggingFace Accelerate for 50 Epoch.
-As I wanted to add new knowledge to the base model r=128 and lora_alpha=128 has been used -> LoRA weights are 3.5% of the base model.
+As I wanted to add new knowledge to the base model r=128 and lora_alpha=128 has been used -> LoRA weights were 3.5% of the base model.
 Chat with model sample code:
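
Note on the changed line: the README states that r=128 and lora_alpha=128 give LoRA weights of about 3.5% of the base model. The sketch below shows how such a fraction can be estimated in general. It is not taken from the repository: the layer shapes, layer count, and the choice of which matrices are adapted are hypothetical values chosen only for illustration, and the real figure depends on which modules the author actually targeted.

```python
# Minimal sketch: estimate how many trainable parameters LoRA adds.
# All sizes below are hypothetical and do NOT describe the phi-2 fine-tune.

def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Parameters added to one (d_out x d_in) linear layer.

    LoRA learns two low-rank matrices: A with shape (r, d_in)
    and B with shape (d_out, r), so the total is r * (d_in + d_out).
    """
    return r * (d_in + d_out)


def lora_fraction(layer_shapes, r: int, base_params: int) -> float:
    """Ratio of added LoRA parameters to the base model's parameter count."""
    added = sum(lora_param_count(d_in, d_out, r) for d_in, d_out in layer_shapes)
    return added / base_params


# Hypothetical toy model: 8 layers, each with 4 adapted square projections.
hidden = 1024
n_layers = 8
shapes = [(hidden, hidden)] * (4 * n_layers)

# Hypothetical base size: counts only the adapted matrices themselves.
base_params = 4 * n_layers * hidden * hidden

fraction = lora_fraction(shapes, r=128, base_params=base_params)

# The LoRA update is scaled by lora_alpha / r; with both at 128 the scale is 1.
scaling = 128 / 128

print(f"LoRA parameters relative to base: {fraction:.1%}, scaling: {scaling}")
```

Because the added parameter count grows linearly with r, a large rank such as 128 yields a comparatively large adapter, which is consistent with the README's stated goal of adding new knowledge rather than only adjusting style.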