### Training Data

The training data consists of questions and answers generated using the head-to-tail pipeline with a DBpedia script. See the paper and the GitHub repository for more details.

The model was trained on 3,000 Unknown questions, with 10 additional HighlyKnown questions per Unknown question.
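The 1:10 mix of Unknown to HighlyKnown questions described above can be sketched as follows. This is an illustrative sketch only, not the actual head-to-tail pipeline code; the function name and placeholder data are hypothetical.

```python
# Illustrative sketch (not the actual pipeline code): interleave each
# Unknown question with a fixed number of HighlyKnown questions,
# mirroring the 1:10 mix described above.
def build_training_mix(unknown, highly_known, ratio=10):
    mix, hk_idx = [], 0
    for question in unknown:
        mix.append(question)
        for _ in range(ratio):
            # Cycle through the HighlyKnown pool.
            mix.append(highly_known[hk_idx % len(highly_known)])
            hk_idx += 1
    return mix

unknown = [f"unknown_{i}" for i in range(3)]
highly_known = [f"highly_known_{i}" for i in range(30)]
mix = build_training_mix(unknown, highly_known)
print(len(mix))  # 3 Unknown * (1 + 10) = 33 items
```

With the full 3,000 Unknown questions, the same construction yields 33,000 training items.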
### Training Procedure

The model was fine-tuned using LoRA.
#### Training Hyperparameters

- LR = 1e-3
- BS = 8
- EPOCHS = 10
- LoRA:
  - lora_rank = 1
  - lora_alpha = 2
  - use_rslora = True
  - lora_dropout = 0.1
  - bias = "none"
  - target_modules = ["down_proj", "gate_proj", "up_proj"]
  - task_type = "CAUSAL_LM"
## Evaluation

For evaluation, you can use the [notebooks](https://github.com/AIRI-Institute/knowledge-packing/tree/main/notebooks) from the GitHub repository.
## Environmental Impact