README.md CHANGED
```diff
@@ -8,8 +8,13 @@ tags:
 - qlora
 - peft
 - llama3
+- llama-factory
 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 pipeline_tag: text-classification
+datasets:
+- Hate-speech-CNERG/hatexplain
+metrics:
+- accuracy
 ---
 # llama3.1-8b-Instruct-qlora-hatexplain
 
@@ -50,7 +55,7 @@ python scripts/qlora_inference.py \
 ## Training Notes
 
 - LLaMA-Factory SFT stage with LoRA rank 8, alpha 16, dropout 0.05.
-- Cutoff length
+- Cutoff length 1024, cosine scheduler, 3 epochs, learning rate 2e-5.
 - QLoRA (4-bit) backbone for efficient fine-tuning on a single GPU.
 
-Refer to `config/llama31_hatexplain_qlora_sft.yaml` for the full set of hyperparameters.
+Refer to `config/llama31_hatexplain_qlora_sft.yaml` for the full set of hyperparameters.
```