BEncoderRT
/

Pythia-QLoRA-Instruction-Tuning

Text Generation

Instruction-Tuning

Model card Files Files and versions

BEncoderRT commited on Jan 8

Commit

76a2e30

·

verified ·

1 Parent(s): eb03849

Update README.md

Files changed (1) hide show

README.md +15 -1

README.md CHANGED Viewed

@@ -1,3 +1,17 @@
 # QLoRA Instruction Tuning on Pythia-1B
 This repository provides a **Hugging Face–compatible LoRA adapter** trained via **QLoRA (4-bit quantization + LoRA adapters)** on the **EleutherAI Pythia-1B-deduped** base model.
@@ -444,4 +458,4 @@ This adapter is intended for **research, experimentation, and non-production use
 ---
-This repository provides a **clean, minimal reference implementation** of QLoRA-based instruction tuning on a 1B-scale language model.

+---
+license: mit
+datasets:
+- databricks/databricks-dolly-15k
+language:
+- en
+base_model:
+- EleutherAI/pythia-1b-deduped
+pipeline_tag: text-generation
+tags:
+- QLORA
+- Instruction-Tuning
+- peft
+---
 # QLoRA Instruction Tuning on Pythia-1B
 This repository provides a **Hugging Face–compatible LoRA adapter** trained via **QLoRA (4-bit quantization + LoRA adapters)** on the **EleutherAI Pythia-1B-deduped** base model.
 ---
+This repository provides a **clean, minimal reference implementation** of QLoRA-based instruction tuning on a 1B-scale language model.