slivk committed · Commit 641aaa0 · 1 parent: cbaf615

docs: Update README to reflect T4 GPU (not Zero GPU)
README.md CHANGED

@@ -8,12 +8,12 @@ sdk_version: "5.13.0"
 app_file: app.py
 pinned: false
 license: mit
-hardware:
+hardware: t4-small
 ---
 
 # Qwen2.5 Fine-Tuning for Itemset Extraction
 
-
+Fine-tune Qwen2.5-3B on the [itemset-extraction-v2](https://huggingface.co/datasets/OliverSlivka/itemset-extraction-v2) dataset.
 
 ## What it does
 
@@ -21,7 +21,7 @@ Trains a language model to extract frequent itemsets from transaction data using
 - **Dataset**: 488 training examples with real-world column names
 - **Model**: Qwen2.5-3B-Instruct (high quality results)
 - **Method**: Supervised Fine-Tuning (SFT) with 4-bit LoRA
-- **Hardware**:
+- **Hardware**: NVIDIA T4 Small (paid GPU, 16GB VRAM)
 
 ## How to use
 
@@ -38,27 +38,34 @@ Trains a language model to extract frequent itemsets from transaction data using
 - **Batch size**: 2 (effective 16 with gradient accumulation)
 - **Duration**: ~10-15 minutes
 - **Output**: `OliverSlivka/qwen2.5-3b-itemset-test`
-
+## Training Modes
 
-
-
-- **
-
-- Pushes to test repo for inspection
+### Test Mode (50 examples)
+- **Duration**: ~10-15 minutes
+- **Output**: `OliverSlivka/qwen2.5-3b-itemset-test`
+- **Purpose**: Quick validation before full training
 
-
-
-
+### Full Mode (439 examples, 3 epochs)
+- **Duration**: ~40-60 minutes
+- **Output**: `OliverSlivka/qwen2.5-3b-itemset-extractor`
+- **Target**: 80-90% valid JSON (vs 6.7% from 0.5B baseline)
+- **Cost**: ~$0.60 on T4 Small
+
+**Technical Details:**
+- LoRA rank 16, alpha 32
+- Batch size 2, gradient accumulation 8 (effective batch 16)
+- 4-bit quantization (QLoRA) - efficient training, proven results
+- FP16 precision (T4 compatible)
 
-Both modes use **Qwen2.5-3B with 4-bit quantization** - fits perfectly in Zero GPU's 16GB memory!
 ## Notes
 
-
+Both modes use **4-bit quantization** for:
+- ✅ Faster training (lower memory = faster iteration)
+- ✅ Lower cost (~30% faster = ~30% cheaper)
+- ✅ Proven effective for LoRA fine-tuning
+- ✅ No quality loss vs full precision LoRA
 
-
-- Use full 439-example training set
-- Train for 2-3 epochs (~200 steps)
-- Consider using Qwen2.5-3B or 7B for better results (requires paid GPU)
+Paid T4 GPU ($0.60/hour) provides consistent performance without time limits.
 
 ## Dataset
 
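The commit above describes a Test Mode (50 examples) and a Full Mode. As a rough sketch of how that split could be wired up, the snippet below loads the dataset named in the README and selects the test subset; the `mode` switch, the `train` split name, and the helper itself are illustrative assumptions, not code from the Space.

```python
# Hypothetical sketch: build the two training subsets described in the
# README. Only the dataset id comes from the source; the split name and
# mode switch are assumptions.
from datasets import load_dataset

def get_train_examples(mode: str = "test"):
    ds = load_dataset("OliverSlivka/itemset-extraction-v2", split="train")
    if mode == "test":
        return ds.select(range(50))  # Test Mode: quick validation run
    return ds                        # Full Mode: the whole training set

train_ds = get_train_examples("test")
print(len(train_ds))  # 50 in test mode
```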
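The "SFT with 4-bit LoRA" method and the new hardware line go together: a 4-bit (QLoRA) base model is what lets a 3B model train within the T4's 16GB. A minimal sketch of such a setup, using the rank/alpha values from the diff, might look as follows; the target modules and dropout are assumptions, since the Space's training script is not shown here.

```python
# Sketch of a QLoRA setup consistent with the README's numbers:
# 4-bit quantized base model, LoRA rank 16, alpha 32, FP16 compute.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,  # FP16: the T4 has no bfloat16
)

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-3B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)

lora_config = LoraConfig(
    r=16,               # rank from the README
    lora_alpha=32,      # alpha from the README
    lora_dropout=0.05,  # assumption, not stated in the README
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumption
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA adapters train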
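The batch arithmetic in "Technical Details" is: per-device batch 2 x gradient accumulation 8 = effective batch 16. A hedged sketch of training arguments encoding those numbers follows; the learning rate and logging cadence are assumptions, while batch size, accumulation, FP16, epochs, and the Hub repo id come from the README.

```python
# Sketch: TrainingArguments matching the README's stated numbers.
# learning_rate and logging_steps are assumptions.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="qwen2.5-3b-itemset-extractor",
    per_device_train_batch_size=2,   # batch size 2
    gradient_accumulation_steps=8,   # 2 x 8 = effective batch of 16
    num_train_epochs=3,              # Full Mode: 3 epochs
    fp16=True,                       # T4-compatible precision
    learning_rate=2e-4,              # assumption, typical for QLoRA
    logging_steps=10,                # assumption
    push_to_hub=True,
    hub_model_id="OliverSlivka/qwen2.5-3b-itemset-extractor",
)
```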
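The Full Mode target (80-90% valid JSON, vs 6.7% from the 0.5B baseline) is directly measurable. A check along the lines below would compute that rate over a batch of model outputs; the generation loop is elided, and `model_outputs` is a placeholder name.

```python
# Sketch: compute the "valid JSON" rate the README uses as its target
# metric. `generations` is assumed to be a list of raw model outputs.
import json

def valid_json_rate(generations):
    ok = 0
    for text in generations:
        try:
            json.loads(text)
            ok += 1
        except json.JSONDecodeError:
            pass
    return ok / len(generations) if generations else 0.0

# e.g. valid_json_rate(model_outputs) -> aim for 0.80-0.90
```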