Instructions to use whats2000/lora-llama-pathoqa-checkpoints with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use whats2000/lora-llama-pathoqa-checkpoints with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
LoRA-Llama PathoQA β all trained adapters (ablation study)
LoRA / DoRA adapters for meta-llama/Llama-3.2-1B-Instruct fine-tuned on
PathoQA (4-option medical pathology MCQA). This repo holds every adapter
from a 10-experiment ablation study (base model fixed at 1B; the study is about
method). Full lab journal and code: see the course submission package
(report.ipynb + experiments/NOTES.md).
Metric = Kaggle hw-1-question-answering test accuracy (public == private).
Best pipeline β 0.7988 (5-member option-likelihood ensemble)
Uniform-average the per-option probabilities of these 5 adapters (all rank 256, effective-batch 192):
| role | path |
|---|---|
| zero-shot r256 | experiments/exp06_zeroshot_recipe/e06c/saved_models |
| zero-shot r192 | experiments/exp07_eff192_ensemble/e07a/saved_models |
| few-shot 2-shot | experiments/exp08_fewshot_diversity/e08b/saved_models |
| few-shot 4-shot | experiments/exp07_eff192_ensemble/e07b/saved_models |
| few-shot 8-shot | experiments/exp08_fewshot_diversity/e08a/saved_models |
Progression: baseline 0.7700 β rank=256 0.7766 β +ensemble 0.7811 β +few-shot 0.7888 β @eff192 0.7922 β +multi-shot 0.7988.
Load an adapter
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
from huggingface_hub import snapshot_download
base_id = "meta-llama/Llama-3.2-1B-Instruct"
local = snapshot_download("whats2000/lora-llama-pathoqa-checkpoints",
allow_patterns="experiments/exp06_zeroshot_recipe/e06c/saved_models/*")
adapter = f"{local}/experiments/exp06_zeroshot_recipe/e06c/saved_models"
tok = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, adapter).eval()
All adapters (organised by experiment)
exp01_lora_rank/e01a..eβ LoRA rank sweep rβ{8,16,32,64,128} (zero-shot)exp02_rank_scaling/e02a..eβ rank push rβ{128,192,256,384,512}exp04_strategy_diverse/e04aβ few-shot @ eff768exp05_dora_variant/e05a..cβ DoRA rβ{128,192,256}exp06_zeroshot_recipe/e06a..fβ LR / effective-batch / loss recipeexp07_eff192_ensemble/e07a,bβ zero/few-shot retrained @ eff-batch 192exp08_fewshot_diversity/e08a,bβ few-shot 8-shot / 2-shot @ eff192exp09_more_fewshot/e09a..cβ few-shot 1/3/6-shot @ eff192
Each dir is a standard PEFT adapter (adapter_config.json +
adapter_model.safetensors) plus its training_history.json and loss/accuracy
curves.
- Downloads last month
- -
Model tree for whats2000/lora-llama-pathoqa-checkpoints
Base model
meta-llama/Llama-3.2-1B-Instruct