MDIIII's picture
Upload README.md with huggingface_hub
af21830 verified
|
Raw
History Blame Contribute Delete
1.35 kB
---
license: apache-2.0
base_model: openbmb/MiniCPM4-8B
tags:
- question-generation
- education
- lora
- paperprof
datasets:
- squad
- allenai/sciq
language:
- en
---
# MiniCPM4-8B-PaperProf
Fine-tuned from [openbmb/MiniCPM4-8B](https://huggingface.co/openbmb/MiniCPM4-8B) for
**exam-question generation** in [PaperProf](https://huggingface.co/spaces/build-small-hackathon/PaperProf),
an AI study buddy that turns course PDFs into interactive quiz sessions.
## Training
- **Method:** QLoRA (4-bit NF4, r=16, alpha=32, all-linear targets), merged to bf16
- **Data:** ~3500 multi-task pairs in PaperProf's three production formats:
open question generation (SQuAD), MCQ with distractors and per-option
explanations (SciQ), and structured answer evaluation (SQuAD-derived),
so the model is optimized for the exact tasks it serves.
- **Epochs:** 1, lr 2e-4 cosine, bf16 compute
## Usage
Drop-in replacement for the base model:
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
tok = AutoTokenizer.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True, torch_dtype="bfloat16")
```
Built for the Build Small Hackathon, June 2026, by Team PaperProf (EPITA).