MDIIII's picture
Upload README.md with huggingface_hub
af21830 verified
|
Raw
History Blame Contribute Delete
1.35 kB
metadata
license: apache-2.0
base_model: openbmb/MiniCPM4-8B
tags:
  - question-generation
  - education
  - lora
  - paperprof
datasets:
  - squad
  - allenai/sciq
language:
  - en

MiniCPM4-8B-PaperProf

Fine-tuned from openbmb/MiniCPM4-8B for exam-question generation in PaperProf, an AI study buddy that turns course PDFs into interactive quiz sessions.

Training

  • Method: QLoRA (4-bit NF4, r=16, alpha=32, all-linear targets), merged to bf16
  • Data: ~3500 multi-task pairs in PaperProf's three production formats: open question generation (SQuAD), MCQ with distractors and per-option explanations (SciQ), and structured answer evaluation (SQuAD-derived), so the model is optimized for the exact tasks it serves.
  • Epochs: 1, lr 2e-4 cosine, bf16 compute

Usage

Drop-in replacement for the base model:

from transformers import AutoTokenizer, AutoModelForCausalLM
tok = AutoTokenizer.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("build-small-hackathon/MiniCPM4-8B-PaperProf", trust_remote_code=True, torch_dtype="bfloat16")

Built for the Build Small Hackathon, June 2026, by Team PaperProf (EPITA).