Trained model: submission_dpo_from_validator

  • Date: 2025-07-09T21:58:08.998726
  • Epochs: 2
  • Learning Rate: 1e-4
  • LoRA config: r=64, q=128

SHA256 of the model (model.safetensors): 238bf8909c1d373d93571890b67438ec1130ea05edd2f0a3556feba865061107
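
One way to check a downloaded copy against the checksum above is sketched below. It assumes the weights were saved locally as `model.safetensors`. The helper names `sha256_of_file` and `matches_expected` are illustrative and not part of this repository.

```python
import hashlib

# Checksum published in this model card for model.safetensors.
EXPECTED_SHA256 = "238bf8909c1d373d93571890b67438ec1130ea05edd2f0a3556feba865061107"


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals the expected checksum (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("model.safetensors")` on the downloaded file should return `True` if the file is intact; the local path is an assumption and may differ depending on how the file was fetched.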