CreitinGameplays's picture
Update README.md
e74f6ce verified
|
raw
history blame
834 Bytes
metadata
base_model: CreitinGameplays/Mistral-Nemo-12B-R1-v0.4
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
license: apache-2.0
language:
  - en
datasets:
  - allura-forge/doubao-seed2.0-claude-distill-v1-qwen3.5-format

Uploaded finetuned model

  • Developed by: CreitinGameplays
  • License: apache-2.0
  • Finetuned from model : CreitinGameplays/Mistral-Nemo-12B-R1-v0.4

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.

Recommended generation parameters:

  • Temperature: 0.6 - 0.9
  • Top_P = 0.95
  • Top_K = 40
  • Repeat Penalty = 1.0 (default)
  • Do Sample = true