Update README.md

e74f6ce verified about 1 month ago

834 Bytes

base_model: CreitinGameplays/Mistral-Nemo-12B-R1-v0.4
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
license: apache-2.0
language:
  - en
datasets:
  - allura-forge/doubao-seed2.0-claude-distill-v1-qwen3.5-format

Uploaded finetuned model

Developed by: CreitinGameplays
License: apache-2.0
Finetuned from model : CreitinGameplays/Mistral-Nemo-12B-R1-v0.4

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.

Recommended generation parameters:

Temperature: 0.6 - 0.9
Top_P = 0.95
Top_K = 40
Repeat Penalty = 1.0 (default)
Do Sample = true