DidulaThavishaPro
/

exp_10_4_grpo_smooth_error_16bit_vllm

Text Generation

text-generation-inference

Model card Files Files and versions

exp_10_4_grpo_smooth_error_16bit_vllm

Commit History

(Trained with Unsloth)

cefc777
verified

DidulaThavishaPro commited on Oct 21, 2025

(Trained with Unsloth)

5248ee4
verified

DidulaThavishaPro commited on Oct 21, 2025

(Trained with Unsloth)

7a1488e
verified

DidulaThavishaPro commited on Oct 21, 2025

(Trained with Unsloth)

2ad4165
verified

DidulaThavishaPro commited on Oct 21, 2025

(Trained with Unsloth)

02fb05a
verified

DidulaThavishaPro commited on Oct 21, 2025

(Trained with Unsloth)

19132cb
verified

DidulaThavishaPro commited on Oct 21, 2025

Unsloth Model Card

da49ae5
verified

DidulaThavishaPro commited on Oct 21, 2025

initial commit

fa9cdae
verified

DidulaThavishaPro commited on Oct 21, 2025