
morizon/llm-jp-3-13b-instruct2-grpo-R1-0223_lora_step1600

text-generation-inference


Repository size: 1.01 GB

1 contributor

History: 4 commits


Latest commit: Trained with Unsloth (17a3593, verified, about 1 year ago)

File                        Size        Last commit                            Updated
.gitattributes              1.52 kB     initial commit                         about 1 year ago
README.md                   578 Bytes   Upload README.md with huggingface_hub  about 1 year ago
adapter_config.json         800 Bytes   Trained with Unsloth                   about 1 year ago
adapter_model.safetensors   1 GB (xet)  Trained with Unsloth                   about 1 year ago
special_tokens_map.json     874 Bytes   Trained with Unsloth                   about 1 year ago
tokenizer.json              6.41 MB     Trained with Unsloth                   about 1 year ago
tokenizer_config.json       2.39 kB     Trained with Unsloth                   about 1 year ago