Sizzing
/

aws-rl-grpo-qwen25coder3b-adapter

Text Generation

reinforcement-learning

Model card Files Files and versions

aws-rl-grpo-qwen25coder3b-adapter

45.4 MB

Ctrl+K

Ctrl+K

1 contributor

History: 8 commits

Sizzing's picture

GRPO run 2026-04-25T19:02+00:00

6f976d1 verified about 1 month ago

.gitattributes

1.57 kB
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
.hub_write_probe.txt

2 Bytes
probe: verify write scope about 1 month ago
README.md

1.75 kB
GRPO run 2026-04-25T19:02+00:00 about 1 month ago
adapter_config.json

1.18 kB
GRPO run 2026-04-25T19:02+00:00 about 1 month ago
adapter_model.safetensors

29.5 MB
xet

GRPO run 2026-04-25T19:02+00:00 about 1 month ago
added_tokens.json

632 Bytes
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
chat_template.jinja

2.51 kB
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
merges.txt

1.67 MB
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
special_tokens_map.json

613 Bytes
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
tokenizer.json

11.4 MB
xet

GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
tokenizer_config.json

4.89 kB
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago
vocab.json

2.78 MB
GRPO run 2026-04-24T06:06+00:00 (Trained with Unsloth) about 1 month ago