DatPySci
/
RLVR-SGDM-Gap
Safetensors
d29b9f4
1.52 TB
1 contributor
History:
43 commits
DatPySci
Add files using upload-large-folder tool
d29b9f4
verified
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz128
Add files using upload-large-folder tool
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz16
Add files using upload-large-folder tool
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz256
Add files using upload-large-folder tool
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz32
Add files using upload-large-folder tool
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz512
Add files using upload-large-folder tool
about 2 months ago
Llama-3.2-3B-Instruct-polaris-GRPO--bsz64
Add files using upload-large-folder tool
about 2 months ago
Qwen2.5-3B-Instruct-polaris-AdamW-GRPO
Upload folder using huggingface_hub
2 months ago
.gitattributes
11 kB
Add files using upload-large-folder tool
about 2 months ago