Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Miaow-Lab
/
RLVR-Linearity-Checkpoints
like
0
Follow
Miaow Lab @ CityUHK
4
Text Generation
Safetensors
Miaow-Lab/RLVR-Linearity-Dataset
arxiv:
2601.04537
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
7c1c4f9
RLVR-Linearity-Checkpoints
Commit History
Delete distill-qwen-1-5b_grpo/tokenizer_config.json
7c1c4f9
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/tokenizer.json
9359a48
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/special_tokens_map.json
5756970
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/model.safetensors
a8af3ba
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/generation_config.json
af8d07c
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/config.json
f0199ad
verified
louiswng
commited on
Jan 26
Delete distill-qwen-1-5b_grpo/.DS_Store
a22e199
verified
louiswng
commited on
Jan 26
Upload folder using huggingface_hub
b521eed
verified
louiswng
commited on
Jan 26
Upload folder using huggingface_hub
f21695e
verified
louiswng
commited on
Jan 26
Upload folder using huggingface_hub
5e84a76
verified
louiswng
commited on
Jan 26
Upload folder using huggingface_hub
1098195
verified
louiswng
commited on
Jan 26
initial commit
9ceb096
verified
louiswng
commited on
Jan 26