newmindai/QwQ-32B-r1
Text Generation
Transformers
Safetensors
newmindai/simplescaling
Turkish
llama
reinforcement-learning
rl-post-training
reward-model
dapo
simplescaling
orm
huggingface
custom-reward
trl
llm
adapter
text-generation-inference
License: apache-2.0
Branch: main
Repository: QwQ-32B-r1 (269 MB, 1 contributor)
History: 40 commits
Latest commit: zgrgr, "Update README.md" (59848a4, verified), 6 months ago
File                       Size       Storage  Last commit                                           Updated
.gitattributes             1.52 kB    -        initial commit                                        10 months ago
README.md                  3.45 kB    -        Update README.md                                      6 months ago
adapter_config.json        865 Bytes  -        Upload adapter_config.json to model checkpoint        10 months ago
adapter_model.safetensors  269 MB     xet      Upload adapter_model.safetensors to model checkpoint  10 months ago
additional_config.json     67 Bytes   -        Upload additional_config.json to model checkpoint     10 months ago
args.json                  21.6 kB    -        Upload args.json to model checkpoint                  10 months ago
config.json                376 Bytes  -        Create config.json                                    6 months ago
latest                     14 Bytes   -        Upload latest to model checkpoint                     10 months ago
rng_state_0.pth            15.4 kB    xet      Upload rng_state_0.pth to model checkpoint            10 months ago
rng_state_1.pth            15.5 kB    xet      Upload rng_state_1.pth to model checkpoint            10 months ago
rng_state_2.pth            15.5 kB    xet      Upload rng_state_2.pth to model checkpoint            10 months ago
rng_state_3.pth            15.5 kB    xet      Upload rng_state_3.pth to model checkpoint            10 months ago
scheduler.pt               1.06 kB    xet      Upload scheduler.pt to model checkpoint               10 months ago
trainer_state.json         27.8 kB    -        Upload trainer_state.json to model checkpoint         10 months ago
training_args.bin          9.4 kB     xet      Upload training_args.bin to model checkpoint          10 months ago
zero_to_fp32.py            25.3 kB    -        Upload zero_to_fp32.py to model checkpoint            10 months ago