newmindai/QwQ-32B-r1

Text Generation
Transformers
Safetensors
Turkish
llama
reinforcement-learning
rl-post-training
reward-model
dapo
simplescaling
orm
huggingface
custom-reward
trl
llm
adapter
text-generation-inference
QwQ-32B-r1
269 MB
  • 1 contributor
History: 40 commits
zgrgr
Update README.md
59848a4 verified 6 months ago
  • .gitattributes
    1.52 kB
    initial commit 10 months ago
  • README.md
    3.45 kB
    Update README.md 6 months ago
  • adapter_config.json
    865 Bytes
    Upload adapter_config.json to model checkpoint 10 months ago
  • adapter_model.safetensors
    269 MB
    xet
    Upload adapter_model.safetensors to model checkpoint 10 months ago
  • additional_config.json
    67 Bytes
    Upload additional_config.json to model checkpoint 10 months ago
  • args.json
    21.6 kB
    Upload args.json to model checkpoint 10 months ago
  • config.json
    376 Bytes
    Create config.json 6 months ago
  • latest
    14 Bytes
    Upload latest to model checkpoint 10 months ago
  • rng_state_0.pth
    15.4 kB
    xet
    Upload rng_state_0.pth to model checkpoint 10 months ago
  • rng_state_1.pth
    15.5 kB
    xet
    Upload rng_state_1.pth to model checkpoint 10 months ago
  • rng_state_2.pth
    15.5 kB
    xet
    Upload rng_state_2.pth to model checkpoint 10 months ago
  • rng_state_3.pth
    15.5 kB
    xet
    Upload rng_state_3.pth to model checkpoint 10 months ago
  • scheduler.pt
    1.06 kB
    xet
    Upload scheduler.pt to model checkpoint 10 months ago
  • trainer_state.json
    27.8 kB
    Upload trainer_state.json to model checkpoint 10 months ago
  • training_args.bin
    9.4 kB
    xet
    Upload training_args.bin to model checkpoint 10 months ago
  • zero_to_fp32.py
    25.3 kB
    Upload zero_to_fp32.py to model checkpoint 10 months ago