santos-sanz/vending-machine-rl-model

Reinforcement Learning
Transformers
Safetensors
Generated from Trainer
grpo
trl
vending-machine
vending-machine-rl-model
36.1 MB
  • 1 contributor
History: 16 commits
Latest commit: santos-sanz, "Move readme to qwen3-0.6 model", fdb67da (verified), 4 months ago
  • .gitattributes
    1.57 kB
    Upload tokenizer for RL-trained vending machine model 4 months ago
  • README.md
    4.74 kB
    Move readme to qwen3-0.6 model 4 months ago
  • adapter_config.json
    1.04 kB
    Upload model 4 months ago
  • adapter_model.safetensors
    20.2 MB
    xet
    Upload model 4 months ago
  • added_tokens.json
    707 Bytes
    Upload tokenizer 4 months ago
  • chat_template.jinja
    4.17 kB
    Upload tokenizer 4 months ago
  • merges.txt
    1.67 MB
    Upload tokenizer for RL-trained vending machine model 4 months ago
  • special_tokens_map.json
    613 Bytes
    Upload tokenizer 4 months ago
  • tokenizer.json
    11.4 MB
    xet
    Upload tokenizer 4 months ago
  • tokenizer_config.json
    5.4 kB
    Upload tokenizer 4 months ago
  • vocab.json
    2.78 MB
    Upload tokenizer for RL-trained vending machine model 4 months ago