Arijit-07/aria-devops-llama3b

Reinforcement Learning
Safetensors
PyTorch
llama
openenv
devops
incident-response
rl-environment
multi-agent
llm-agent
grpo
curriculum-learning
huggingface
meta
aria-devops-llama3b
6.54 GB
  • 1 contributor
History: 15 commits
Arijit-07
Update README.md
e1b5cb0 verified 14 days ago
  • .gitattributes (1.63 kB): "Upload training_curve.png with huggingface_hub", 19 days ago
  • README.md (24.9 kB): "Update README.md", 14 days ago
  • adapter_config.json (1.21 kB): "Upload adapter_config.json with huggingface_hub", 17 days ago
  • adapter_model.safetensors (97.3 MB, xet): "Upload adapter_model.safetensors with huggingface_hub", 17 days ago
  • chat_template.jinja (3.83 kB): "(Trained with Unsloth)", 19 days ago
  • config.json (974 Bytes): "(Trained with Unsloth)", 19 days ago
  • model-00001-of-00002.safetensors (4.97 GB, xet): "(Trained with Unsloth)", 19 days ago
  • model-00002-of-00002.safetensors (1.46 GB, xet): "(Trained with Unsloth)", 19 days ago
  • model.safetensors.index.json (20.9 kB): "(Trained with Unsloth)", 19 days ago
  • tokenizer.json (17.2 MB, xet): "(Trained with Unsloth)", 19 days ago
  • tokenizer_config.json (428 Bytes): "Upload tokenizer_config.json with huggingface_hub", 17 days ago
  • training_curve.png (67.5 kB, xet): "Upload training_curve.png with huggingface_hub", 17 days ago
  • training_log.json (18.7 kB): "Upload training_log.json with huggingface_hub", 19 days ago