Hugging Face
Arijit-07/aria-devops-llama3b
Tags: Reinforcement Learning, Safetensors, PyTorch, llama, openenv, devops, incident-response, rl-environment, multi-agent, llm-agent, grpo, curriculum-learning, huggingface, meta
License: apache-2.0
Branch: main
Repository size: 6.54 GB
Contributors: 1
History: 15 commits
Latest commit: "Update README.md" by Arijit-07 (e1b5cb0, verified), 14 days ago
| File | Scan | Size | Storage | Last commit message | Updated |
| --- | --- | --- | --- | --- | --- |
| .gitattributes | Safe | 1.63 kB | | Upload training_curve.png with huggingface_hub | 19 days ago |
| README.md | Safe | 24.9 kB | | Update README.md | 14 days ago |
| adapter_config.json | Safe | 1.21 kB | | Upload adapter_config.json with huggingface_hub | 17 days ago |
| adapter_model.safetensors | Safe | 97.3 MB | xet | Upload adapter_model.safetensors with huggingface_hub | 17 days ago |
| chat_template.jinja | Safe | 3.83 kB | | (Trained with Unsloth) | 19 days ago |
| config.json | Safe | 974 Bytes | | (Trained with Unsloth) | 19 days ago |
| model-00001-of-00002.safetensors | Safe | 4.97 GB | xet | (Trained with Unsloth) | 19 days ago |
| model-00002-of-00002.safetensors | Safe | 1.46 GB | xet | (Trained with Unsloth) | 19 days ago |
| model.safetensors.index.json | Safe | 20.9 kB | | (Trained with Unsloth) | 19 days ago |
| tokenizer.json | Safe | 17.2 MB | xet | (Trained with Unsloth) | 19 days ago |
| tokenizer_config.json | Safe | 428 Bytes | | Upload tokenizer_config.json with huggingface_hub | 17 days ago |
| training_curve.png | Safe | 67.5 kB | xet | Upload training_curve.png with huggingface_hub | 17 days ago |
| training_log.json | Safe | 18.7 kB | | Upload training_log.json with huggingface_hub | 19 days ago |