Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Arijit-07
/
aria-devops-llama3b
like
0
Reinforcement Learning
Safetensors
PyTorch
llama
openenv
devops
incident-response
rl-environment
multi-agent
llm-agent
grpo
curriculum-learning
huggingface
meta
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
aria-devops-llama3b
6.54 GB
Ctrl+K
Ctrl+K
1 contributor
History:
15 commits
Arijit-07
Update README.md
e1b5cb0
verified
2 months ago
.gitattributes
Safe
1.63 kB
Upload training_curve.png with huggingface_hub
2 months ago
README.md
Safe
24.9 kB
Update README.md
2 months ago
adapter_config.json
Safe
1.21 kB
Upload adapter_config.json with huggingface_hub
2 months ago
adapter_model.safetensors
97.3 MB
xet
Upload adapter_model.safetensors with huggingface_hub
2 months ago
chat_template.jinja
Safe
3.83 kB
(Trained with Unsloth)
2 months ago
config.json
Safe
974 Bytes
(Trained with Unsloth)
2 months ago
model-00001-of-00002.safetensors
4.97 GB
xet
(Trained with Unsloth)
2 months ago
model-00002-of-00002.safetensors
1.46 GB
xet
(Trained with Unsloth)
2 months ago
model.safetensors.index.json
Safe
20.9 kB
(Trained with Unsloth)
2 months ago
tokenizer.json
Safe
17.2 MB
xet
(Trained with Unsloth)
2 months ago
tokenizer_config.json
Safe
428 Bytes
Upload tokenizer_config.json with huggingface_hub
2 months ago
training_curve.png
67.5 kB
xet
Upload training_curve.png with huggingface_hub
2 months ago
training_log.json
Safe
18.7 kB
Upload training_log.json with huggingface_hub
2 months ago