Arijit-07/aria-devops-llama3b

Tags: Reinforcement Learning, incident-response, curriculum-learning

Repository size: 6.54 GB

1 contributor

History: 15 commits

Latest commit: Update README.md (e1b5cb0, verified, 2 months ago)

File                              Size       Storage  Last commit                                            Updated
.gitattributes                    1.63 kB    -        Upload training_curve.png with huggingface_hub         2 months ago
README.md                         24.9 kB    -        Update README.md                                       2 months ago
adapter_config.json               1.21 kB    -        Upload adapter_config.json with huggingface_hub        2 months ago
adapter_model.safetensors         97.3 MB    xet      Upload adapter_model.safetensors with huggingface_hub  2 months ago
chat_template.jinja               3.83 kB    -        (Trained with Unsloth)                                 2 months ago
config.json                       974 Bytes  -        (Trained with Unsloth)                                 2 months ago
model-00001-of-00002.safetensors  4.97 GB    xet      (Trained with Unsloth)                                 2 months ago
model-00002-of-00002.safetensors  1.46 GB    xet      (Trained with Unsloth)                                 2 months ago
model.safetensors.index.json      20.9 kB    -        (Trained with Unsloth)                                 2 months ago
tokenizer.json                    17.2 MB    xet      (Trained with Unsloth)                                 2 months ago
tokenizer_config.json             428 Bytes  -        Upload tokenizer_config.json with huggingface_hub      2 months ago
training_curve.png                67.5 kB    -        Upload training_curve.png with huggingface_hub         2 months ago
training_log.json                 18.7 kB    -        Upload training_log.json with huggingface_hub          2 months ago
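
As a quick sanity check, the per-file sizes in the listing can be summed and compared with the 6.54 GB repository size shown at the top. The sketch below is illustrative only: the dictionary is transcribed by hand from the listing, and it assumes the page reports decimal units (1 kB = 1000 bytes), which the listing itself does not state.

```python
# Sizes transcribed from the file listing above.
LISTED_SIZES = {
    ".gitattributes": "1.63 kB",
    "README.md": "24.9 kB",
    "adapter_config.json": "1.21 kB",
    "adapter_model.safetensors": "97.3 MB",
    "chat_template.jinja": "3.83 kB",
    "config.json": "974 Bytes",
    "model-00001-of-00002.safetensors": "4.97 GB",
    "model-00002-of-00002.safetensors": "1.46 GB",
    "model.safetensors.index.json": "20.9 kB",
    "tokenizer.json": "17.2 MB",
    "tokenizer_config.json": "428 Bytes",
    "training_curve.png": "67.5 kB",
    "training_log.json": "18.7 kB",
}

# Assumption: decimal (SI) units, i.e. 1 kB = 1000 bytes.
UNIT_BYTES = {"Bytes": 1, "kB": 1000, "MB": 1000**2, "GB": 1000**3}


def to_bytes(size: str) -> float:
    """Convert a display string such as '97.3 MB' to a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_BYTES[unit]


total_bytes = sum(to_bytes(size) for size in LISTED_SIZES.values())
total_gb = total_bytes / UNIT_BYTES["GB"]
print(f"{total_gb:.2f} GB")  # prints 6.54 GB
```

The two sharded `model-*.safetensors` files account for 6.43 GB of the total; the adapter and tokenizer add roughly 0.11 GB, and the remaining files are negligible at this precision.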