Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
raniero
/
dpo_test_600
like
0
Safetensors
LoRA
bittensor
gradients
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Submission for task
dpo_test_600
Submission for task
dpo_test_600
Fine-tuned using LoRA on dynamic dataset.
Task ID:
test_dpo_456
Repo:
dpo_test_600
SHA256: 1e91a7172c06bf9f6ac56f9e611ae2d080f6e7572064640dc8c97ce7979652c0
Timestamp: 2025-08-01T00:20:02.514383
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for
raniero/dpo_test_600
Base model
meta-llama/Llama-2-7b-hf
Finetuned
(
1088
)
this model