thejaminator/grpo-feature-vector-step-100
Latest commit by thejaminator: verl GRPO trained model at step 100 (08dd32a, verified, 6 months ago)
---
base_model: thejaminator/qwen-hook-layer-9-merged
library_name: peft
tags:
- lora
- peft
pipeline_tag: text-generation
---
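
The metadata above describes a LoRA adapter for the PEFT library, built on the base model `thejaminator/qwen-hook-layer-9-merged`. The card gives no usage instructions, so the following is only a minimal sketch of how such an adapter is typically attached to its base model. It assumes that the base repository contains a causal-LM checkpoint and a tokenizer, and that `transformers` and `peft` are installed. The helper name `load_adapter` is hypothetical and not part of the repository.

```python
# Repository ids taken from the model card metadata.
BASE_MODEL_ID = "thejaminator/qwen-hook-layer-9-merged"   # base_model
ADAPTER_ID = "thejaminator/grpo-feature-vector-step-100"  # this adapter repo


def load_adapter(base_id: str = BASE_MODEL_ID, adapter_id: str = ADAPTER_ID):
    """Hypothetical helper: load the base model and attach the LoRA adapter.

    Calling this downloads weights from the Hugging Face Hub.
    """
    # Imported lazily so the constants can be used without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the base repo ships both a tokenizer and a causal-LM checkpoint.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)

    # Wrap the base model with the adapter weights stored in this repository.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return model, tokenizer
```

With the model and tokenizer returned by `load_adapter()`, text can be generated in the usual `transformers` way, for example by tokenizing a prompt with `return_tensors="pt"` and passing the result to `model.generate`.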