thejaminator
/

grpo-feature-vector-step-100

Text Generation

Model card Files Files and versions

grpo-feature-vector-step-100 / README.md

thejaminator's picture

verl GRPO trained model at step 100

08dd32a verified 6 months ago

|

history blame contribute delete

128 Bytes

	---
	base_model: thejaminator/qwen-hook-layer-9-merged
	library_name: peft
	tags:
	- lora
	- peft
	pipeline_tag: text-generation
	---