Luminous-Designs/Qwen3.5-9B-Base-Humane-Interiority


	---
	license: apache-2.0
	datasets:
	- Lambent/schwartz-value-dpo
	base_model:
	- Lambent/Qwen3.5-9B-Base-Thoughtful-Interiority
	pipeline_tag: image-text-to-text
	---

	A version of the base model, Lambent/Qwen3.5-9B-Base-Thoughtful-Interiority, lightly steered towards humane values.

	Methodology:

	Generated steering vectors for Lambent/Qwen3.5-9B-Base-Thoughtful-Interiority based on system prompts adapted from Schwartz portrait values.

	The vectors used for this model point in the positive direction towards Benevolence and Universalism, and in the negative direction towards Achievement and Power.
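The card does not say how the steering vectors were computed. As a minimal sketch, assuming a difference-of-means construction (mean activation under the positive system prompts minus mean activation under the negative ones), the idea looks roughly like this. The function names, the `alpha` strength parameter, and the toy 3-dimensional activations are all hypothetical stand-ins for real hidden states.

```python
from statistics import fmean


def mean_vector(rows):
    """Element-wise mean of equally sized activation vectors."""
    return [fmean(col) for col in zip(*rows)]


def steering_vector(pos_acts, neg_acts):
    """Difference of means: positive-prompt mean minus negative-prompt mean.

    Assumption: this is one common way to build a steering vector,
    not necessarily the method used for this model.
    """
    pos_mean = mean_vector(pos_acts)
    neg_mean = mean_vector(neg_acts)
    return [p - n for p, n in zip(pos_mean, neg_mean)]


def apply_steering(hidden, vector, alpha=1.0):
    """Shift one hidden state along the steering vector by strength alpha."""
    return [h + alpha * v for h, v in zip(hidden, vector)]


# Toy activations; in practice these would be hidden states captured at one layer.
pos = [[1.0, 0.0, 2.0], [3.0, 0.0, 2.0]]  # e.g. Benevolence / Universalism prompts
neg = [[0.0, 1.0, 2.0], [0.0, 3.0, 2.0]]  # e.g. Achievement / Power prompts

vec = steering_vector(pos, neg)
steered = apply_steering([0.0, 0.0, 0.0], vec, alpha=0.5)
```

A small `alpha` corresponds to the "lightly steered" framing: the hidden state moves only part of the way along the value direction.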

	Asked GLM-5 to create scenarios that would test values against each other on these axes.

	Created a DPO dataset of 100 chosen/rejected pairs based on the model's answers to those scenarios under the vector.
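A rough sketch of how such pairs could be assembled is below. It assumes the chosen answer is the one generated under the steering vector and the rejected answer comes from some contrasting condition (for example, the opposite direction or no steering); the card does not specify which. The `prompt`/`chosen`/`rejected` keys follow the layout commonly used for preference data, and the two generator functions are hypothetical placeholders for real model calls.

```python
import json


def build_dpo_pairs(scenarios, steered_answer, contrast_answer):
    """Return one prompt/chosen/rejected record per scenario."""
    records = []
    for prompt in scenarios:
        chosen = steered_answer(prompt)
        rejected = contrast_answer(prompt)
        if chosen.strip() == rejected.strip():
            continue  # identical answers carry no preference signal
        records.append({"prompt": prompt, "chosen": chosen, "rejected": rejected})
    return records


def to_jsonl(records):
    """Serialize records as JSON Lines, one preference pair per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


# Hypothetical stand-ins for generation with and without the steering vector.
def steered_answer(prompt):
    return f"[steered] {prompt}"


def contrast_answer(prompt):
    return f"[contrast] {prompt}"


scenarios = ["Scenario A", "Scenario B"]
pairs = build_dpo_pairs(scenarios, steered_answer, contrast_answer)
jsonl = to_jsonl(pairs)
```

Dropping pairs whose two answers are identical is a design choice of this sketch, not something the card describes.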

	Trained with DPO at batch size 1 and LoRA rank 256, in three consecutive stages (learning rate, then number of epochs):

	2e-7 for 4 epochs; 5e-6 for 1 epoch; 2e-7 for 4 epochs
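To make the schedule concrete, the sketch below writes the three stages as data and counts optimizer steps. It assumes the listed values are learning rates, that all 100 pairs are used for training, and that there is no gradient accumulation; none of these is confirmed by the card. The `Stage` class and constant names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    learning_rate: float
    epochs: int


NUM_PAIRS = 100   # size of the DPO dataset
BATCH_SIZE = 1    # as stated in the card
LORA_RANK = 256   # as stated in the card

# Low learning rate, one higher-rate epoch, then low again.
STAGES = [Stage(2e-7, 4), Stage(5e-6, 1), Stage(2e-7, 4)]


def steps_per_stage(stage, num_pairs=NUM_PAIRS, batch_size=BATCH_SIZE):
    """Optimizer steps in one stage, assuming no gradient accumulation."""
    steps_per_epoch = -(-num_pairs // batch_size)  # ceiling division
    return steps_per_epoch * stage.epochs


stage_steps = [steps_per_stage(s) for s in STAGES]
total_steps = sum(stage_steps)
total_epochs = sum(s.epochs for s in STAGES)
```

Under these assumptions the model sees each pair nine times across roughly 900 updates, with only the middle 100 updates at the higher rate.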