---
license: apache-2.0
base_model:
- Qwen/Qwen3-4B
datasets:
- Kearm/Acc_Qwen_4B_Dataset
---

# Model Card for Acc Qwen 4B

Acc Qwen 4B is a state-of-the-art accessibility model trained with GRPO reinforcement learning. Training used RM_R1-style Chain of Rubric distillation of Claude 4 Opus into Qwen 3 4B, using Gemini 2.5 Flash, over 18 million tokens.

The training code is available at https://github.com/Nottlespike/Accessible_Qwen
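
For readers unfamiliar with GRPO, the core idea is that each sampled completion is scored relative to the other completions drawn for the same prompt. The sketch below illustrates only that group-relative normalization step. It is not taken from the training repository: the function name, the `eps` parameter, and the rubric scores are hypothetical, and the choice of population standard deviation is an assumption (implementations differ).

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of completions for the same prompt.

    Each advantage is (reward - group mean) / (group std + eps).
    Population standard deviation is an assumption made for this sketch.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rubric scores for four sampled completions of one prompt.
rubric_scores = [0.2, 0.9, 0.5, 0.4]
advantages = group_relative_advantages(rubric_scores)
```

Completions that score above the group mean receive a positive advantage and are reinforced; those below the mean receive a negative one. When every completion in a group gets the same score, all advantages are zero and that group contributes no learning signal.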