mzh12345
/

RLLaVA_coding_grpo_3b

Model card Files Files and versions

README.md exists but content is empty.

Downloads last month: 1

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mzh12345/RLLaVA_coding_grpo_3b

Base model

Qwen/Qwen2.5-VL-3B-Instruct

Finetuned

(794)

this model

Dataset used to train mzh12345/RLLaVA_coding_grpo_3b

Collection including mzh12345/RLLaVA_coding_grpo_3b

RLLaVA

An RL-central Framework for Language and Vision Assistant, which decouples algorithm logic from distributed execution, enables modular customization. • 4 items • Updated Nov 28, 2025