mzh12345/RLLaVA_counting_grpo_online_3b
An RL-central framework for Language and Vision Assistants that decouples algorithm logic from distributed execution and enables modular customization.