An RL-central Framework for Language and Vision Assistant, which decouples algorithm logic from distributed execution, enables modular customization.
ZiHao Ma
mzh12345
·
AI & ML interests
None yet
Recent Activity
liked
a dataset
1 day ago
UnipatAI/BabyVision
updated
a model
13 days ago
mzh12345/RLLaVA_search_grpo_3b
updated
a model
13 days ago
mzh12345/RLLaVA_coding_grpo_3b