zhipeng ma
mamazi00
AI & ML interests
None yet
Recent Activity
new activity about 2 months ago
haydn-jones/gemma-4-E4B-it-RL:Issue with Reinforcement Learning Fine-tuning on Gemma4 new activity about 2 months ago
google/gemma-4-31B-it:When training models from the gemma4 series using GRPO, an abnormally high grad norm was observed upvoted a paper 8 months ago
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully
Open MLLMsOrganizations
None yet