🔄 In a Training Loop

4 15 47

David Andrews PRO

Broyojo

https://broyojo.com

AI & ML interests

Tranformer models, diffusion models, reinforcement learning, AI accelerators, computer architecture, VSLI

Recent Activity

liked a model about 14 hours ago

futo-org/futo-swipe

liked a model about 1 month ago

Qwen/Qwen3.6-27B

updated a model about 2 months ago

HumorR1/policy-e3-dpo-no-thinking

View all activity

Organizations

liked a model about 14 hours ago

futo-org/futo-swipe

Updated 13 days ago • 115 • 23

liked a model about 1 month ago

Qwen/Qwen3.6-27B

Image-Text-to-Text • 28B • Updated Apr 24 • 5.81M • • 1.8k

updated a model about 2 months ago

HumorR1/policy-e3-dpo-no-thinking

Updated May 1 • 4

published a model about 2 months ago

HumorR1/policy-e3-dpo-no-thinking

Updated May 1 • 4

updated 2 models about 2 months ago

HumorR1/policy-e2b-grpo-thinking

Updated May 1 • 1

HumorR1/policy-e2a-grpo-no-thinking

Updated May 1 • 1

published 2 models about 2 months ago

HumorR1/policy-e2b-grpo-thinking

Updated May 1 • 1

HumorR1/policy-e2a-grpo-no-thinking

Updated May 1 • 1

updated 2 models about 2 months ago

HumorR1/policy-e1a-sft-no-thinking

Updated May 1 • 4

HumorR1/policy-e1b-sft-thinking

Image-Text-to-Text • 2B • Updated May 1 • 4

published 2 models about 2 months ago

HumorR1/policy-e1b-sft-thinking

Image-Text-to-Text • 2B • Updated May 1 • 4

HumorR1/policy-e1a-sft-no-thinking

Updated May 1 • 4

updated a model about 2 months ago

HumorR1/rm-qwen25vl-3b-nodesc

Updated Apr 30

published a model about 2 months ago

HumorR1/rm-qwen25vl-3b-nodesc

Updated Apr 30

updated a model about 2 months ago

HumorR1/policy-qwen3vl-2b-grpo-newyorker

Updated Apr 30 • 3

published a model about 2 months ago

HumorR1/policy-qwen3vl-2b-grpo-newyorker

Updated Apr 30 • 3

updated a model about 2 months ago

HumorR1/rm-qwen25vl-3b-20k

Updated Apr 30

published a model about 2 months ago

HumorR1/rm-qwen25vl-3b-20k

Updated Apr 30

liked a dataset about 2 months ago

yguooo/newyorker_caption_ranking

Viewer • Updated Sep 15, 2024 • 2.18M • 181 • 6

upvoted a paper 2 months ago

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

Paper • 2604.14116 • Published Apr 15 • 13

David Andrews PRO

AI & ML interests

Recent Activity

Organizations

Broyojo's activity