AI & ML interests
None yet
Organizations
None yet
nmcco/23-truly-classic-2e4-just5epochs
Updated
nmcco/32-gemma3-packing-2e5-15epochs
Updated
nmcco/31-gemma3-packing-2e4-10epochs
Updated
nmcco/30-gemma3-packing-3e4-3epochs
Updated
nmcco/31-classic-just-3-epochs-3e5
Updated
nmcco/31-classic-idgi-why-doesnt-it-work-at-fewer-epochs-1e4
Updated
nmcco/30-gemma3-packing-3e4-10epochs
Updated
nmcco/29-classic-again-save-this-1e4
Updated
nmcco/28-gemma3-packing-1e4-20epochs
Updated
nmcco/28-classic-just-lowerLR
Updated
nmcco/27-gemma3-packing-1e4-15epochs
Updated
nmcco/24-gemma3-PADRIGHT-alsonoquant-alsonopacking-1e-4-bf16
Updated
nmcco/25-classic-2e-5-lr-but50epochslol
Updated
nmcco/24-gemma3-PADRIGHT-alsonoquant-alsonopacking-1e-4-flash-bf16
Updated
nmcco/24-gemma3-alsonoquant-alsonopacking-1e-4-flash-bf16
Updated
nmcco/gemma-3-4b-with-speaker-tokens
Image-Text-to-Text
• 4B • Updated • 1
nmcco/classic-verify-things-work
Updated
nmcco/22-classic-but-lower-lr-and-40-length
Updated
nmcco/21-classic-but-much-lower-lr
Updated
nmcco/19-3books-classic-halfLR
Updated
nmcco/20-qwen-1gpu-tenepochs
Updated
nmcco/gemma-2-27b-speakertokens
Text Generation
• 27B • Updated nmcco/16-llama3.2-3b-flashattn2-batch8-balanced_vs_hp
Updated
nmcco/14-llama3.2-3b-balanced_vs_hp
Updated
nmcco/15-qwen2.5-3b-balanced_vs_hp
Updated