AI & ML interests
None yet
Organizations
None yet
nmcco/qwen-2.5-3b-speakertokens
Text Generation
• 3B • Updated nmcco/llama-3.2-3b-speakertokens
Text Generation
• 3B • Updated • 1
nmcco/13-oops-hp-didnt-have-enough-context
Updated
nmcco/13-pnp-hpeval-with-gemma3-4b
Updated
nmcco/gemma-3-4b-pt-with-speaker-tokens
Image-Text-to-Text
• 4B • Updated • 1
nmcco/gemma-3-1b-pt-with-speaker-tokens
Text Generation
• 1.0B • Updated nmcco/12-unbalanced-10epochs
Updated
nmcco/11-just-fixedpnp-10epochs
Updated
nmcco/10-2books-fixedpnp-10epochs
Updated
nmcco/09-training-on-bigger-pnp
Updated
nmcco/08-training-on-bigger-pnp
Updated
nmcco/07-pandp-only-testset-moreepochsand1024
Updated
nmcco/06-pandp-only-testset
Updated
nmcco/03-p-and-p-nospeakertoken
Updated
nmcco/gemma-2-2b-with-speaker-tokens-nospeaker-tok
Text Generation
• 3B • Updated nmcco/02-gemma-extra-speaker-prompt
Updated
nmcco/01-gemma-first-long-run
Updated
nmcco/18_gemma_eager_variablelength_20k10epochs_lr2e-4
Updated
nmcco/gemma-2-2b-with-speaker-tokens
Text Generation
• 3B • Updated nmcco/14_gemma_flashattn2_deepspeed_250words_lr-2e-4_100k_5epoch
Updated
nmcco/14_gemma_deepspeed_250words_lr-2e-4_100k_5epoch
Updated
nmcco/13_REAL_gemma_deepspeed_250words_lr-2e-4_100k_5epoch
Updated
nmcco/12_gemma_deepspeed_250words_lr-2e-4
Updated
nmcco/10-gemma-deepspeed-250words-1
Updated