nmcco

nmcco's models (126)

nmcco/qwen-2.5-3b-speakertokens

Text Generation • 3B • Updated Mar 17, 2025 • 4

nmcco/llama-3.2-3b-speakertokens

Text Generation • 3B • Updated Mar 17, 2025 • 4

nmcco/13-oops-hp-didnt-have-enough-context

Updated Mar 17, 2025

nmcco/13-pnp-hpeval-with-gemma3-4b

Updated Mar 16, 2025

nmcco/gemma-3-4b-pt-with-speaker-tokens

Image-Text-to-Text • 4B • Updated Mar 16, 2025 • 2

nmcco/gemma-3-1b-pt-with-speaker-tokens

Text Generation • 1.0B • Updated Mar 16, 2025 • 6

nmcco/12-unbalanced-10epochs

Updated Mar 14, 2025

nmcco/11-just-fixedpnp-10epochs

Updated Mar 13, 2025

nmcco/10-2books-fixedpnp-10epochs

Updated Mar 13, 2025

nmcco/09-training-on-bigger-pnp

Updated Mar 12, 2025

nmcco/08-training-on-bigger-pnp

Updated Mar 12, 2025

nmcco/07-pandp-only-testset-moreepochsand1024

Updated Mar 12, 2025

nmcco/06-pandp-only-testset

Updated Mar 12, 2025

nmcco/05-2books-testset

Updated Mar 11, 2025

nmcco/04-2books-03

Updated Mar 11, 2025

nmcco/04-2books

Updated Mar 11, 2025

nmcco/03-p-and-p-nospeakertoken

Updated Feb 26, 2025

nmcco/gemma-2-2b-with-speaker-tokens-nospeaker-tok

Text Generation • 3B • Updated Feb 24, 2025 • 3

nmcco/02-gemma-extra-speaker-prompt

Updated Feb 20, 2025

nmcco/01-gemma-first-long-run

Updated Feb 19, 2025

nmcco/18_gemma_eager_variablelength_20k10epochs_lr2e-4

Updated Feb 19, 2025

nmcco/gemma-2-2b-with-speaker-tokens

Text Generation • 3B • Updated Feb 18, 2025 • 16

nmcco/17_gemma_eager

Updated Feb 14, 2025

nmcco/16_gemma_flash

Updated Feb 14, 2025

nmcco/15_flashattn_gemma

Updated Feb 14, 2025

nmcco/14_gemma_flashattn2_deepspeed_250words_lr-2e-4_100k_5epoch

Updated Feb 13, 2025

nmcco/14_gemma_deepspeed_250words_lr-2e-4_100k_5epoch

Updated Feb 13, 2025

nmcco/13_REAL_gemma_deepspeed_250words_lr-2e-4_100k_5epoch

Updated Feb 12, 2025

nmcco/12_gemma_deepspeed_250words_lr-2e-4

Updated Feb 11, 2025

nmcco/10-gemma-deepspeed-250words-1

Updated Feb 10, 2025