·
AI & ML interests
None yet
Recent Activity
Organizations
None yet
Image-Text-to-Text
• 0.5B • Updated
• 1
Erland/nanoVLM_0116-175256
Updated
Erland/nanoVLM-momh-pretrain
Updated
Erland/nanoVLM-base-pretrain
Updated
Erland/kda-340M-4096-batch16-steps10000-20251229-225359
Updated
Erland/kda-340M-4096-batch16-steps10000-20251229-223742
Updated
Erland/gpt_oss_sink-340M-4096-batch16-steps10000-20251228-191448
Updated
Erland/gpt_oss_sink-340M-4096-batch16-steps10000.amd-20251228-192204
Updated
Erland/gpt_oss_sink-340M-4096-batch16-steps10000-20251228-190112
Updated
Erland/relusoftpick1-340M-4096-batch16-steps10000-20251225-190221
Updated
Erland/softmax_plus_one-340M-4096-batch16-steps100000-20251225-184810
Updated
Erland/relusoftpick1-340M-4096-batch16-steps10000-20251225-185550
Updated
Erland/softmax_plus_one-340M-4096-batch16-steps100000-20251225-180218
Updated
Erland/gated_attention-340M-4096-batch16-steps100000-20251223-203939
Updated
Erland/relusoftpick1-340M-4096-batch16-steps100000-20251222-134550
Updated
Erland/vanillaFT-gsm8k-math-1.8B-4096-model-5epochs
2B • Updated
Erland/vanillaFT-gsm8k-math-1.8B-4096-model
2B • Updated
Erland/GemmaFT-gsm8k-math-270M-4096-model
Text Generation
• 0.3B • Updated
Erland/vanillaFT-gsm8k-1.8B-4096-model
2B • Updated
Erland/vanillaFT-gsm8k-340M-4096-model
0.4B • Updated
• 3
Erland/softpick-120M-4096-model
0.1B • Updated
• 1
Erland/sst0.9-120M-4096-model
0.1B • Updated
Erland/sst0.7-120M-4096-model
0.1B • Updated
• 1
0.3B • Updated
• 5
Erland/sst0.3-120M-4096-model
0.1B • Updated
Erland/vanilla-120M-4096-model
0.1B • Updated
Erland/LlaMA-3.2-1B-Instruct
Text Generation
• 1B • Updated
• 9
Text Generation
• 0.3B • Updated
0.1B • Updated
Erland/mtp-120M-4096-batch16-steps100000-20250613-111312
Updated