view post Post 4333 I am very sad to say that the budget in creating of SnowflakeCore-G1 1b and 7b MoE models ran out and I can't pre-train them anymore. See translation
view post Post 786 the training for SnowflakeCore-G1-1B and 7B would be retaken because now I implemented DeepSpeed and management to use two gpus. See translation
Liquid Claude Liquid Claude is a small series of LiquidAI/LFM2.5-1.2B-Thinking model that have been fine tuned on Claude chats/data. FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 Text Generation • 1B • Updated 14 days ago • 1.2k • 2 FlameF0X/LFM2.5-1.2B-Distilled-Claude Text Generation • 1B • Updated 23 days ago • 2.04k • 1 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF 1B • Updated 14 days ago • 3.4k • 4 FlameF0X/LFM2.5-1.2B-Distilled-Claude-GGUF Text Generation • 1B • Updated 27 days ago • 1.38k
NanoSR FlameF0X/NanoSR-6x Image-to-Image • Updated 4 days ago • 2 FlameF0X/NanoSR-ResNet Image-to-Text • Updated 4 days ago FlameF0X/NanoSR Viewer • Updated 5 days ago • 1.6k • 68 • 1
Liquid Claude Liquid Claude is a small series of LiquidAI/LFM2.5-1.2B-Thinking model that have been fine tuned on Claude chats/data. FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 Text Generation • 1B • Updated 14 days ago • 1.2k • 2 FlameF0X/LFM2.5-1.2B-Distilled-Claude Text Generation • 1B • Updated 23 days ago • 2.04k • 1 FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF 1B • Updated 14 days ago • 3.4k • 4 FlameF0X/LFM2.5-1.2B-Distilled-Claude-GGUF Text Generation • 1B • Updated 27 days ago • 1.38k
NanoSR FlameF0X/NanoSR-6x Image-to-Image • Updated 4 days ago • 2 FlameF0X/NanoSR-ResNet Image-to-Text • Updated 4 days ago FlameF0X/NanoSR Viewer • Updated 5 days ago • 1.6k • 68 • 1