sagarchapara/qwen3-4b-thinking-aimo-numina-cot-sft Text Generation • 4B • Updated Jan 2 • 2
Article Distributed Training with JAX and Flax NNX: A Practical Guide to Sharding Mar 26, 2025 • 13
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 90
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12, 2025 • 495
sagarchapara/whisper-small-tel Automatic Speech Recognition • 2B • Updated Mar 12, 2025 • 3 • 1
Scaling test-time compute Space • Running • 596 • Run advanced search strategies to boost LLM problem solving