Chat spaces
Collection of chat spaces to test llm
Collection by kramp • Dec 19, 2025 • 1
- Chat with Baize 🐲 • Runtime error • 377
- Llama 2 13b Chat 🦙 • Running on Zero • Featured • 490 • Chat with Llama‑2 13B for instant AI-generated replies
- Zephyr Chat 🪁 • Paused • Featured • 900 • Chat with an AI model
- IDEFICS Playground 🐨 • Build error • Featured • 377
Large Language Models
Precompiled language models for on-device text generation.
Collection by simaai • 21 days ago
- simaai/Llama-3.2-3B-Instruct-a16w4 • Text Generation • Updated Jan 12 • 25
- simaai/Qwen3-4B-Instruct-2507-a16w4 • Text Generation • Updated Jan 12 • 5
- simaai/gemma-3-4b-it-a16w4 • Text Generation • Updated Dec 19, 2025 • 21
- simaai/Phi-3.5-mini-instruct-a16w4 • Text Generation • Updated Jan 12 • 12
Struct-SQL
Distilled Query-Plan CoT to an SLM
Collection by craterlabs • 7 days ago
- craterlabs/struct-sql-data • Viewer • Updated Jan 28 • 1.3k • 20
- craterlabs/Struct-SQL • Text Generation • 4B • Updated Jan 28 • 34
- Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL • Paper • 2512.17053 • Published Dec 18, 2025
- heegyu/bird-sql-mini-dev • Viewer • Updated Jul 26, 2024 • 500 • 38 • 1
Teprocessor.batch_decode(outputs
Collection by Lennie29 • Dec 30, 2025
- LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation • Paper • 2512.23576 • Published Dec 29, 2025 • 66
Olg
Collection by kyrgan • Jan 24
- Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation • Paper • 2511.14993 • Published Nov 19, 2025 • 233
pp
Collection by HRPBloom • 7 days ago
- zai-org/GLM-5 • Text Generation • 754B • Updated about 6 hours ago • 252k • 1.93k
- Qwen/Qwen3.5-9B • Image-Text-to-Text • 10B • Updated Mar 2 • 4.82M • 1.16k
SSAE
Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation'
Collection by TorresYang • about 1 month ago
- Miaow-Lab/SSAE-Dataset • Viewer • Updated about 1 month ago • 1.28M • 54
- Miaow-Lab/SSAE-Checkpoints • Feature Extraction • Updated about 1 month ago
- Step-Level Sparse Autoencoder for Reasoning Process Interpretation • Paper • 2603.03031 • Published Mar 3
LASIK Eye Surgery Vs Contact Lenses: Which is Right for You? If you’ve ever fumbled with contact lenses early in the morning or felt tired of depending on glasses, you’ve probably wondered if LASIK eye surgery Collection by Dranisha Dec 27, 2025 -
Math LMs
A collection of fine tuned models for better mathematics reasoning
Collection by Cannae-AI • 3 days ago
- Cannae-AI/ReasoningLlama-Math-1B-IT-gguf • Text Generation • 1B • Updated Nov 18, 2025 • 30 • 1
- Cannae-AI/GsMath-Llama-1B • Text Generation • 1B • Updated Nov 24, 2025 • 3
- Cannae-AI/ReasoningLlama-Math-1B-IT • Text Generation • 1B • Updated Nov 18, 2025 • 2
- mradermacher/GsMath-Llama-1B-GGUF • 1B • Updated Nov 19, 2025 • 25 • 1