SSAE
  Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation'
  Collection by TorresYang • 24 days ago
  Miaow-Lab/SSAE-Dataset • Viewer • Updated 24 days ago • 1.28M • 65
  Miaow-Lab/SSAE-Checkpoints • Feature Extraction • Updated 24 days ago
  Step-Level Sparse Autoencoder for Reasoning Process Interpretation • Paper • 2603.03031 • Published 24 days ago
LASIK Eye Surgery Vs Contact Lenses: Which is Right for You?
  If you’ve ever fumbled with contact lenses early in the morning or felt tired of depending on glasses, you’ve probably wondered if LASIK eye surgery
  Collection by Dranisha • Dec 27, 2025
Math + Coding LMs
  A collection of fine-tuned models spanning mathematics, coding, and cybersecurity. Engineered for comprehensive coverage of computational reasoning.
  Collection by Cannae-AI • 6 days ago
  Cannae-AI/ReasoningLlama-Math-1B-IT-gguf • Text Generation • 1B • Updated Nov 18, 2025 • 70 • 1
  Cannae-AI/GsMath-Llama-1B • Text Generation • 1B • Updated Nov 24, 2025 • 1
  Cannae-AI/ReasoningLlama-Math-1B-IT • Text Generation • 1B • Updated Nov 18, 2025 • 3
  mradermacher/GsMath-Llama-1B-GGUF • 1B • Updated Nov 19, 2025 • 82 • 1
Struct-SQL
  Distilled Query-Plan CoT to an SLM
  Collection by craterlabs • about 1 hour ago
  craterlabs/struct-sql-data • Viewer • Updated Jan 28 • 1.3k • 25
  craterlabs/Struct-SQL • Text Generation • 4B • Updated Jan 28 • 30
  Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL • Paper • 2512.17053 • Published Dec 18, 2025
  heegyu/bird-sql-mini-dev • Viewer • Updated Jul 26, 2024 • 500 • 43 • 1
Teprocessor.batch_decode(outputs
  Collection by Lennie29 • Dec 30, 2025
  LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation • Paper • 2512.23576 • Published Dec 29, 2025 • 66
Olg
  Collection by kyrgan • Jan 24
  Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation • Paper • 2511.14993 • Published Nov 19, 2025 • 233
AI
  Collection by bigpappic • Dec 17, 2025
  openai/gdpval • Viewer • Updated Feb 10 • 220 • 34.1k • 477
  deepseek-ai/DeepSeek-OCR • Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.8M • 3.19k
  Running 1 The Ultimate Blank Canvas App 🎨 👁 1 Track and visualize device locations on a map
PeptideCLM-2
  Official models and data for the PeptideCLM-2 paper, featuring MLM, MTR, and Hybrid architectures.
  Collection by aaronfeller • Feb 3
  aaronfeller/PeptideMLM_sm • Updated Dec 23, 2025 • 277
  aaronfeller/PeptideMLM_base • Updated Dec 23, 2025 • 301
  aaronfeller/PeptideMLM_lg • Updated Dec 23, 2025 • 294
  aaronfeller/PeptideMLM-MTR_sm • Updated Dec 23, 2025 • 204