Airbus5717's picture

3 8 41

Airbus5717

airbus5717

·

Airbus5717

AI & ML interests

None yet

Recent Activity

liked a model about 6 hours ago

deepseek-ai/DeepSeek-OCR-2

upvoted a collection 1 day ago

liked a model 1 day ago

cerebras/MiniMax-M2.1-REAP-172B-A10B

View all activity

Organizations

upvoted a collection 1 day ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 26 items • Updated 3 days ago • 97

upvoted an article 7 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

752

upvoted a collection 8 months ago

Llama Nemotron

Open, Production-ready Enterprise Models • 12 items • Updated 1 day ago • 77

upvoted a collection 10 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 216

upvoted a paper 12 months ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 75

upvoted a paper about 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 437

upvoted a collection over 1 year ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 654