Arslan

cowgoesmoo

volf52

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago

Ornith-1.0

Collection

Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated 2 days ago • 223

liked a model 10 days ago

zai-org/GLM-5.2

Text Generation • 753B • Updated 6 days ago • 119k • 2.8k

liked 2 models about 2 months ago

Zyphra/ZAYA1-8B

9B • Updated 3 days ago • 61.4k • 578

ibm-granite/granite-4.1-8b

9B • Updated May 4 • 265k • 201

updated a collection 2 months ago

reasoning-gym

Collection

Datasets generated using https://github.com/open-thought/reasoning-gym (with Qwen3-instruct templates) • 15 items • Updated Apr 26

liked a model 2 months ago

moonshotai/MoonViT-SO-400M

Image Feature Extraction • 0.4B • Updated Apr 17, 2025 • 6.66k • 85

liked a model 3 months ago

llava-hf/llava-v1.6-mistral-7b-hf

Image-Text-to-Text • 8B • Updated Dec 22, 2025 • 724k • 310

liked 2 Spaces 3 months ago

Evaluation Guidebook

📝 Explore LLM benchmark scores over time • 330

The Smol Training Playbook

📚 The secrets to building world-class LLMs • 3.22k

liked 3 models 6 months ago

liked a model 7 months ago

NousResearch/nomos-1

Text Generation • 31B • Updated Jan 10 • 378 • 154

upvoted a collection 7 months ago

GLM-4.6

Collection

2 items • Updated Mar 2 • 54

upvoted 5 papers 7 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 454

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 95

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 96

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 270

Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2, 2025 • 25

liked a model 8 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.2M • 3.29k
