ky666 (k) – Likes

liked a model 4 months ago

ByteDance-Seed/Seed-X-PPO-7B

Translation • 8B • Updated Jul 28, 2025 • 1.43k • 302

liked a Space 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

liked a model 9 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated 16 days ago • 1.16k • • 379

liked 2 models 11 months ago

unsloth/Qwen3-32B-unsloth-bnb-4bit

Text Generation • Updated May 14, 2025 • 6.09k • 15

unsloth/Qwen3-14B-unsloth-bnb-4bit

Text Generation • Updated May 13, 2025 • 236k • 17

liked a model 12 months ago

unsloth/GLM-Z1-32B-0414

Text Generation • 33B • Updated Jul 3, 2025 • 23 • • 1

liked 2 models about 1 year ago

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18, 2025 • 415k • • 570

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 1.12M • • 3.13k

liked a model over 1 year ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 76.7k • • 2.93k

liked a dataset over 1 year ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 586 • 225

liked 2 models over 1 year ago

microsoft/OmniParser-v2.0

Updated Mar 28, 2025 • 8.06k • 1.34k

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • 8B • Updated May 10, 2025 • 48k • 304

liked a dataset over 1 year ago

Conard/fortune-telling

Viewer • Updated Feb 17, 2025 • 207 • 404 • 171

liked a Space over 1 year ago

The Ultra-Scale Playbook

🌌

3.9k

The ultimate guide to training LLM on large GPU Clusters

liked 6 models over 1 year ago

k

AI & ML interests

Organizations

ByteDance-Seed/Seed-X-PPO-7B

The Smol Training Playbook

microsoft/UserLM-8b

unsloth/Qwen3-32B-unsloth-bnb-4bit

unsloth/Qwen3-14B-unsloth-bnb-4bit

unsloth/GLM-Z1-32B-0414

ByteDance-Seed/UI-TARS-1.5-7B

deepseek-ai/DeepSeek-V3-0324

Qwen/QwQ-32B

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

microsoft/OmniParser-v2.0

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Conard/fortune-telling

The Ultra-Scale Playbook

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

k

AI & ML interests

Organizations

ky666's activity

The Smol Training Playbook

The Ultra-Scale Playbook