wang

zhaokai

1 4 17

gklab

AI & ML interests

None yet

Recent Activity

upvoted a collection 9 days ago

DeepSpec

upvoted a collection 2 months ago

DeepSeek-V4

liked a dataset 4 months ago

karpathy/tinystories-gpt4-clean

View all activity

Organizations

liked a dataset 4 months ago

karpathy/tinystories-gpt4-clean

Viewer • Updated Feb 8 • 2.73M • 2.06k • 80

liked 2 models about 1 year ago

Menlo/Jan-nano

Text Generation • 4B • Updated Jul 4, 2025 • 448 • • 507

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26, 2025 • 2.92M • 905

liked a dataset over 1 year ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 656 • 767

liked a Space over 1 year ago

The Ultra-Scale Playbook

🌌

3.93k

The ultimate guide to training LLM on large GPU Clusters

liked 3 models over 1 year ago

liked 3 models almost 2 years ago

microsoft/Phi-3.5-MoE-instruct

Text Generation • 42B • Updated Dec 10, 2025 • 163k • 574

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • 8B • Updated Jan 12, 2025 • 575k • 546

meta-llama/Prompt-Guard-86M

Text Classification • 0.3B • Updated Nov 12, 2025 • 3.88M • • 351

liked a model about 2 years ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • 9B • Updated Jan 15, 2025 • 16.7k • 1.41k

liked a Space about 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

liked a dataset over 2 years ago

Skywork/SkyPile-150B

Viewer • Updated Dec 7, 2023 • 1.76M • 9.97k • 407

liked a model over 2 years ago

SkunkworksAI/phi-2

Text Generation • 3B • Updated Dec 13, 2023 • 170 • 132

liked 2 models almost 3 years ago

huggyllama/llama-30b

Text Generation • 33B • Updated Apr 7, 2023 • 857 • 48

huggyllama/llama-65b

Text Generation • 65B • Updated Apr 7, 2023 • 835 • 78