JaheimLee

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

ManniX-ITA/Qwen3.6-27B-Omnimerge-v4

new activity 3 months ago

Jackrong/Qwopus3.5-27B-v3:MTP Speculation

new activity 4 months ago

Sehyo/Qwen3.5-397B-A17B-NVFP4:missing think tag

liked a model about 1 month ago

ManniX-ITA/Qwen3.6-27B-Omnimerge-v4

Image-Text-to-Text • 28B • Updated May 22 • 81 • 14

New activity in Jackrong/Qwopus3.5-27B-v3 3 months ago

MTP Speculation

👍 1

#11 opened 3 months ago by memtalow

New activity in Sehyo/Qwen3.5-397B-A17B-NVFP4 4 months ago

missing think tag

#2 opened 4 months ago by fouvy

liked 3 datasets 5 months ago

liked a model 6 months ago

QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ

Text Generation • 162B • Updated Jan 5 • 99 • 3

New activity in 0xSero/GLM-4.7-185B-W4A16 6 months ago

REAP-55 quant version

👍 2

#7 opened 6 months ago by JaheimLee

liked a model 7 months ago

RESMP-DEV/Qwen3-Next-80B-A3B-Thinking-NVFP4

Text Generation • Updated Oct 11, 2025 • 82 • 10

liked 3 Spaces 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

FineWeb: decanting the web for the finest text data at scale

🍷

1.37k

Explore and download the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.9k

The ultimate guide to training LLM on large GPU Clusters

New activity in DevQuasar/Qwen.Qwen3-Next-80B-A3B-Instruct-FP8 9 months ago

VLLM compatibility?

#1 opened 9 months ago by aidendle94

liked a model 10 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17, 2025 • 259k • 1.03k

New activity in cyankiwi/GLM-4.5-Air-AWQ-4bit 11 months ago

Does this actually work with VLLM?

#1 opened 11 months ago by sirus

liked a model 11 months ago

Multiverse4FM/Multiverse-32B

Text Generation • 33B • Updated Jun 13, 2025 • 14 • 10

liked a model 12 months ago

tencent/Hunyuan-A13B-Instruct-GPTQ-Int4

Text Generation • 80B • Updated Jul 11, 2025 • 176 • 51

liked a model about 1 year ago

Tongyi-Zhiwen/QwenLong-L1-32B-AWQ

33B • Updated May 29, 2025 • 18 • 10

New activity in Qwen/Qwen3-32B-FP8 about 1 year ago

Is this a QAT model?

#2 opened about 1 year ago by Downtown-Case

liked a model about 1 year ago

RedHatAI/Qwen3-32B-FP8-dynamic

Text Generation • 33B • Updated May 13, 2025 • 2.22k • 15
