🤝 Open to Collab

Mike Ravkine PRO

mike-ravkine

49 26 665

the-crypt-keeper

AI & ML interests

LLM Research / Development / Evaluation

Recent Activity

liked a model 1 day ago

bkhmsi/micro-llama-3b

liked a model 3 days ago

deepreinforce-ai/Ornith-1.0-397B-FP8

liked a model 21 days ago

ideogram-ai/ideogram-4-fp8

View all activity

Organizations

liked a model 1 day ago

bkhmsi/micro-llama-3b

12B • Updated Oct 15, 2025 • 5 • 2

liked a model 3 days ago

deepreinforce-ai/Ornith-1.0-397B-FP8

Text Generation • 397B • Updated 4 days ago • 4.79k • 58

liked 3 models 21 days ago

liked 2 models about 1 month ago

LatitudeGames/Equinox-31B

31B • Updated May 22 • 287 • 55

CohereLabs/command-a-plus-05-2026-fp8

Image-Text-to-Text • 219B • Updated May 27 • 6.05k • • 37

liked 2 models about 2 months ago

nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16

Text Generation • 32B • Updated May 8 • 652 • 30

tencent/Hy3-preview

Text Generation • 299B • Updated Apr 23 • 84.8k • 282

upvoted a paper about 2 months ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published Apr 27 • 74

liked 5 models 2 months ago

ibm-granite/granite-4.1-8b

9B • Updated May 4 • 293k • 202

ibm-granite/granite-4.1-30b

Text Generation • 29B • Updated May 4 • 73k • 134

poolside/Laguna-XS.2

Text Generation • 33B • Updated 11 days ago • 84.4k • 317

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 7 days ago • 1.93M • • 1.64k

SandyResearch/parcae-1.3b

Text Generation • Updated Apr 16 • 985 • 7

replied to their post 2 months ago

Some clarity is emerging:

The distribution of response lengths has shifted considerably in 3.6 and 2 of my tasks are no longer fitting into 16k, the ignorance zone blows up.

Re-running at 32k then we'll see if that extra thinking pays off or nah.

An interesting outlier here is the word-sort task where 3.6 thinks ~half as much and this costs it about 10pp of performance.

posted an update 2 months ago

Post

110

A word of warning that my initial run of 3.6 produced unfavorable results vs 3.5; performance on out-of-distribution and instruction following tasks appears to have collapsed. Potentially a vLLM 0.19 issue here, the original eval was done with their fork of 0.18. I am re-running both with nightly so we have apples to apples and will report back, curious to get to the bottom of this one.

1 reply

liked 3 models 3 months ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated Apr 20 • 1.89M • • 1.22k

LGAI-EXAONE/EXAONE-4.5-33B

Image-Text-to-Text • 34B • Updated 13 days ago • 248k • 174

nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16

Text Generation • 4B • Updated Mar 20 • 1.91M • 99

Mike Ravkine PRO

AI & ML interests

Recent Activity

Organizations

mike-ravkine's activity