Mike Ravkine's picture

Open to Collab

Mike Ravkine PRO

mike-ravkine

·

the-crypt-keeper

AI & ML interests

LLM Research / Development / Evaluation

Recent Activity

liked a model 9 days ago

nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16

liked a model 20 days ago

tencent/Hy3-preview

upvoted a paper 20 days ago

Large Language Models Explore by Latent Distilling

View all activity

Organizations

upvoted a paper 20 days ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published 24 days ago • 74

upvoted an article about 2 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 51

upvoted a paper about 2 months ago

Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published Mar 9 • 22

upvoted an article 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

+2

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

upvoted an article 5 months ago

Article

Case Study: The Marcus-Thorne Mystery Cache Standoff

unmodeled-tyler

•

Jan 1

• 3

upvoted a collection 5 months ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 18 items • Updated 1 day ago • 294

upvoted a paper 7 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 120

upvoted 3 articles 7 months ago

Article

Vision Tokens vs Text Tokens: Understanding the 10× Compression

onekq

•

Oct 22, 2025

• 6

Article

All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes

onekq

•

Sep 12, 2024

• 5

Article

Hall of Multimodal OCR VLMs and Demonstrations

prithivMLmods

•

Oct 31, 2025

• 8

upvoted a paper 7 months ago

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Paper • 2510.05069 • Published Oct 6, 2025 • 13

upvoted a paper 8 months ago

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5, 2025 • 47

upvoted an article 9 months ago

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

nvidia

•

Aug 18, 2025

• 32

upvoted a collection over 1 year ago

Lumimaid 0.2

4 items • Updated Jul 26, 2024 • 20

upvoted 2 papers over 1 year ago

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published Dec 16, 2024 • 36

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51

upvoted a collection over 1 year ago

My most recent datasets

6 items • Updated Oct 8, 2024 • 6

upvoted an article over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf

•

Sep 18, 2024

• 280

upvoted a collection over 1 year ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 38 items • Updated Mar 2 • 367

upvoted a paper over 1 year ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83