Simon Kotchou

Simon-Kotchou

·

Simon-Kotchou

AI & ML interests

Self supervised learning, Computer vision

Recent Activity

liked a model 9 days ago

krea/Krea-2-Turbo

liked a model 9 days ago

zai-org/GLM-5.2

liked a model 16 days ago

black-forest-labs/FLUX.2-small-decoder

View all activity

Organizations

upvoted a paper about 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 168

upvoted a paper 6 months ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 65

upvoted 18 papers 7 months ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 50

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 218

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published Nov 13, 2025 • 131

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12, 2025 • 82

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published Oct 20, 2025 • 78

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Paper • 2511.09515 • Published Nov 12, 2025 • 21

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

Paper • 2511.08544 • Published Nov 11, 2025 • 12

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 98

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 70

WorldGrow: Generating Infinite 3D World

Paper • 2510.21682 • Published Oct 24, 2025 • 42

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 234

Does DINOv3 Set a New Medical Vision Standard?

Paper • 2509.06467 • Published Sep 8, 2025 • 39

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 31

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 117

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

Diversity Has Always Been There in Your Visual Autoregressive Models

Paper • 2511.17074 • Published Nov 21, 2025 • 8

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published Nov 24, 2025 • 55