Illuminum

Illuminum

·

illuminum2

AI & ML interests

Image to 3D

Recent Activity

liked a model about 1 month ago

prism-ml/bonsai-image-ternary-4B-mlx-2bit

liked a model about 1 month ago

depth-anything/Depth-Anything-V2-Metric-Indoor-Small-hf

liked a model about 2 months ago

tencent/HY-Motion-1.0

View all activity

Organizations

None yet

upvoted 2 collections 2 months ago

talkie-13b

talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated Apr 21 • 56

DeepSeek-V4

6 items • Updated 4 days ago • 710

upvoted a paper 4 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 158

upvoted a paper 9 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17, 2025 • 37

upvoted an article 9 months ago

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

bpan

•

Apr 9, 2024

• 30

upvoted a paper 9 months ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19, 2025 • 58

upvoted a collection 9 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 204

upvoted an article 9 months ago

Article

Understanding Vector Quantization in VQ-VAE

ariG23498

•

Aug 28, 2024

• 64

upvoted a paper 10 months ago

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 31

upvoted 4 articles 10 months ago

Article

Small Language Models (SLM): A Comprehensive Overview

jjokah

•

Feb 22, 2025

• 165

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

+1

Leyo, HugoLaurencon, VictorSanh

•

Apr 15, 2024

• 191

Article

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

+1

HugoLaurencon, Leyo, VictorSanh

•

Mar 15, 2024

• 13

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 275

upvoted a collection 10 months ago

RoRF Jina

RoRF (Routing on Random Forests) trained on Jina AI's SOTA open-source embeddings • 6 items • Updated Sep 24, 2024 • 1

upvoted a paper 10 months ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11, 2025 • 43

upvoted a collection 10 months ago

LLM papers

It is a collection of papers that are useful in studying LLM. • 14 items • Updated Apr 3, 2024 • 16

upvoted a paper 11 months ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published Jul 14, 2025 • 74