L

TaidanaHito

1 230 370

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

LiquidAI/LFM2.5-8B-A1B-GGUF

liked a model about 8 hours ago

LiquidAI/LFM2.5-8B-A1B

liked a model about 9 hours ago

LiquidAI/LFM2.5-350M-GGUF

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 89

upvoted a collection 12 days ago

MiMo-V2.5

Collection

4 items • Updated Apr 27 • 90

upvoted 2 articles 12 days ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia

•

Dec 15, 2025

• 113

Article

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

nvidia

•

Mar 17

• 67

upvoted a paper 14 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 23 days ago • 148

upvoted a collection 29 days ago

Gemma 4 — DECKARD HERETIC, Multimodal & Speculators

Collection

Gemma 4 abliterated/quantized — DECKARD HERETIC 31B, SuperGemma4-26B multimodal, 26B-A4B MoE, plus EAGLE3/DFlash drafters. • 14 items • Updated 2 days ago • 8

upvoted a paper 30 days ago

HEBATRON: A Hebrew-Specialized Open-Weight Mixture-of-Experts Language Model

Paper • 2605.11255 • Published May 11 • 1

upvoted an article 30 days ago

Article

What makes good reasoning data

MiniMax-AI

•

Oct 30, 2025

• 45

upvoted 12 papers about 1 month ago

Learn from your own latents and not from tokens: A sample-complexity theory

Paper • 2605.27734 • Published May 26 • 2

TextCraftor: Your Text Encoder Can be Image Quality Controller

Paper • 2403.18978 • Published Mar 27, 2024 • 15

Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks

Paper • 2405.10122 • Published May 16, 2024 • 1

Making Multimodal Generation Easier: When Diffusion Models Meet LLMs

Paper • 2310.08949 • Published Oct 13, 2023 • 2

Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models

Paper • 2506.17707 • Published Jun 21, 2025 • 1

Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis

Paper • 2311.17126 • Published Nov 28, 2023 • 2

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Paper • 2505.16839 • Published May 22, 2025 • 14

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models

Paper • 2305.05189 • Published May 9, 2023 • 4

Training Optimal Large Diffusion Language Models

Paper • 2510.03280 • Published Sep 28, 2025 • 1

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Paper • 2512.19433 • Published Dec 22, 2025 • 4

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21, 2025 • 99

DIFFA: Large Language Diffusion Models Can Listen and Understand

Paper • 2507.18452 • Published Jul 24, 2025 • 2

L

AI & ML interests

Recent Activity

Organizations

TaidanaHito's activity

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

What makes good reasoning data