Gyanateet Dutta's picture

🔄 In a Training Loop

Gyanateet Dutta

Ryukijano

·

https://ryukijano.github.io

AI & ML interests

Computer Vision, Robotics, Generative modelling, AI for Sciences.

Recent Activity

new activity 2 days ago

Ryukijano/CatCon-One-Shot-Controlnet-SD-1-5-b2:Apply for community grant: Personal project (gpu: A100 Large)

updated a Space 2 days ago

Ryukijano/CatCon-One-Shot-Controlnet-SD-1-5-b2

published a Space 2 days ago

Ryukijano/CatCon-One-Shot-Controlnet-SD-1-5-b2

View all activity

Organizations

upvoted a paper 8 days ago

Kairos: A Native World Model Stack for Physical AI

Paper • 2606.16533 • Published 11 days ago • 37

upvoted a collection 12 days ago

VLA-JEPA

VLA-JEPA model checkpoints (LIBERO, Pretrain, SimplerEnv) • 3 items • Updated 30 days ago • 14

upvoted a paper 15 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published about 1 month ago • 41

upvoted a paper about 1 month ago

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published May 12 • 68

upvoted 3 papers about 2 months ago

Generative Modeling with Orbit-Space Particle Flow Matching

Paper • 2605.02222 • Published May 4 • 9

JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation

Paper • 2510.00974 • Published Oct 1, 2025 • 1

PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations

Paper • 2505.24717 • Published May 30, 2025 • 1

upvoted 3 collections about 2 months ago

P3D

Datasets and pre-trained P3D models. • 3 items • Updated Apr 11 • 1

MolmoAct2 Models

Collection of the base models for MolmoAct2 • 6 items • Updated May 5 • 23

MolmoAct2 Datasets

Collection of robotics datasets for MolmoAct2 • 10 items • Updated 17 days ago • 13

upvoted 2 collections 2 months ago

Lyra

Project Lyra: Open Generative 3D World Models • 6 items • Updated 15 days ago • 12

NVIDIA Ising

NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI. • 4 items • Updated 15 days ago • 27

upvoted 2 collections 3 months ago

SigLino: Vision Foundation Models (SigLIP2 + DINOv3)

Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated Apr 10 • 17

WildDet3D

This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated Apr 13 • 20

upvoted a paper 3 months ago

PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling

Paper • 2504.14219 • Published Apr 19, 2025 • 2

upvoted a changelog 3 months ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

Mar 18

• 142

upvoted a collection 4 months ago

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 15 days ago • 35

upvoted 3 papers 4 months ago

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

Paper • 2512.01031 • Published Nov 30, 2025 • 27

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 186

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published Feb 19 • 62