Open to Work

Tolga Cangöz

tolgacangoz

standard_ai

AI & ML interests

AIGC

Recent Activity

upvoted a paper about 21 hours ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

upvoted a paper 8 days ago

WorldJen: An End-to-End Multi-Dimensional Benchmark for Generative Video Models

liked a dataset 8 days ago

ik6626/WorldJen-benchmarking-subsystem

View all activity

Organizations

upvoted a paper about 21 hours ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 2 days ago • 53

upvoted a paper 8 days ago

WorldJen: An End-to-End Multi-Dimensional Benchmark for Generative Video Models

Paper • 2605.03475 • Published 11 days ago • 8

upvoted an article 21 days ago

Article

Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts

NucleusAI

•

Apr 14

• 11

upvoted a paper 21 days ago

Speculative Decoding for Autoregressive Video Generation

Paper • 2604.17397 • Published 27 days ago • 11

upvoted an article about 1 month ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 51

upvoted a collection about 1 month ago

Modular Pipelines

Collection

Diffusers Modular Pipeline repositories • 7 items • Updated Feb 20 • 2

upvoted an article about 1 month ago

Article

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

YiYiXu, OzzyGT, dn6, sayakpaul

•

Mar 5

• 51

upvoted 2 papers about 2 months ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published Mar 19 • 10

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

upvoted a collection about 2 months ago

AutoGaze

Collection

7 items • Updated Mar 19 • 9

upvoted 3 papers about 2 months ago

ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA

Paper • 2603.10256 • Published Mar 10 • 23

LiteAttention: A Temporal Sparse Attention for Diffusion Transformers

Paper • 2511.11062 • Published Nov 14, 2025 • 33

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

upvoted 2 articles about 2 months ago

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

cbensimon, sayakpaul, linoyts, multimodalart

•

Sep 2, 2025

• 77

Article

Fast LoRA inference for Flux with Diffusers and PEFT

sayakpaul, BenjaminB

•

Jul 23, 2025

• 54

upvoted a collection about 2 months ago

LTX-2.3

Collection

LTX-2.3 base models, quantized models and accompanying LoRAs and IC-LoRAs • 10 items • Updated 5 days ago • 49

upvoted 3 articles 2 months ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

kashif, stas

•

Mar 9

• 28

Article

Text-to-image Architectural Experiments

Photoroom

•

Nov 13, 2025

• 57

Article

PRX Part 3 — Training a Text-to-Image Model in 24h!

Photoroom

•

Mar 3

• 64

upvoted a paper 3 months ago

Generative Modeling via Drifting

Paper • 2602.04770 • Published Feb 4 • 5

Tolga Cangöz

AI & ML interests

Recent Activity

Organizations

tolgacangoz's activity

Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts

How I contributed a new model to the Transformers library using Codex

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Fast LoRA inference for Flux with Diffusers and PEFT

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Text-to-image Architectural Experiments

PRX Part 3 — Training a Text-to-Image Model in 24h!