Raushan Turganbay

RaushanTurganbay

·

zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

new activity 3 days ago

transformers-community/group-beam-search:Possible performance issue: group beam search appears to prefill identical prefixes once per beam

updated a model 4 days ago

RaushanTurganbay/kimi2.7-processor

published a model 4 days ago

RaushanTurganbay/kimi2.7-processor

View all activity

Organizations

upvoted a changelog 20 days ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7

• 148

upvoted an article 20 days ago

Article

Introducing Serge: GitHub-Native AI Code Review

huggingface

•

24 days ago

• 12

upvoted 2 articles 21 days ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 417

Article

Unlocking asynchronicity in continuous batching

+1

ror, pcuenq, ariG23498

•

May 14

• 61

upvoted an article 27 days ago

Article

Designing the hf CLI as an agent-optimized way to work with the Hub

celinah, Wauplin

•

Jun 4

• 59

upvoted a paper 27 days ago

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published Dec 18, 2025 • 49

upvoted a collection about 1 month ago

Toto-2.0

5 items • Updated May 11 • 36

upvoted an article about 1 month ago

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

+3

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

May 29

• 132

upvoted a paper about 1 month ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

upvoted an article about 1 month ago

Article

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

nvidia

•

May 23

• 34

upvoted an article about 2 months ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

May 8

• 38

upvoted a paper 3 months ago

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published Apr 9 • 73

upvoted 2 articles 3 months ago

Article

Building a Fast Multilingual OCR Model with Synthetic Data

nvidia

•

Apr 17

• 34

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 73

upvoted a collection 3 months ago

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 45

upvoted 2 papers 3 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 149

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 23

upvoted an article 3 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 168

upvoted a paper 4 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 171

upvoted an article 5 months ago

Article

Custom Kernels for All from Codex and Claude

+2

burtenshaw, sayakpaul, ariG23498, evalstate

•

Feb 13

• 80