Doron Adler PRO

Norod78

https://linktr.ee/Norod78

AI & ML interests

Fooling around with Generative machine learning models.

Recent Activity

liked a model 5 days ago

microsoft/FastContext-1.0-4B-SFT

reacted to eabdullin's post with 🔥 15 days ago

I’m doing a PhD in AI, which sounds impressive until you realize it mostly means I spend three years trying to make a computer say something slightly less stupid than it said yesterday. People hear "AI researcher" and they think I’m building the future. No. I’m in a basement at 2 a.m. Googling, "CUDA error what the f**k does this mean." And the worst part about AI research now is compute. You don’t even ask, "Is this idea good?" anymore. You ask, "Can I afford for this idea to be wrong?" My advisor comes to me one day and says, "I think we should fine-tune our own language model." I said, "Professor, with what money? I’m a PhD student. I have two bank accounts: checking and emotionally checking." He goes, "Don’t worry. We have compute." Now, in academia, "don’t worry" is never the beginning of a good sentence. I said, "What do you mean we have compute?" He said, "My friend knows the cluster admin. He can get us on the GPUs." I said, "Okay… what do we have to do?" He goes, "Nothing crazy. Just be very grateful in the acknowledgements." I said, "How grateful?" He said, "Maybe put him as co-author." I said, "Co-author? Are we using the cluster, or is the cluster using us?" Because at that point, that’s not a favor. That’s academic child support. So I go to the server room, and the cluster admin walks up to me and goes, "So you’re the NLP student." And in my head I’m like, "No, tonight you’re the principal investigator. You’re the provider. I’m just a little token waiting to be attended to." Because whoever controls the GPUs controls the relationship. That’s lab romance. He starts setting things up, and I’m trying to act casual, but I don’t understand any of the numbers he’s saying. He’s like, "Yeah, I can probably give you four H100s for the weekend." I’m nodding like, "Mmm. Four. Weekend. H. One hundred. Absolutely." Inside I’m like, "Is that good? Is that prison time? Why did he say it like he was offering me organs?" [Continue in comments...]

reacted to eabdullin's post with 🤗 15 days ago

upvoted a collection 26 days ago

Bonsai Image

6 items • Updated 22 days ago • 87

upvoted a collection about 2 months ago

SenseNova-U1

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 10 items • Updated 14 days ago • 74

upvoted a changelog about 2 months ago

Hugging Face Changelog

Spaces agents.md for your coding agents

Apr 17 • 343

upvoted a changelog 3 months ago

Hugging Face Changelog

Introducing hf-mount

Mar 24 • 225

upvoted a collection 4 months ago

Multimodal Implementations

Comprehensive Demo of Multimodal VLMs on the Hub • 26 items • Updated 1 day ago • 13

upvoted 4 articles 4 months ago

Article

PRX Part 3 — Training a Text-to-Image Model in 24h!

Photoroom • Mar 3 • 67

Article

We’re open-sourcing our text-to-image model and the process behind it

Photoroom • Nov 12, 2025 • 100

Article

Text-to-image Architectural Experiments

Photoroom • Nov 13, 2025 • 60

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

burtenshaw, danielhanchen, shimmyshimmer, mlabonne, davanstrien, evalstate • Feb 20 • 103

upvoted 2 collections 4 months ago

BitDance

BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 10 items • Updated Mar 2 • 11

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated Feb 17 • 75

upvoted a paper 5 months ago

Alterbute: Editing Intrinsic Attributes of Objects in Images

Paper • 2601.10714 • Published Jan 15 • 31

upvoted 2 collections 5 months ago

YOLO26 Models

YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demos and ONNX variants. • 42 items • Updated Jan 19 • 40

TranslateGemma

3 items • Updated Mar 12 • 244

upvoted 3 collections 6 months ago

CoreML

Models for Apple devices. See https://github.com/FluidInference/FluidAudio for usage details • 16 items • Updated 22 days ago • 6

👁️ LFM2.5-VL

6 items • Updated about 12 hours ago • 45

Hy-MT1.5

混元翻译模型1.5版本 • 12 items • Updated Apr 29 • 59

upvoted an article 6 months ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 112

upvoted 2 articles 7 months ago

Article

Streaming datasets: 100x More Efficient

andito, lhoestq, burtenshaw, pcuenq, merve • Oct 27, 2025 • 86

Article

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

mattt • Dec 5, 2025 • 44