Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
rendy saputra
10bodrex
Follow
0 followers
·
9 following
AI & ML interests
None yet
Recent Activity
reacted
to
IlyasMoutawwakil
's
post
with 🔥
11 days ago
Transformers v5 just landed! 🚀 It significantly unifies and reduces modeling code across architectures, while opening the door to a whole new class of performance optimizations. My favorite new feature? 🤔 The new dynamic weight loader + converter. Here’s why 👇 Over the last few months, the core Transformers maintainers built an incredibly fast weight loader, capable of converting tensors on the fly while loading them in parallel threads. This means we’re no longer constrained by how parameters are laid out inside the safetensors weight files. In practice, this unlocks two big things: - Much more modular modeling code. You can now clearly see how architectures build on top of each other (DeepSeek v2 → v3, Qwen v2 → v3 → MoE, etc.). This makes shared bottlenecks obvious and lets us optimize the right building blocks once, for all model families. - Performance optimizations beyond what torch.compile can do alone. torch.compile operates on the computation graph, but it can’t change parameter layouts. With the new loader, we can restructure weights at load time: fusing MoE expert projections, merging attention QKV projections, and enabling more compute-dense kernels that simply weren’t possible before. Personally, I'm honored to have contributed in this direction, including the work on optimizing MoE implementations and making modeling code more torch-exportable, so these optimizations can be ported cleanly across runtimes. Overall, Transformers v5 is a strong signal of where the community and industry are converging: Modularity and Performance, without sacrificing Flexibility. Transformers v5 makes its signature from_pretrained an entrypoint where you can mix and match: - Parallelism - Quantization - Custom kernels - Flash/Paged attention - Continuous batching - ... Kudos to everyone involved! I highly recommend the: Release notes: https://github.com/huggingface/transformers/releases/tag/v5.0.0 Blog post: https://huggingface.co/blog/transformers-v5
published
a dataset
16 days ago
10bodrex/DATA.69.00.Shot
published
a model
16 days ago
10bodrex/Mydellin_Stunding
View all activity
Organizations
None yet
10bodrex
's models
117
Sort:Â Recently updated
10bodrex/Alone
Updated
Dec 18, 2025
10bodrex/corNetoO
Updated
Dec 18, 2025
10bodrex/rahmonEs5
Updated
Dec 18, 2025
10bodrex/alEAle
Updated
Dec 17, 2025
10bodrex/0LG
Updated
Dec 17, 2025
10bodrex/jib0K
Updated
Dec 17, 2025
10bodrex/WiloNa4
Updated
Dec 16, 2025
10bodrex/darMuUzi
Updated
Dec 16, 2025
10bodrex/BamMb4nG
Updated
Dec 16, 2025
10bodrex/TelL3rRam
Updated
Dec 15, 2025
10bodrex/HaAik
Updated
Dec 15, 2025
10bodrex/sUuiwuw
Updated
Dec 15, 2025
10bodrex/B0nN3xX
Updated
Dec 14, 2025
10bodrex/DhanisSwAr4
Updated
Dec 14, 2025
10bodrex/yhahhHhoOlLiv
Updated
Dec 14, 2025
10bodrex/bBAakoOi
Updated
Dec 13, 2025
10bodrex/aYloLi
Updated
Dec 13, 2025
10bodrex/TjaAi
Updated
Dec 13, 2025
10bodrex/vilLeoO_1
Updated
Dec 12, 2025
10bodrex/kSar001
Updated
Dec 12, 2025
10bodrex/mnebaYou
Updated
Dec 12, 2025
10bodrex/p4Ram3Xxx
Updated
Dec 11, 2025
10bodrex/fahRiNi2332E_Lm
Updated
Dec 11, 2025
10bodrex/err_kdhu8uuU
Updated
Dec 11, 2025
10bodrex/jaaskjbbne322111
Updated
Dec 10, 2025
10bodrex/awewertyg
Text Classification
•
Updated
Dec 10, 2025
10bodrex/menolakmalas
Updated
Dec 10, 2025
Previous
1
2
3
4
Next