Anton Vlasjuk
AntonV
54 followers · 55 following
vasqu
AI & ML interests
None yet
Recent Activity
new activity · 3 days ago
MiniMaxAI/MiniMax-M2.1: Transformers v5 support
reacted to IlyasMoutawwakil's post with 🚀 · 4 days ago
After 2 months of refinement, I'm happy to announce that a lot of Transformers' modeling code is now significantly more torch-compile & export-friendly 🔥

Why it had to be done 👇
PyTorch's Dynamo compiler is increasingly becoming the default interoperability layer for ML systems. Anything that relies on torch.export or torch.compile, from model optimization to cross-framework integrations, benefits directly when models can be captured as a single Dynamo-traced graph!

Transformers models are now easier to:
⚙️ Compile end-to-end with torch.compile backends
📦 Export reliably via torch.export and torch.onnx.export
🚀 Deploy to ONNX / ONNX Runtime, Intel Corporation's OpenVINO, NVIDIA AutoDeploy (TRT-LLM), AMD's Quark, Meta's ExecuTorch and more hardware-specific runtimes.

This work aims at unblocking entire TorchDynamo-based toolchains that rely on exporting Transformers across runtimes and accelerators. We are doubling down on Transformers' commitment to be a first-class citizen of the PyTorch ecosystem: more exportable, more optimizable, and easier to deploy everywhere.

There are definitely some edge cases that we still haven't addressed, so don't hesitate to try compiling / exporting your favorite transformers and to open issues / PRs.

PR in the comments! More updates coming soon!
new activity · 4 days ago
baidu/ERNIE-4.5-VL-28B-A3B-PT: HF v5 support
Organizations
AntonV's activity
liked a model · 7 days ago
zai-org/GLM-4.7-Flash • Text Generation • 31B • Updated 6 days ago • 450k • 1.21k
liked a model · 19 days ago
MiniMaxAI/MiniMax-M2.1 • Text Generation • 229B • Updated 3 days ago • 217k • 1.15k
liked a model · about 2 months ago
deepseek-ai/DeepSeek-V3.2-Speciale • Text Generation • 685B • Updated Dec 1, 2025 • 21.5k • 652
liked a model · 2 months ago
nari-labs/Dia2-2B • Text-to-Speech • Updated Dec 1, 2025 • 3.85k • 152
liked 5 models · 3 months ago
baidu/ERNIE-4.5-VL-28B-A3B-Thinking • Image-Text-to-Text • 30B • Updated Dec 24, 2025 • 1.25k • 516
moonshotai/Kimi-K2-Thinking • Text Generation • Updated Nov 8, 2025 • 260k • 1.64k
moonshotai/Kimi-Linear-48B-A3B-Instruct • Text Generation • 49B • Updated Dec 16, 2025 • 36.5k • 529
openai/gpt-oss-safeguard-20b • Text Generation • 22B • Updated 12 days ago • 12.8k • 182
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated Dec 11, 2025 • 15.7k • 1.51k
liked a model · 4 months ago
ai21labs/AI21-Jamba-Reasoning-3B • Text Generation • 3B • Updated Oct 8, 2025 • 4.06k • 130
liked a Space · 4 months ago
📚 Maintain the unmaintainable • Running • 75
Visualize connections between transformer models
liked 3 models · 4 months ago
deepseek-ai/DeepSeek-V3.2-Exp • Text Generation • 685B • Updated Nov 18, 2025 • 63.3k • 943
baidu/Qianfan-VL-8B • Image-Text-to-Text • 9B • Updated Sep 19, 2025 • 128 • 34
deepseek-ai/DeepSeek-V3.1-Terminus • Text Generation • 685B • Updated Sep 29, 2025 • 5.62k • 359
liked 5 models · 5 months ago
Qwen/Qwen3-Next-80B-A3B-Instruct • Text Generation • 81B • Updated Sep 17, 2025 • 1.39M • 911
baidu/ERNIE-4.5-21B-A3B-Thinking • Text Generation • 22B • Updated Nov 26, 2025 • 275 • 772
Aleph-Alpha/llama-tfree-hat-pretrained-7b-dpo • 7B • Updated Oct 22, 2025 • 132 • 10
deepseek-ai/DeepSeek-V3.1 • Text Generation • 685B • Updated Sep 5, 2025 • 67.1k • 812
Qwen/Qwen-Image-Edit • Image-to-Image • Updated Aug 25, 2025 • 44.7k • 2.29k
liked a model · 6 months ago
mistralai/Voxtral-Small-24B-2507 • Audio-Text-to-Text • 24B • Updated Dec 20, 2025 • 67.8k • 444