view article Article DeepFabric: Generate, Train and Evaluate with Datasets curated for Model Behavior Training. lukehinds โข Dec 4, 2025 โข 9
view article Article mmBERT: ModernBERT goes Multilingual +4 mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme โข Sep 9, 2025 โข 147
Energy-Based Transformers are Scalable Learners and Thinkers Paper โข 2507.02092 โข Published Jul 2, 2025 โข 70