view article Article EMO: Pretraining mixture of experts for emergent modularity allenai β’ 10 days ago β’ 37
Deepfake Classification 022025 Collection based on recent dataset β’ 8 items β’ Updated 21 days ago β’ 3
Japanese Role-playing Dataset Collection ζ₯ζ¬θͺγγΌγ«γγ¬γ€η¨γγΌγΏγ»γγ β’ 17 items β’ Updated Oct 7, 2025 β’ 13
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. β’ 13 items β’ Updated Jan 2 β’ 313
shadow-peft-models Collection pretrained weights and data for the ShadowPEFT paper β’ 30 items β’ Updated 27 days ago β’ 3
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper β’ 2604.19254 β’ Published 28 days ago β’ 29
Adam's Law: Textual Frequency Law on Large Language Models Paper β’ 2604.02176 β’ Published Apr 2 β’ 503
Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips Paper β’ 2502.07408 β’ Published Apr 16 β’ 59
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 β’ 24 items β’ Updated Apr 16 β’ 58
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. β’ 2 items β’ Updated Aug 7, 2025 β’ 436
100 Coder/Programming - MOE, Reasoning, Reg, Imatrix, Fused. Collection Models (0.8B to 87B) in regular, "reasoning", "Brainstorm", MOE (1x to 8x / 128 experts), and expanded to create better and stronger code, faster. β’ 68 items β’ Updated 5 days ago β’ 32
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. β’ 278 items β’ Updated 2 days ago β’ 751