Qwen3.5-Claude-Fable-5 Collection Our series of Qwen3.5 finetunes on Claude-Fable-5 outputs • 2 items • Updated 13 days ago • 5
Qwen3.5-Claude-Mythos-5 Collection A collection of our uncensored Claude Mythos fine tunes • 2 items • Updated 13 days ago • 26
ReFreeKV: Towards Threshold-Free KV Cache Compression Paper • 2502.16886 • Published 7 days ago • 45
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 22 days ago • 142
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 10 items • Updated 15 days ago • 56
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • May 8 • 38
Deepfake Classification 022025 Collection based on recent dataset • 8 items • Updated about 15 hours ago • 3
Japanese Role-playing Dataset Collection 日本語ãƒãƒ¼ãƒ«ãƒ—レイ用データセット • 17 items • Updated Oct 7, 2025 • 14
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. • 13 items • Updated Jan 2 • 330
shadow-peft-models Collection pretrained weights and data for the ShadowPEFT paper • 30 items • Updated Apr 22 • 4
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published Apr 21 • 30
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 509
Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips Paper • 2502.07408 • Published Apr 16 • 59