ILSP Greek Evaluation Suite Collection A collection of test sets for evaluating base and chat LLMs (incl. VLMs) on Greek generation and understanding capabilities • 23 items • Updated 12 days ago • 7
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published May 21 • 33
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 13
Granite 4.0 Nano Language Models Collection Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated Apr 29 • 102