Pretrain Datasets Collection Datasets we use for pretraining large language models • 13 items • Updated Jan 3 • 1
Turkish Instruction Datasets Collection Collection of instruction datasets for Turkish. • 50 items • Updated Apr 7 • 20
Article Welcome PaliGemma 2 – New vision language models by Google +2 merve, andsteing, pcuenq, ariG23498 • Dec 5, 2024 • 166
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 14
InstantID: Zero-shot Identity-Preserving Generation in Seconds Paper • 2401.07519 • Published Jan 15, 2024 • 57