Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 190
The CoT Collection Collection Dataset and Model for "The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning" • 5 items • Updated Apr 3, 2024 • 5
Article Train and Fine-Tune Sentence Transformers Models espejelomar • Aug 10, 2022 • 17
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 183 items • Updated Jan 24, 2025 • 43
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 48
A Large Encoder-Decoder Family of Foundation Models For Chemical Language Paper • 2407.20267 • Published Jul 24, 2024 • 32
ThinK: Thinner Key Cache by Query-Driven Pruning Paper • 2407.21018 • Published Jul 30, 2024 • 32
Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings Paper • 2407.20581 • Published Jul 30, 2024 • 24