Less is More: Recursive Reasoning with Tiny Networks Paper β’ 2510.04871 β’ Published Oct 6, 2025 β’ 511
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 β’ 228
LongCodeZip: Compress Long Context for Code Language Models Paper β’ 2510.00446 β’ Published Oct 1, 2025 β’ 108
The Prompt Report: A Systematic Survey of Prompting Techniques Paper β’ 2406.06608 β’ Published Jun 6, 2024 β’ 68
Evaluating D-MERIT of Partial-annotation on Information Retrieval Paper β’ 2406.16048 β’ Published Jun 23, 2024 β’ 36
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent +2 Apr 22, 2024 β’ 81
In-context Learning and Gradient Descent Revisited Paper β’ 2311.07772 β’ Published Nov 13, 2023 β’ 2
π Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized β’ 135 items β’ Updated Dec 18, 2025 β’ 120
Model Merging Papers Collection Collection of relevant papers about model merging β’ 13 items β’ Updated Apr 2, 2024 β’ 6
π« StarCoder2 Collection StarCoder2 models and datasets! β’ 8 items β’ Updated Mar 1, 2024 β’ 91