Article Multivariate Probabilistic Time Series Forecasting with Informer +1 elisim, nielsr, kashif • Mar 10, 2023 • 27
Article Probabilistic Time Series Forecasting with 🤗 Transformers nielsr, kashif • Dec 1, 2022 • 45
Article Evaluate Your Own RAG: Why Best Practices Failed Us charles-azam • Nov 5, 2025 • 14
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 514
Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr • Feb 11, 2025 • 121
Article Introducing RTEB: A New Standard for Retrieval Evaluation +4 fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll • Oct 1, 2025 • 144
Rated Games Dataset Collection Datasets where each row is a rated chess game • 10 items • Updated Jul 10, 2025 • 9
Positional Datasets Collection Datasets where each row is a chess position • 6 items • Updated Mar 26 • 8
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers Paper • 2509.03059 • Published Sep 3, 2025 • 25
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17, 2024 • 14
🤔 Reasoning about Reasoning Collection papers and articles about reasoning LLMs • 12 items • Updated Jun 22, 2025 • 6
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets? Paper • 2510.02209 • Published Oct 2, 2025 • 57
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108
Article When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance Nicolas-BZRD • Sep 30, 2025 • 12