Reasoning's Razor: Reasoning Improves Accuracy but Can Hurt Recall at Critical Operating Points in Safety and Hallucination Detection Paper • 2510.21049 • Published Oct 23, 2025 • 3
RePanda: Pandas-powered Tabular Verification and Reasoning Paper • 2503.11921 • Published Mar 14, 2025 • 2
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 52
SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF Paper • 2411.01798 • Published Nov 4, 2024 • 8
view article Article StackLLaMA: A hands-on guide to train LLaMA with RLHF +5 edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert • Apr 5, 2023 • 48
view article Article Chat Templates: An End to the Silent Performance Killer Rocketknight1 • Oct 3, 2023 • 32