jwergieluk 's Collections Papers inbox
updated
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published • 140
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at
Any Resolution
Paper
• 2409.12191
• Published • 79
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper
• 2502.06329
• Published • 133
Competitive Programming with Large Reasoning Models
Paper
• 2502.06807
• Published • 69
Retrieval-augmented Large Language Models for Financial Time Series
Forecasting
Paper
• 2502.05878
• Published • 40
LLMs Can Easily Learn to Reason from Demonstrations Structure, not
content, is what matters!
Paper
• 2502.07374
• Published • 40
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem
Proving
Paper
• 2502.07640
• Published • 9
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time
Scaling
Paper
• 2502.06703
• Published • 153