jwergieluk
's Collections
Papers inbox
updated
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
140
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at
Any Resolution
Paper
•
2409.12191
•
Published
•
78
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper
•
2502.06329
•
Published
•
133
Competitive Programming with Large Reasoning Models
Paper
•
2502.06807
•
Published
•
69
Retrieval-augmented Large Language Models for Financial Time Series
Forecasting
Paper
•
2502.05878
•
Published
•
40
LLMs Can Easily Learn to Reason from Demonstrations Structure, not
content, is what matters!
Paper
•
2502.07374
•
Published
•
40
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem
Proving
Paper
•
2502.07640
•
Published
•
9
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time
Scaling
Paper
•
2502.06703
•
Published
•
152