arxiv:2605.20552
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
liked a dataset 2 days ago
ulamai/verified-research-reasoning-trajectories authored a paper about 1 month ago
Spectral bandits for smooth graph functions with applications in recommender systems updated a dataset about 1 month ago
misovalko/my-research-papers