arxiv:2507.12284
Rodion Levichev
RLevichev
ยท
AI & ML interests
PLP, RL, Agents
Recent Activity
authored
a paper
2 days ago
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language
Models on Software Engineering Tasks
authored
a paper
2 days ago
MERA Code: A Unified Framework for Evaluating Code Generation Across
Tasks
upvoted
a
paper
18 days ago
MERA Code: A Unified Framework for Evaluating Code Generation Across
Tasks