LM Provers

Team
community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

lewtun  submitted a paper about 15 hours ago
Single-minus gluon tree amplitudes are nonzero
aviralku  updated a model 2 days ago
lm-provers/QED-Nano
lewtun  published a model 2 days ago
lm-provers/QED-Nano
View all activity

cfahlgren1 
posted an update 8 months ago
view post
Post
938
I ran the Anthropic Misalignment Framework for a few top models and added it to a dataset: cfahlgren1/anthropic-agentic-misalignment-results

You can read the reasoning traces of the models trying to blackmail the user and perform other actions. It's very interesting!!

cfahlgren1 
posted an update 9 months ago
cfahlgren1 
posted an update 9 months ago
lewtun 
posted an update 11 months ago
view post
Post
4349
Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊CodeForces-CoTs: new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3
  • 1 reply
·