# Benchmarks

This folder contains the project's benchmarks. Each benchmark lives in its own subfolder, which includes the code and data needed to run it.

- [AI Idea Bench 2025](https://github.com/yansheng-qiu/AI_Idea_Bench_2025): A benchmark for evaluating how well AI systems generate novel, creative, and feasible research ideas.
- [MLE-bench](https://github.com/openai/mle-bench): A benchmark for evaluating how well AI agents perform machine learning engineering tasks.
- [SciCode](https://github.com/scicode-bench/SciCode): A benchmark for scientific code generation and understanding.

To initialize and fetch all the submodules (including nested ones) after cloning this repository, run:

```bash
git submodule update --init --recursive
```
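
To confirm that the command above populated every benchmark, a small check along these lines may help. It is only a sketch: `list_uninitialized_submodules` is a hypothetical helper, not part of this repository, and it assumes submodule paths contain no spaces. It relies on `git submodule status` prefixing uninitialized entries with `-`.

```bash
# Hypothetical helper (not part of this repository): reads `git submodule status`
# output on stdin and prints the path of every submodule that is not yet
# initialized. Git marks those entries with a leading "-".
list_uninitialized_submodules() {
  awk '/^-/ { print $2 }'
}

# Run from the repository root. No output means every submodule is initialized.
git submodule status --recursive | list_uninitialized_submodules
```

For a fresh checkout, `git clone --recurse-submodules <repository-url>` (the URL is a placeholder) clones the repository and sets up its submodules in one step.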