# Benchmarks This folder contains all the benchmarks for the project. Each benchmark is organized in its own subfolder, and includes the necessary code and data to run the benchmark. - [AI Idea Bench 2025](https://github.com/yansheng-qiu/AI_Idea_Bench_2025): A benchmark for evaluating the performance of AI generating novel, creative, and feasible research ideas. - [MLE-bench](https://github.com/openai/mle-bench): A benchmark for evaluating the performance of machine learning models on a variety of tasks. - [SciCodeBench](https://github.com/scicode-bench/SciCode): A benchmark for scientific code generation and understanding. To clone all the submodules, use the following command: ```bash git submodule update --init --recursive ```