Spaces:
Sleeping
Sleeping
| # Benchmarks | |
| This folder contains all the benchmarks for the project. Each benchmark is organized in its own subfolder, and includes the necessary code and data to run the benchmark. | |
| - [AI Idea Bench 2025](https://github.com/yansheng-qiu/AI_Idea_Bench_2025): A benchmark for evaluating the performance of AI generating novel, creative, and feasible research ideas. | |
| - [MLE-bench](https://github.com/openai/mle-bench): A benchmark for evaluating the performance of machine learning models on a variety of tasks. | |
| - [SciCodeBench](https://github.com/scicode-bench/SciCode): A benchmark for scientific code generation and understanding. | |
| To clone all the submodules, use the following command: | |
| ```bash | |
| git submodule update --init --recursive | |
| ``` | |