scider / benchmarks /README.md
leonardklin's picture
Upload 328 files
978fed5 verified
# Benchmarks
This folder contains all the benchmarks for the project. Each benchmark is organized in its own subfolder, and includes the necessary code and data to run the benchmark.
- [AI Idea Bench 2025](https://github.com/yansheng-qiu/AI_Idea_Bench_2025): A benchmark for evaluating the performance of AI generating novel, creative, and feasible research ideas.
- [MLE-bench](https://github.com/openai/mle-bench): A benchmark for evaluating the performance of machine learning models on a variety of tasks.
- [SciCodeBench](https://github.com/scicode-bench/SciCode): A benchmark for scientific code generation and understanding.
To clone all the submodules, use the following command:
```bash
git submodule update --init --recursive
```