Spaces:

AI4Research
/

scider

Sleeping

Upload 328 files

978fed5 verified about 2 months ago

744 Bytes

	# Benchmarks

	This folder contains all the benchmarks for the project. Each benchmark is organized in its own subfolder, and includes the necessary code and data to run the benchmark.

	- [AI Idea Bench 2025](https://github.com/yansheng-qiu/AI_Idea_Bench_2025): A benchmark for evaluating the performance of AI generating novel, creative, and feasible research ideas.
	- [MLE-bench](https://github.com/openai/mle-bench): A benchmark for evaluating the performance of machine learning models on a variety of tasks.
	- [SciCodeBench](https://github.com/scicode-bench/SciCode): A benchmark for scientific code generation and understanding.

	To clone all the submodules, use the following command:

	```bash
	git submodule update --init --recursive
	```