Spaces: InternScience/SGI-Bench-Leaderboard
SGI-Bench-Leaderboard/eval-queue/sgi-bench
5.43 kB, 3 contributors, History: 1 commit
Latest commit: unknown, "update", 33e428a, 14 days ago
File                                                         Scan  Size       Commit  Updated
Claude-Opus-4.1_eval_request_False_float16_Original.json     Safe  307 Bytes  update  14 days ago
Claude-Sonnet-4.5_eval_request_False_float16_Original.json   Safe  309 Bytes  update  14 days ago
GPT-4.1_eval_request_False_float16_Original.json             Safe  299 Bytes  update  14 days ago
GPT-4o_eval_request_False_float16_Original.json              Safe  298 Bytes  update  14 days ago
GPT-5.1_eval_request_False_float16_Original.json             Safe  299 Bytes  update  14 days ago
GPT-5_eval_request_False_float16_Original.json               Safe  297 Bytes  update  14 days ago
Gemini-2.5-Flash_eval_request_False_float16_Original.json    Safe  308 Bytes  update  14 days ago
Gemini-2.5-Pro_eval_request_False_float16_Original.json      Safe  306 Bytes  update  14 days ago
Gemini-3-Pro_eval_request_False_float16_Original.json        Safe  304 Bytes  update  14 days ago
Grok-4_eval_request_False_float16_Original.json              Safe  298 Bytes  update  14 days ago
Intern-S1-mini_eval_request_False_float16_Original.json      Safe  304 Bytes  update  14 days ago
Intern-S1_eval_request_False_float16_Original.json           Safe  299 Bytes  update  14 days ago
Llama-4-Scout_eval_request_False_float16_Original.json       Safe  303 Bytes  update  14 days ago
Qwen3-8B_eval_request_False_float16_Original.json            Safe  298 Bytes  update  14 days ago
Qwen3-Max_eval_request_False_float16_Original.json           Safe  299 Bytes  update  14 days ago
Qwen3-VL-235B-A22B_eval_request_False_float16_Original.json  Safe  308 Bytes  update  14 days ago
o3_eval_request_False_float16_Original.json                  Safe  294 Bytes  update  14 days ago
o4-mini_eval_request_False_float16_Original.json             Safe  299 Bytes  update  14 days ago
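
The file names in this directory all share one shape: a model name, the literal `_eval_request_`, then three underscore-separated tokens (`False`, `float16`, `Original`) and a `.json` extension. The sketch below splits a name into those parts. The field labels `flag`, `precision`, and `weight_type` are assumptions made for illustration; the listing does not say what the three tokens mean, and the helper `parse_eval_request_name` is hypothetical rather than part of the Space's code.

```python
import re
from typing import NamedTuple, Optional


class EvalRequestName(NamedTuple):
    """Parts of an eval-request file name (field labels are assumed, not documented)."""
    model: str
    flag: str         # e.g. "False"; meaning not stated in the listing
    precision: str    # e.g. "float16"
    weight_type: str  # e.g. "Original"


# The model name may contain hyphens and dots, so it is matched greedily up to
# the literal "_eval_request_" marker; the three trailing tokens contain no underscores.
_PATTERN = re.compile(
    r"^(?P<model>.+)_eval_request_"
    r"(?P<flag>[^_]+)_(?P<precision>[^_]+)_(?P<weight_type>[^_]+)\.json$"
)


def parse_eval_request_name(filename: str) -> Optional[EvalRequestName]:
    """Return the parsed parts, or None if the name does not follow the pattern."""
    match = _PATTERN.match(filename)
    if match is None:
        return None
    return EvalRequestName(
        match["model"], match["flag"], match["precision"], match["weight_type"]
    )


if __name__ == "__main__":
    print(parse_eval_request_name(
        "Qwen3-VL-235B-A22B_eval_request_False_float16_Original.json"
    ))
```

Returning `None` for names that do not match keeps the helper safe to run over a directory that may also hold unrelated files.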