Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
TeddyYao
/
grok4-gpqa-eval
like
0
Runtime error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
grok4-gpqa-eval
/
benchmarks
82.1 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
TeddyYao
Upload 38 files
8474f02
verified
11 months ago
__pycache__
Upload 38 files
11 months ago
__init__.py
Safe
740 Bytes
Upload 38 files
11 months ago
base_benchmark.py
Safe
4.67 kB
Upload 38 files
11 months ago
evaluation_utils.py
Safe
4.93 kB
Upload 38 files
11 months ago
gpqa_benchmark.py
Safe
4.76 kB
Upload 38 files
11 months ago
gsm8k_benchmark.py
Safe
4.52 kB
Upload 38 files
11 months ago
humaneval_benchmark.py
Safe
4.85 kB
Upload 38 files
11 months ago
math_benchmark.py
Safe
4.68 kB
Upload 38 files
11 months ago
mmlu_benchmark.py
Safe
5.61 kB
Upload 38 files
11 months ago
prompt_templates.py
Safe
4.08 kB
Upload 38 files
11 months ago