Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Spaces:

TeddyYao
/

grok4-gpqa-eval

Runtime error

App Files Files Community

Fetching metadata from the HF Docker repository...

grok4-gpqa-eval / benchmarks

82.1 kB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

TeddyYao's picture

Upload 38 files

8474f02 verified 11 months ago

__pycache__
Upload 38 files 11 months ago
__init__.py

740 Bytes
Upload 38 files 11 months ago
base_benchmark.py

4.67 kB
Upload 38 files 11 months ago
evaluation_utils.py

4.93 kB
Upload 38 files 11 months ago
gpqa_benchmark.py

4.76 kB
Upload 38 files 11 months ago
gsm8k_benchmark.py

4.52 kB
Upload 38 files 11 months ago
humaneval_benchmark.py

4.85 kB
Upload 38 files 11 months ago
math_benchmark.py

4.68 kB
Upload 38 files 11 months ago
mmlu_benchmark.py

5.61 kB
Upload 38 files 11 months ago
prompt_templates.py

4.08 kB
Upload 38 files 11 months ago