Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
aauss
/
timebench_eval
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
5470154
timebench_eval
278 kB
Ctrl+K
Ctrl+K
2 contributors
History:
5 commits
aauss
Correct the expected datatype.
5470154
verified
4 months ago
tests
Implement TimeDial evaluation.
4 months ago
.gitattributes
Safe
1.52 kB
initial commit
4 months ago
.gitignore
Safe
109 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
.python-version
Safe
5 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
README.md
236 Bytes
initial commit
4 months ago
app.py
Safe
142 Bytes
Move main script out of folder.
4 months ago
pyproject.toml
Safe
271 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
timebench_eval.py
Safe
9.95 kB
Correct the expected datatype.
4 months ago
uv.lock
Safe
254 kB
Implement TimeDial evaluation.
4 months ago