Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
aauss
/
timebench_eval
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
edd0a90
timebench_eval
279 kB
Ctrl+K
Ctrl+K
2 contributors
History:
6 commits
aauss
Fix None passed to squad metric and update import in tests.
edd0a90
4 months ago
tests
Fix None passed to squad metric and update import in tests.
4 months ago
.gitattributes
Safe
1.52 kB
initial commit
4 months ago
.gitignore
Safe
109 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
.python-version
Safe
5 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
README.md
Safe
1.97 kB
Fix None passed to squad metric and update import in tests.
4 months ago
app.py
Safe
142 Bytes
Move main script out of folder.
4 months ago
pyproject.toml
Safe
271 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
4 months ago
timebench_eval.py
Safe
9.95 kB
Fix None passed to squad metric and update import in tests.
4 months ago
uv.lock
Safe
254 kB
Implement TimeDial evaluation.
4 months ago