Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
aauss
/
timebench_eval
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
fde9678
timebench_eval
279 kB
Ctrl+K
Ctrl+K
2 contributors
History:
9 commits
aauss
Update README.md
fde9678
verified
3 months ago
tests
Fix None passed to squad metric and update import in tests.
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
.gitignore
Safe
109 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
3 months ago
.python-version
Safe
5 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
3 months ago
README.md
Safe
1.96 kB
Update README.md
3 months ago
app.py
Safe
142 Bytes
Move main script out of folder.
3 months ago
pyproject.toml
Safe
271 Bytes
Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic.
3 months ago
requirements.txt
Safe
48 Bytes
Create requirements.txt
3 months ago
timebench_eval.py
Safe
10 kB
Capture Nones gracefully in historical date parsing.
3 months ago
uv.lock
Safe
254 kB
Implement TimeDial evaluation.
3 months ago