Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
aauss
/
timebench_eval
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
timebench_eval
279 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 9 commits
aauss's picture
aauss
Update README.md
fde9678 verified 3 months ago
  • tests
    Fix None passed to squad metric and update import in tests. 3 months ago
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • .gitignore
    109 Bytes
    Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic. 3 months ago
  • .python-version
    5 Bytes
    Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic. 3 months ago
  • README.md
    1.96 kB
    Update README.md 3 months ago
  • app.py
    142 Bytes
    Move main script out of folder. 3 months ago
  • pyproject.toml
    271 Bytes
    Implement and test metric for TempReason, TimeQA, MenatQA and Date Arithmetic. 3 months ago
  • requirements.txt
    48 Bytes
    Create requirements.txt 3 months ago
  • timebench_eval.py
    10 kB
    Capture Nones gracefully in historical date parsing. 3 months ago
  • uv.lock
    254 kB
    Implement TimeDial evaluation. 3 months ago