Spaces: UII-AI / MedVidBench-Leaderboard (Running)

MedVidBench-Leaderboard / evaluation
245 kB
  • 4 contributors
History: 44 commits
Latest commit: MedGRPO Team, "update", 1a7ba72, 18 days ago
  • llm_judge
    Add server-side LLM judge for caption evaluation 4 months ago
  • README.md
    11.5 kB
    update 4 months ago
  • dataset_utils.py
    3.09 kB
    Copy evaluation scripts to leaderboard and clean up template code 4 months ago
  • eval_caption_llm_judge.py
    21.7 kB
    fix issues about 1 month ago
  • eval_cvs_assessment.py
    14 kB
    fix issues about 1 month ago
  • eval_dvc.py
    22.1 kB
    fix issues about 1 month ago
  • eval_next_action.py
    22.6 kB
    fix issues about 1 month ago
  • eval_skill_assessment.py
    15.5 kB
    fix issues about 1 month ago
  • eval_stg.py
    12.5 kB
    fix issues about 1 month ago
  • eval_tal.py
    11.3 kB
    fix issues about 1 month ago
  • evaluate_all_pai.py
    36.9 kB
    fix issues about 1 month ago
  • evaluate_predictions.py
    17.9 kB
    update 18 days ago
  • extract_predictions.py
    2.74 kB
    upload prediction only 4 months ago
  • merge_predictions_with_gt.py
    5.9 kB
    upload prediction only 4 months ago
  • test_evaluation.sh
    5.34 kB
    update 4 months ago