Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Spaces:
UII-AI
/
MedVidBench-Leaderboard
Running

App Files Files Community
1
Fetching metadata from the HF Docker repository...
MedVidBench-Leaderboard / evaluation
245 kB
Ctrl+K
Ctrl+K
  • 4 contributors
History: 44 commits
MedGRPO Team
update
1a7ba72 2 months ago
  • llm_judge
    Add server-side LLM judge for caption evaluation 6 months ago
  • README.md
    11.5 kB
    update 6 months ago
  • dataset_utils.py
    3.09 kB
    Copy evaluation scripts to leaderboard and clean up template code 6 months ago
  • eval_caption_llm_judge.py
    21.7 kB
    fix issues 3 months ago
  • eval_cvs_assessment.py
    14 kB
    fix issues 3 months ago
  • eval_dvc.py
    22.1 kB
    fix issues 3 months ago
  • eval_next_action.py
    22.6 kB
    fix issues 3 months ago
  • eval_skill_assessment.py
    15.5 kB
    fix issues 3 months ago
  • eval_stg.py
    12.5 kB
    fix issues 3 months ago
  • eval_tal.py
    11.3 kB
    fix issues 3 months ago
  • evaluate_all_pai.py
    36.9 kB
    fix issues 3 months ago
  • evaluate_predictions.py
    17.9 kB
    update 2 months ago
  • extract_predictions.py
    2.74 kB
    upload prediction only 6 months ago
  • merge_predictions_with_gt.py
    5.9 kB
    upload prediction only 6 months ago
  • test_evaluation.sh
    5.34 kB
    update 6 months ago