Spaces: UII-AI/MedVidBench-Leaderboard
main
MedVidBench-Leaderboard/evaluation (245 kB)
4 contributors
History: 44 commits
Latest commit: MedGRPO Team, "update" (1a7ba72), 18 days ago
Name                          Size     Scan  Updated            Last commit message
llm_judge                     -        -     4 months ago       Add server-side LLM judge for caption evaluation
README.md                     11.5 kB  Safe  4 months ago       update
dataset_utils.py              3.09 kB  Safe  4 months ago       Copy evaluation scripts to leaderboard and clean up template code
eval_caption_llm_judge.py     21.7 kB  Safe  about 1 month ago  fix issues
eval_cvs_assessment.py        14 kB    Safe  about 1 month ago  fix issues
eval_dvc.py                   22.1 kB  Safe  about 1 month ago  fix issues
eval_next_action.py           22.6 kB  Safe  about 1 month ago  fix issues
eval_skill_assessment.py      15.5 kB  Safe  about 1 month ago  fix issues
eval_stg.py                   12.5 kB  Safe  about 1 month ago  fix issues
eval_tal.py                   11.3 kB  Safe  about 1 month ago  fix issues
evaluate_all_pai.py           36.9 kB  Safe  about 1 month ago  fix issues
evaluate_predictions.py       17.9 kB  Safe  18 days ago        update
extract_predictions.py        2.74 kB  Safe  4 months ago       upload prediction only
merge_predictions_with_gt.py  5.9 kB   Safe  4 months ago       upload prediction only
test_evaluation.sh            5.34 kB  Safe  4 months ago       update