Loaded 10 evaluation results from data\eval_results\eval_caption_cogvlm_n10_seed123_20260214_061321.jsonl

================================================================================
CRITICAL CATEGORIES
================================================================================

Body type (body_type)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

How many (count)
  Constraint: exactly_one
  Ground truth tags: 0
  Accuracy: 1.000
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=50)

Species (species)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

================================================================================
IMPORTANT CATEGORIES
================================================================================

Clothing (clothing)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Sex/gender (gender)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Location (location)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Perspective (perspective)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Posture (posture)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

================================================================================
NICE-TO-HAVE CATEGORIES
================================================================================

Body decor (body_decor)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Breasts (breasts)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Expression (expression)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Fur style (fur_style)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Gaze (gaze)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

General activity (if any) (general_activity_if_any)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Hair (hair)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Limbs (limbs)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

================================================================================
META CATEGORIES
================================================================================

Information (information)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Picture organization (organization)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Quality/medium (quality)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Requests (requests)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Image size (resolution)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Style (style)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

Text and languages (text)
  Constraint: multi
  Ground truth tags: 0
  Precision: 0.000  Recall: 0.000  F1: 0.000
  (TP=0, FP=0, FN=0, TN=0)

================================================================================
SUMMARY
================================================================================

CRITICAL:
  Total GT tags: 0
  Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
  Macro-avg P/R/F1: 0.000 / 0.000 / 0.000

IMPORTANT:
  Total GT tags: 0
  Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
  Macro-avg P/R/F1: 0.000 / 0.000 / 0.000

NICE-TO-HAVE:
  Total GT tags: 0
  Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
  Macro-avg P/R/F1: 0.000 / 0.000 / 0.000

META:
  Total GT tags: 0
  Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
  Macro-avg P/R/F1: 0.000 / 0.000 / 0.000

================================================================================
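
Note on reading the figures above: the sketch below shows one way the reported precision, recall, F1, accuracy, and micro/macro averages could be derived from the TP/FP/FN/TN counts. It is an illustration only, not the evaluator's actual code. The function names (`prf1`, `accuracy`, `micro_macro`) are hypothetical, and the convention of returning 0.0 when a denominator is zero is an assumption chosen because it reproduces the 0.000 values printed for categories with no counts.

```python
def prf1(tp, fp, fn):
    """Precision, recall, F1 from raw counts.

    Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def accuracy(tp, fp, fn, tn):
    """Share of correct decisions; 0.0 when there are no decisions at all."""
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


def micro_macro(counts):
    """counts: list of (tp, fp, fn) tuples, one per category.

    Micro-average pools the counts before computing the metrics;
    macro-average (assumed here) is the unweighted mean of per-category metrics.
    """
    pooled = tuple(sum(c[i] for c in counts) for i in range(3))
    micro = prf1(*pooled)
    per_category = [prf1(*c) for c in counts]
    n = len(per_category)
    if n:
        macro = tuple(sum(m[i] for m in per_category) / n for i in range(3))
    else:
        macro = (0.0, 0.0, 0.0)
    return micro, macro


if __name__ == "__main__":
    # Counts taken from the report: every category has TP=FP=FN=0.
    print(prf1(0, 0, 0))
    # The "count" category: TN=50 and nothing else.
    print(accuracy(0, 0, 0, 50))
    # Three all-zero categories, as in the CRITICAL group.
    print(micro_macro([(0, 0, 0)] * 3))
```

Under these assumptions, a category with TP=0, FP=0, FN=0 yields 0.000 for all three metrics, while TN=50 alone yields an accuracy of 1.000, which matches the "How many (count)" entry.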