Prompt_Squirrel_RAG / eval_analysis.txt
Food Desert
Add eval results for debugging
a807f94
Loaded 10 evaluation results from data\eval_results\eval_caption_cogvlm_n10_seed123_20260214_061321.jsonl
================================================================================
CRITICAL CATEGORIES
================================================================================
Body type (body_type)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
How many (count)
Constraint: exactly_one
Ground truth tags: 0
Accuracy: 1.000
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=50)
Species (species)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
================================================================================
IMPORTANT CATEGORIES
================================================================================
Clothing (clothing)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Sex/gender (gender)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Location (location)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Perspective (perspective)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Posture (posture)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
================================================================================
NICE-TO-HAVE CATEGORIES
================================================================================
Body decor (body_decor)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Breasts (breasts)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Expression (expression)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Fur style (fur_style)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Gaze (gaze)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
General activity (if any) (general_activity_if_any)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Hair (hair)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Limbs (limbs)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
================================================================================
META CATEGORIES
================================================================================
Information (information)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Picture organization (organization)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Quality/medium (quality)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Requests (requests)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Image size (resolution)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Style (style)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
Text and languages (text)
Constraint: multi
Ground truth tags: 0
Precision: 0.000
Recall: 0.000
F1: 0.000
(TP=0, FP=0, FN=0, TN=0)
================================================================================
SUMMARY
================================================================================
CRITICAL:
Total GT tags: 0
Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
Macro-avg P/R/F1: 0.000 / 0.000 / 0.000
IMPORTANT:
Total GT tags: 0
Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
Macro-avg P/R/F1: 0.000 / 0.000 / 0.000
NICE-TO-HAVE:
Total GT tags: 0
Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
Macro-avg P/R/F1: 0.000 / 0.000 / 0.000
META:
Total GT tags: 0
Micro-avg P/R/F1: 0.000 / 0.000 / 0.000
Macro-avg P/R/F1: 0.000 / 0.000 / 0.000
================================================================================