Spaces: FoodDesert/Prompt_Squirrel_RAG

Commit History

Food Desert committed 25 days ago

Food Desert committed 28 days ago

Food Desert committed on Feb 25

Food Desert committed on Feb 24

Food Desert committed on Feb 22

FoodDesert committed on Feb 20

Claude committed on Feb 14

Claude committed on Feb 14

Claude committed on Feb 12

Claude committed on Feb 12

Claude committed on Feb 11

Claude committed on Feb 11

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Claude committed on Feb 10

Switch Stage3 to explicit-only no-why selection, drop bear probe, and set k=1 defaults 06a3c46

Consolidate pending pipeline, structural, and analysis updates 30bedf0

Remove dead retrieval/display helpers and simplify debug paths 5188881

Simplify Stage3 chunking to interleave-only and add eval diagnostics 3c18372

Add eval audit tools, caption-evident set, and logging 73f56cf

Record non-fatal pipeline issues in eval JSONL outputs 41dd600

Fix eval_categorized.py to work with eval_pipeline.py output 435bff3

Add ranking metrics infrastructure to eval pipeline 0ed7e94

Add per-tag evidence tracking and wiki extraction script 019823a

Add structural tag inference (Stage 3s) and compact eval output a16e111

Default min_why to strong_implied; add retrieval gap analysis script 4968635

Normalize GT annotations: expand implications, exclude non-evaluable tags 14e5c38

Add tag implication expansion (fox→canine→canid→mammal) eeada1d

Fix min_why not passed to workers in parallel eval mode 054dd0f

Add --min-why threshold to filter Stage 3 selections by confidence level 09a248d

Add diagnostic eval metrics, why-distribution tracking, and generic character filter 349b999

Add parallel processing to eval pipeline with ThreadPoolExecutor 12dfa28

Add independent character tag metrics to eval pipeline f1b4da2

Improve eval harness: shuffle samples, always write results 133d74c

Add end-to-end evaluation harness for pipeline metrics 6909d06
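
The tag implication expansion named in commit eeada1d (fox→canine→canid→mammal) can be pictured as a transitive closure over a table of direct implications. The sketch below is illustrative only and is not taken from the repository: the `IMPLICATIONS` table and the `expand_implications` function are hypothetical names, and the only data used is the chain quoted in the commit message.

```python
from collections import deque

# Hypothetical implication table (not from the repository): each tag maps
# to the tags it directly implies, using the chain from the commit message.
IMPLICATIONS = {
    "fox": ["canine"],
    "canine": ["canid"],
    "canid": ["mammal"],
}


def expand_implications(tags, implications):
    """Return the input tags plus every tag they transitively imply."""
    expanded = set(tags)
    queue = deque(tags)
    while queue:
        tag = queue.popleft()
        for implied in implications.get(tag, ()):
            # The membership check also guards against cycles in the table.
            if implied not in expanded:
                expanded.add(implied)
                queue.append(implied)
    return expanded


print(sorted(expand_implications(["fox"], IMPLICATIONS)))
# prints ['canid', 'canine', 'fox', 'mammal']
```

Applying the same expansion to both predicted and ground-truth tags, as commit 14e5c38 suggests for the annotations, would keep a prediction of `fox` from being penalized for omitting `mammal`; whether the repository does exactly this is an assumption.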
