SAINTHALF
/

layra-v1-hybrid

ethnopharmacology

Eval Results (legacy)

Model card Files Files and versions

LAYRA: Large Academic Visual RAG (v1.1)

This model entry represents the optimized version of the LAYRA retrieval system, featuring LLM-based reranking.

Performance Summary (v1.1)

Strategy: Hybrid Retrieval (ColQwen + BM25) $ ightarrow$ LLM Reranking (Qwen3-VL-Cloud)
Recall@20: 1.00 (Perfect retrieval on Kanna Gold Set)
MRR@20: 0.740 (Significant improvement over v1.0 baseline of 0.365)
Recall@1: 0.600 (Correct answer is #1 result 60% of the time)

System Configuration

Visual Encoder: vidore/colqwen2.5-v0.2
Vector DB: Milvus 2.6
Fusion: RRF (Candidate Pool: 500)
Reranker: qwen3-vl-235b-instruct-cloud (Listwise Ranking)

Evaluation

Metrics verified on 2025-12-31 using the SAINTHALF/layra-kanna-goldset repository.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Evaluation results

Recall@20 on LAYRA Kanna Gold Set
test set self-reported

1.000
MRR@20 (SOTA) on LAYRA Kanna Gold Set
test set self-reported

0.740
Recall@1 on LAYRA Kanna Gold Set
test set self-reported

0.600
Average Latency (s) on LAYRA Kanna Gold Set
test set self-reported

1.500