Commit History

Fix: Correct attribute references for error detection/correction metrics
005cdbf

RGB Evaluation committed on

fix: Information Integration evaluation - handle multiple answer variants with pipe-separated format
5253a83

RGB Evaluation committed on

feat: Show all 9 LLM models in app dropdown, add comprehensive code review and metric analysis documentation
b1ccc5d

RGB Evaluation committed on

feat: Add noise level percentage to Noise Robustness cards
ca20603

RGB Evaluation committed on

fix: Convert past results section to grid layout
90b0944

RGB Evaluation committed on

fix: Remove duplicate results display section
0cfbd9f

RGB Evaluation committed on

feat: Add Moonshot Kimi K2 Instruct model to DEFAULT_MODELS
1b8d19d

RGB Evaluation committed on

fix: Add HF Spaces YAML configuration to README.md
ca5ddcb

RGB Evaluation committed on

Merge main: accept local changes with new grid layout and model updates
bdba13c

RGB Evaluation committed on

feat: Add separate grid layout for 4 RAG abilities in Streamlit UI
af25c62

RGB Evaluation committed on

Fix results file pattern matching - search for results_*.json instead of evaluation_*.json
a757c21

RGB Deployment Bot committed on

Update all LLMs to new model list with 4 default models
3d27cc5

RGB Deployment Bot committed on

Fix debug function - replace st.debug() with st.write() in expander
e7842fe

RGB Deployment Bot committed on

Add debug info to past results viewer to diagnose missing results
8796cb2

RGB Deployment Bot committed on

Replace decommissioned llama-3.1-70b-versatile with llama2-70b-4096
800ecaf

RGB Deployment Bot committed on

Reduce RPM limit to 25 (safe margin below 30) and increase request interval to 2.5s
4649c35

RGB Deployment Bot committed on

Implement RPM rate limiting (30 requests/minute) with sliding window
68627ba

RGB Deployment Bot committed on

Add deepseek-r1-distill-llama-70b free chat model
b8b9c59

RGB Deployment Bot committed on

Add persistent results storage with past results viewer
2397c87

RGB Deployment Bot committed on

Replace decommissioned mixtral-8x7b-32768 with llama-3.1-70b-versatile
dd2d90a

RGB Deployment Bot committed on

Fix continuous page refresh during background evaluation
0deb4ed

RGB Deployment Bot committed on

Add 4th model: meta-llama/llama-4-maverick-17b-128e-instruct
d68feb2

RGB Deployment Bot committed on

Replace deprecated gemma2-9b-it with mixtral-8x7b-32768
8268ab9

RGB Deployment Bot committed on

Add background evaluation and downloadable reports (CSV, JSON, PDF)
81c6a30

RGB Deployment Bot committed on

Add dataset files for HF Space
07844bd

RGB Deployment Bot committed on

Add YAML metadata to README
921aa55

RGB Deployment Bot committed on

Deploy RGB Metrics dashboard - 2026-01-04 22:05:37
3f89944

RGB Deployment Bot committed on

initial commit
c1063cf
verified

gopikrishnait committed on