Commit History

Cleanup temp file
645ae0c
verified

lbartoszcze commited on

Trigger rebuild after removing flagged content
4105a2a
verified

lbartoszcze commited on

Remove sample_responses_url to clear abuse flag
391c076
verified

lbartoszcze commited on

Remove flagged responses file
22db096
verified

lbartoszcze commited on

Add meta-llama/Llama-3.2-1B-Instruct baseline submission
592d4b5
verified

lbartoszcze commited on

Add detailed responses for Llama-3.2-1B-Instruct baseline
bf35437
verified

lbartoszcze commited on

Remove meta-llama/Llama-3.2-1B-Instruct entry
b6eb9fa
verified

lbartoszcze commited on

Update Methods tab to use paired comparisons only
605a4dd
verified

lbartoszcze commited on

Add paired comparison logic for accurate method effectiveness calculation
e2129ad
verified

lbartoszcze commited on

Dynamic methods - accept any method from user submissions
a55ede4
verified

lbartoszcze commited on

Add Methods Comparison tab with delta from baseline
f24cb80
verified

lbartoszcze commited on

Add sample_responses_url column to leaderboard
26802a6
verified

lbartoszcze commited on

Update leaderboard: 2025-12-01T12:13:18.179565
473dc11
verified

lbartoszcze commited on

Add model_family, model_size, method fields
4afa93e
verified

lbartoszcze commited on

Add model_family, model_size, method fields
0248d1e
verified

lbartoszcze commited on

Upload leaderboard.csv with huggingface_hub
84c3f51
verified

lbartoszcze commited on

Upload README.md with huggingface_hub
70518d4
verified

lbartoszcze commited on

Upload Dockerfile with huggingface_hub
6efeb28
verified

lbartoszcze commited on

Upload requirements.txt with huggingface_hub
a2d4c52
verified

lbartoszcze commited on

Upload app.py with huggingface_hub
bb204d8
verified

lbartoszcze commited on

initial commit
b712f0f
verified

lbartoszcze commited on