·
AI & ML interests
Data
Organizations
dvilasuero/mmmlu-pro-eval-Llama-3.1-8B-Instruct-thinking
Viewer
• Updated • 70 • 8
dvilasuero/mmmlu-pro-eval-Llama-3.1-8B-Instruct-no-cot
Viewer
• Updated • 70 • 9
dvilasuero/mmmlu-pro-eval-Qwen2.5-72B-Instruct-thinking
Viewer
• Updated • 35 • 7
dvilasuero/mmmlu-pro-eval-Llama-3.1-70B-Instruct-thinking
Viewer
• Updated • 35 • 7
dvilasuero/ultrafeedback_thinking_llms
Viewer
• Updated • 100 • 6
• 1
dvilasuero/10k_prompts_thinking_llms
Viewer
• Updated • 5 • 18
• 2
dvilasuero/ultrafeedback_binarized_thinking_llms
Viewer
• Updated • 5 • 10
• 1
dvilasuero/ultrafeedback_binarized_thinkingllms
Viewer
• Updated • 10 • 5
dvilasuero/mmmlu-pro-eval-Llama-3.1-70B-Instruct-no-cot
Viewer
• Updated • 1k • 10
dvilasuero/mmmlu-pro-eval-Llama-3.1-70B-Instruct-cot
Viewer
• Updated • 1k • 10
dvilasuero/mmmlu-pro-eval-Llama-3.1-8B-Instruct-cot
Viewer
• Updated • 1k • 12
dvilasuero/mmlu-pro-prep-full
Viewer
• Updated • 12k • 8
dvilasuero/mmmlu-pro-eval-Qwen2.5-72B-Instruct-no-cot
Viewer
• Updated • 70 • 7
dvilasuero/mmmlu-pro-eval-Qwen2.5-72B-Instruct-cot
Viewer
• Updated • 70 • 10
dvilasuero/mmmlu-pro-eval-llama-70B
Viewer
• Updated • 70 • 5
dvilasuero/mmmlu-pro-eval-cot-70B
Viewer
• Updated • 70 • 9
dvilasuero/mmmlu-pro-eval-llama
Viewer
• Updated • 70 • 7
dvilasuero/mmmlu-pro-eval-cot
Viewer
• Updated • 70 • 13
Viewer
• Updated • 70 • 7
dvilasuero/synth-text-classification
Viewer
• Updated • 10 • 35
dvilasuero/reflection-v1-near-duplicates
Viewer
• Updated • 310 • 4
dvilasuero/reflection-v1-final-dedup
Viewer
• Updated • 36.5k • 6
• 14
dvilasuero/reflection-v1-exact-duplicates
Viewer
• Updated • 23.3k • 4
dvilasuero/reflection-v1-dedup
Viewer
• Updated • 36.9k • 8
dvilasuero/reflection-v1-duplicates
Viewer
• Updated • 44.9k • 8
dvilasuero/finevideo-qa-debug
Viewer
• Updated • 100 • 6
dvilasuero/reflection-v1-gpt-4o-judge
Viewer
• Updated • 1k • 5
dvilasuero/reflection-v1-openai-o-mini-judge
Viewer
• Updated • 3k • 30
• 8
dvilasuero/reflection-judge-llama70b
Viewer
• Updated • 1 • 8
dvilasuero/finevideo-qa-v3
Viewer
• Updated • 5 • 16