Upload evaluation/benchmarks/math_reasoning/eval.py with huggingface_hub 8fc83da verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/knowledge_retrieval/eval.py with huggingface_hub 818c24e verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/creative_writing/eval.py with huggingface_hub 560bae7 verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/summarization/eval.py with huggingface_hub 01b6eca verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/translation/eval.py with huggingface_hub e8d48dc verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/instruction_following/eval.py with huggingface_hub 15df644 verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/question_answering/eval.py with huggingface_hub 6581ce7 verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/safety_evaluation/eval.py with huggingface_hub a51af1a verified FuryAssassin commited on 24 days ago
Upload evaluation/benchmarks/common_sense/eval.py with huggingface_hub 778f871 verified FuryAssassin commited on 24 days ago