Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

FuryAssassin
/
DebuggedEvalPipeline-Toolathlon

Transformers
Model card Files Files and versions
xet
Community
DebuggedEvalPipeline-Toolathlon / evaluation /benchmarks
16.8 kB
  • 1 contributor
History: 4 commits
FuryAssassin's picture
FuryAssassin
Upload evaluation/benchmarks/dialogue_generation/eval.py with huggingface_hub
552cf7b verified 24 days ago
  • code_generation
    Upload evaluation/benchmarks/code_generation/eval.py with huggingface_hub 24 days ago
  • common_sense
    Upload folder using huggingface_hub 24 days ago
  • creative_writing
    Upload folder using huggingface_hub 24 days ago
  • dialogue_generation
    Upload evaluation/benchmarks/dialogue_generation/eval.py with huggingface_hub 24 days ago
  • instruction_following
    Upload folder using huggingface_hub 24 days ago
  • knowledge_retrieval
    Upload folder using huggingface_hub 24 days ago
  • logical_reasoning
    Upload folder using huggingface_hub 24 days ago
  • math_reasoning
    Upload folder using huggingface_hub 24 days ago
  • question_answering
    Upload folder using huggingface_hub 24 days ago
  • reading_comprehension
    Upload folder using huggingface_hub 24 days ago
  • safety_evaluation
    Upload folder using huggingface_hub 24 days ago
  • sentiment_analysis
    Upload folder using huggingface_hub 24 days ago
  • summarization
    Upload folder using huggingface_hub 24 days ago
  • text_classification
    Upload evaluation/benchmarks/text_classification/eval.py with huggingface_hub 24 days ago
  • translation
    Upload folder using huggingface_hub 24 days ago