Mini_Synatra_SFT / README.md

leaderboard-pr-bot

Adding Evaluation Results

76642ea verified almost 2 years ago

preview code

raw

history blame

4.07 kB

metadata

language:
  - ko
license: cc-by-sa-4.0
library_name: transformers
pipeline_tag: text-generation
model-index:
  - name: Mini_Synatra_SFT
    results:
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: AI2 Reasoning Challenge (25-Shot)
          type: ai2_arc
          config: ARC-Challenge
          split: test
          args:
            num_few_shot: 25
        metrics:
          - type: acc_norm
            value: 62.46
            name: normalized accuracy
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: HellaSwag (10-Shot)
          type: hellaswag
          split: validation
          args:
            num_few_shot: 10
        metrics:
          - type: acc_norm
            value: 83.44
            name: normalized accuracy
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: MMLU (5-Shot)
          type: cais/mmlu
          config: all
          split: test
          args:
            num_few_shot: 5
        metrics:
          - type: acc
            value: 61.2
            name: accuracy
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: TruthfulQA (0-shot)
          type: truthful_qa
          config: multiple_choice
          split: validation
          args:
            num_few_shot: 0
        metrics:
          - type: mc2
            value: 53.67
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: Winogrande (5-shot)
          type: winogrande
          config: winogrande_xl
          split: validation
          args:
            num_few_shot: 5
        metrics:
          - type: acc
            value: 74.66
            name: accuracy
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: GSM8k (5-shot)
          type: gsm8k
          config: main
          split: test
          args:
            num_few_shot: 5
        metrics:
          - type: acc
            value: 44.88
            name: accuracy
        source:
          url: >-
            https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=maywell/Mini_Synatra_SFT
          name: Open LLM Leaderboard

Mini_Synatra_SFT🐧

Support Me

시나트라는 개인 프로젝트로, 1인의 자원으로 개발되고 있습니다. 모델이 마음에 드셨다면 약간의 연구비 지원은 어떨까요?

Wanna be a sponser? Contact me on Telegram AlzarTakkarsen

Model Details

Base Model
Minirecord/Mini_synatra_7b_02

Trained On
A100 80GB * 1

Instruction format

It follows ChatML format.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	63.39
AI2 Reasoning Challenge (25-Shot)	62.46
HellaSwag (10-Shot)	83.44
MMLU (5-Shot)	61.20
TruthfulQA (0-shot)	53.67
Winogrande (5-shot)	74.66
GSM8k (5-shot)	44.88