Adding `safetensors` variant of this model

#6
by SFconvertbot - opened
Files changed (2) hide show
  1. README.md +42 -15
  2. config.json +1 -1
README.md CHANGED
@@ -5,19 +5,46 @@ language:
5
  tags:
6
  - generation
7
  license: apache-2.0
8
- ---
9
-
10
- Pretrained GPT2 with expanded n_ctx up to 2048(also with expanded embedding dimension to 1536) in Korean.
11
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
12
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_psyche__kogpt)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
- | Metric | Value |
15
- |-----------------------|---------------------------|
16
- | Avg. | 24.27 |
17
- | ARC (25-shot) | 21.16 |
18
- | HellaSwag (10-shot) | 28.11 |
19
- | MMLU (5-shot) | 26.56 |
20
- | TruthfulQA (0-shot) | 42.06 |
21
- | Winogrande (5-shot) | 49.09 |
22
- | GSM8K (5-shot) | 0.0 |
23
- | DROP (3-shot) | 2.89 |
 
 
5
  tags:
6
  - generation
7
  license: apache-2.0
8
+ model-index:
9
+ - name: task_1
10
+ results:
11
+ - task:
12
+ type: natural-language-generation
13
+ dataset:
14
+ type: hellaswag
15
+ name: hellaswag(10 shots)
16
+ metrics:
17
+ - type: acc_norm
18
+ value: 27.7
19
+ - name: task_2
20
+ results:
21
+ - task:
22
+ type: natural-language-generation
23
+ dataset:
24
+ type: ARC
25
+ name: ARC(25 shots)
26
+ metrics:
27
+ - type: acc_norm
28
+ value: 23.8
29
+ - name: task_3
30
+ results:
31
+ - task:
32
+ type: natural-language-generation
33
+ dataset:
34
+ type: MMLU
35
+ name: MMLU(5 shots)
36
+ metrics:
37
+ - type: acc
38
+ value: 24.9
39
 
40
+ - name: task_4
41
+ results:
42
+ - task:
43
+ type: natural-language-generation
44
+ dataset:
45
+ type: TruthfulQA
46
+ name: TruthfulQA(0 shots)
47
+ metrics:
48
+ - type: mc2
49
+ value: 46.5
50
+ ---
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "psyche/kogpt",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"
 
1
  {
2
+ "_name_or_path": "runs/checkpoint-100000",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"