PracticeLLM
/

Twice-KoSOLAR-16.1B-instruct-test

Text Generation

text-generation-inference

Model card Files Files and versions

kyujinpy commited on Jan 1, 2024

Commit

d0ebbc4

·

1 Parent(s): 43e7df4

Upload README.md

Files changed (1) hide show

README.md +13 -0

README.md CHANGED Viewed

@@ -71,6 +71,19 @@ python finetune.py \
 - Follow up as [beomi/LM-Harness](https://github.com/Beomi/ko-lm-evaluation-harness)
 ```
 gpt2 (pretrained=PracticeLLM/Twice-KoSOLAR-16.1B-test), limit: None, provide_description: False, num_fewshot: 0, batch_size: None
 |      Task      |Version| Metric |Value |   |Stderr|
 |----------------|------:|--------|-----:|---|-----:|

 - Follow up as [beomi/LM-Harness](https://github.com/Beomi/ko-lm-evaluation-harness)
 ```
+gpt2 (pretrained=PracticeLLM/Twice-KoSOLAR-16.1B-instruct-test), limit: None, provide_description: False, num_fewshot: 0, batch_size: None
+|      Task      |Version| Metric |Value |   |Stderr|
+|----------------|------:|--------|-----:|---|-----:|
+|kobest_boolq    |      0|acc     |0.5100|±  |0.0133|
+|                |       |macro_f1|0.3527|±  |0.0079|
+|kobest_copa     |      0|acc     |0.6740|±  |0.0148|
+|                |       |macro_f1|0.6732|±  |0.0148|
+|kobest_hellaswag|      0|acc     |0.4640|±  |0.0223|
+|                |       |acc_norm|0.5480|±  |0.0223|
+|                |       |macro_f1|0.4585|±  |0.0223|
+|kobest_sentineg |      0|acc     |0.6574|±  |0.0238|
+|                |       |macro_f1|0.6184|±  |0.0253|
 gpt2 (pretrained=PracticeLLM/Twice-KoSOLAR-16.1B-test), limit: None, provide_description: False, num_fewshot: 0, batch_size: None
 |      Task      |Version| Metric |Value |   |Stderr|
 |----------------|------:|--------|-----:|---|-----:|