psyche
/

kogpt

Text Generation

text-generation-inference

Model card Files Files and versions

Adding `safetensors` variant of this model

#4

by psyche - opened Jul 17, 2023

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

Files changed (5) hide show

README.md +0 -23
config.json +2 -2
generation_config.json +1 -1
pytorch_model.bin +2 -2
model.safetensors → rust_model.ot +2 -2

README.md DELETED Viewed

@@ -1,23 +0,0 @@
----
-language:
-  - en
-  - ko
-tags:
-  - generation
-license: apache-2.0
----
-Pretrained GPT2 with expanded n_ctx up to 2048(also with expanded embedding dimension to 1536) in Korean.
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_psyche__kogpt)
-| Metric                | Value                     |
-|-----------------------|---------------------------|
-| Avg.                  | 24.27   |
-| ARC (25-shot)         | 21.16          |
-| HellaSwag (10-shot)   | 28.11    |
-| MMLU (5-shot)         | 26.56         |
-| TruthfulQA (0-shot)   | 42.06   |
-| Winogrande (5-shot)   | 49.09   |
-| GSM8K (5-shot)        | 0.0        |
-| DROP (3-shot)         | 2.89         |

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "psyche/kogpt",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
@@ -33,7 +33,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.31.0",
   "use_cache": true,
   "vocab_size": 32002
 }

 {
+  "_name_or_path": "runs/checkpoint-66000",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.30.2",
   "use_cache": true,
   "vocab_size": 32002
 }

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "_from_model_config": true,
   "bos_token_id": 0,
   "eos_token_id": 2,
-  "transformers_version": "4.31.0"
 }

   "_from_model_config": true,
   "bos_token_id": 0,
   "eos_token_id": 2,
+  "transformers_version": "4.30.2"
 }

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4bb7be5d1b9b55633082dd466d75a8f6028b02bfa2255a430cdb6b63a3ac1e6d
-size 1569174365

 version https://git-lfs.github.com/spec/v1
+oid sha256:6174fe3c21d632e922a498fd5d347893add6efd757af0c3f7c316d9e78040346
+size 891699345

model.safetensors → rust_model.ot RENAMED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a97b23bfa8be7ce6e1cf0eaeee05774361fd096c6d057fc7d1bf653b098150dd
-size 1569143832

 version https://git-lfs.github.com/spec/v1
+oid sha256:e80d7e388967e4b2a2cb00047bc0cc7751fbe6249dd09a18f5e16bee8e62db61
+size 1817336536