microsoft
/

phi-1

Text Generation

text-generation-inference

Model card Files Files and versions

gugarosa commited on Dec 13, 2023

Commit

b3ebf08

·

1 Parent(s): 304b058

Upload 4 files

Files changed (3) hide show

README.md +1 -1
config.json +1 -1
configuration_phi.py +1 -1

README.md CHANGED Viewed

@@ -1,5 +1,4 @@
 ---
-inference: false
 license: other
 license_name: microsoft-research-license
 license_link: https://huggingface.co/microsoft/phi-1/resolve/main/Research%20License.docx
@@ -17,6 +16,7 @@ The language model Phi-1 is a Transformer with 1.3 billion parameters, specializ
 Given the nature of the training data, Phi-1 is best suited for prompts using the code format:
 ### Code Format:
 ```python
 def print_prime(n):
    """

 ---
 license: other
 license_name: microsoft-research-license
 license_link: https://huggingface.co/microsoft/phi-1/resolve/main/Research%20License.docx
 Given the nature of the training data, Phi-1 is best suited for prompts using the code format:
 ### Code Format:
 ```python
 def print_prime(n):
    """

config.json CHANGED Viewed

@@ -15,7 +15,7 @@
   "fused_dense": false,
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
-  "model_type": "phi",
   "n_embd": 2048,
   "n_head": 32,
   "n_head_kv": null,

   "fused_dense": false,
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
+  "model_type": "phi-msft",
   "n_embd": 2048,
   "n_head": 32,
   "n_head_kv": null,

configuration_phi.py CHANGED Viewed

@@ -10,7 +10,7 @@ from transformers import PretrainedConfig
 class PhiConfig(PretrainedConfig):
     """Phi configuration."""
-    model_type = "phi"
     attribute_map = {
         "max_position_embeddings": "n_positions",
         "hidden_size": "n_embd",

 class PhiConfig(PretrainedConfig):
     """Phi configuration."""
+    model_type = "phi-msft"
     attribute_map = {
         "max_position_embeddings": "n_positions",
         "hidden_size": "n_embd",