NorGLM/NorGPT-369M

NorGLM committed on Mar 10, 2024

Commit 8a9d894 · verified · 1 Parent(s): 631ecbc

Update README.md

Files changed (1)

README.md +21 -0

@@ -8,6 +8,27 @@ Generative Pretrained Transformer with 369M parameters for Norwegian.
 It belongs to NorGLM, a suite of pretrained Norwegian Generative Language Models. The model is based on the GPT-2 architecture. NorGLM can be used for non-commercial purposes.
+## Datasets
 All models in NorGLM are trained on 200G datasets, nearly 25B tokens, including Norwegian, Danish, Swedish, German and English.
+## Run the Model
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+model_id = "NorGLM/NorGPT-369M"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+model = AutoModelForCausalLM.from_pretrained(
+    model_id,
+    device_map='auto',
+    torch_dtype=torch.bfloat16
+)
+text = "Tom ønsket å gå på barene med venner"
+inputs = tokenizer(text, return_tensors="pt")
+outputs = model.generate(**inputs, max_new_tokens=20)
+```
+## Note
 More training and evaluation details and papers will come soon!
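
Review note: the added snippet stops at `model.generate`, which returns token IDs rather than text, and with `device_map='auto'` the inputs may need to be moved to the model's device first. A minimal sketch of how the example could be completed follows. The helper name `generate_text` and the `main` wrapper are hypothetical and not part of the README; the model ID, prompt, and loading arguments are taken from the diff.

```python
def generate_text(model, tokenizer, prompt, max_new_tokens=20):
    """Generate a continuation of `prompt` and return it as a string.

    `generate_text` is a hypothetical helper, not part of the README.
    """
    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")
    # With device_map='auto' the model may sit on a GPU, so move every
    # input tensor to the model's device before generating.
    inputs = {name: tensor.to(model.device) for name, tensor in inputs.items()}
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # generate() returns a batch of token IDs; decode the first sequence.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def main():
    # Imports are local so the helper above can be used without them.
    import torch
    from transformers import AutoTokenizer, AutoModelForCausalLM

    model_id = "NorGLM/NorGPT-369M"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, device_map="auto", torch_dtype=torch.bfloat16
    )
    print(generate_text(model, tokenizer, "Tom ønsket å gå på barene med venner"))
```

Calling `main()` downloads the checkpoint and prints the prompt followed by up to 20 generated tokens. The exact text depends on the model weights and generation settings.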