SimpleStories/SimpleStories-125M


lennart-finke committed on Dec 9, 2024

Commit 6c72502 (verified) · 1 parent: bcc051c

Extended readme

Files changed (1)

README.md +34 -3

@@ -7,8 +7,39 @@ tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 - simple-stories
+datasets:
+- lennart-finke/SimpleStories
 ---
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: https://github.com/danbraunai/simple_stories_train
-- Docs: [More Information Needed]
+To load this model from within [simple_stories_train](https://github.com/danbraunai/simple_stories_train), you can run:
+```python
+from typing import Any
+import torch
+import torch.nn as nn
+from huggingface_hub import PyTorchModelHubMixin
+from simple_stories_train.models.llama import Llama, LlamaConfig
+from simple_stories_train.models.model_configs import MODEL_CONFIGS_DICT
+class LlamaTransformer(
+    nn.Module,
+    PyTorchModelHubMixin,
+    repo_url="https://github.com/danbraunai/simple_stories_train",
+    language=["en"],
+    pipeline_tag="text-generation"
+):
+    def __init__(self, **config : Any):
+        super().__init__()
+        self.llama = Llama(LlamaConfig(**config))
+    def forward(self, x : torch.Tensor):
+        return self.llama(x)
+config = MODEL_CONFIGS_DICT["d12"]
+model = LlamaTransformer(**config)
+HUB_REPO_NAME = "lennart-finke/SimpleStories-125M"
+model = model.from_pretrained(HUB_REPO_NAME)
+```
+- Library: https://github.com/danbraunai/simple_stories_train