Push model using huggingface_hub.

Files changed (3) hide show

README.md ADDED Viewed

+---
+tags:
+- model_hub_mixin
+- pytorch_model_hub_mixin
+---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Code: [More Information Needed]
+- Paper: [More Information Needed]
+- Docs: [More Information Needed]

config.json ADDED Viewed

+{
+  "cfg": {
+    "batch": 64,
+    "context_length": 1024,
+    "cycle": 200,
+    "ddp_local_rank": 0,
+    "drop_rate": 0.1,
+    "emb_dim": 768,
+    "lr": 0.0004,
+    "n_heads": 12,
+    "n_layers": 12,
+    "num_epoch": 1,
+    "tok_per_batch": 524288,
+    "total_tok": 9898595200,
+    "val_ratio": 0.1,
+    "vocab_size": 50304,
+    "warmup_ratio": 0.00125,
+    "weight_decay": 0.1,
+    "world_size": 1
+  },
+  "tied": true
+}

model.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b89b59de67e58acbd0b6cb62d7b72dd37134e34ccf872b5ccdb4624ca3d37c8
+size 497768520