Update README.md
README.md CHANGED
@@ -12,7 +12,7 @@ tags:
 
 
 
-The original model, LLaMA 1 was pre-trained at a sequence length of 2048 tokens. We went through two individual runs, targeting a sequence length of 16,
+The original model, LLaMA 1 was pre-trained at a sequence length of 2048 tokens. We went through two individual runs, targeting a sequence length of 16,384 which is a
 significant increase over the original length. While it was originally pre-trained on 1.4T tokens, it was shown to respond positively to our 500M token train and will
 coherently write and keep the same writing format (granted some caveats) up to 12K tokens relatively consistently.
 
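For downstream use, here is a minimal sketch of loading the checkpoint and prompting within the extended window using Hugging Face `transformers`. The repository id `example-org/llama-16k`, the dtype, and the generation settings are placeholders, and the sketch assumes the extended length is recorded in `config.max_position_embeddings`. The README excerpt does not describe how the context extension was trained, so nothing below should be read as the training recipe.

```python
# Minimal sketch, assuming a hypothetical repo id "example-org/llama-16k" and that
# the extended context length is exposed via config.max_position_embeddings.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "example-org/llama-16k"  # placeholder, not the actual repository name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision keeps long-context inference in memory
    device_map="auto",
)

# The card advertises a 16,384-token window but coherent writing up to ~12K tokens,
# so leave headroom below the hard limit when packing a long prompt.
max_ctx = model.config.max_position_embeddings  # expected to read 16384
prompt = "A very long document to continue..."
inputs = tokenizer(
    prompt,
    return_tensors="pt",
    truncation=True,
    max_length=max_ctx - 512,  # reserve room for the generated continuation
).to(model.device)

with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=512, do_sample=True, temperature=0.8)

# Print only the newly generated continuation, not the prompt.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```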