Collections
Discover the best community collections!
Collections trending this week
-
togethercomputer/evo-1-131k-base
Text Generation • Updated • 634 • 114 -
togethercomputer/StripedHyena-Nous-7B
Text Generation • 8B • Updated • 349 • 143 -
togethercomputer/StripedHyena-Hessian-7B
Text Generation • 8B • Updated • 35 • 66 -
togethercomputer/evo-1-8k-base
Text Generation • Updated • 4.68k • 10
-
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Paper • 2105.13626 • Published • 5 -
Beyond Language Models: Byte Models are Digital World Simulators
Paper • 2402.19155 • Published • 53 -
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
Paper • 2305.07185 • Published • 10 -
Byte-Level Recursive Convolutional Auto-Encoder for Text
Paper • 1802.01817 • Published
-
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Paper • 2105.13626 • Published • 5 -
Beyond Language Models: Byte Models are Digital World Simulators
Paper • 2402.19155 • Published • 53 -
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
Paper • 2305.07185 • Published • 10 -
Byte-Level Recursive Convolutional Auto-Encoder for Text
Paper • 1802.01817 • Published
-
togethercomputer/evo-1-131k-base
Text Generation • Updated • 634 • 114 -
togethercomputer/StripedHyena-Nous-7B
Text Generation • 8B • Updated • 349 • 143 -
togethercomputer/StripedHyena-Hessian-7B
Text Generation • 8B • Updated • 35 • 66 -
togethercomputer/evo-1-8k-base
Text Generation • Updated • 4.68k • 10