Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenNLPLab
/
TransNormerLLM2-1B-300B
like
3
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:
2307.14995
arxiv:
2210.10340
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
main
TransNormerLLM2-1B-300B
2.04 GB
1 contributor
History:
22 commits
OpenNLPLab
Upgrade to lightning att2
bf796dd
verified
almost 2 years ago
images
Upload lightning-leopard.jpg
almost 2 years ago
.gitattributes
Safe
1.58 kB
Upload lightning-leopard.png
almost 2 years ago
Community License for TransNormerLLM Model.pdf
Safe
263 kB
Upload license
almost 2 years ago
README.md
Safe
10.6 kB
Update README.md
almost 2 years ago
TransNormerLLM模型社区许可协议.pdf
Safe
294 kB
Upload license
almost 2 years ago
config.json
Safe
814 Bytes
Publish 1B2-300B
almost 2 years ago
configuration_transnormer.py
Safe
2.27 kB
Publish 1B2-300B
almost 2 years ago
generation_config.json
Safe
164 Bytes
Publish 1B2-300B
almost 2 years ago
lightning_attention.py
Safe
15.3 kB
Publish 1B2-300B
almost 2 years ago
lightning_attention2.py
Safe
15.3 kB
Upgrade to lightning att2
almost 2 years ago
modeling_transnormer.py
Safe
34.6 kB
Upgrade to lightning att2
almost 2 years ago
norm.py
1.27 kB
Publish 1B2-300B
almost 2 years ago
pytorch_model-00001-of-00003.bin
994 MB
xet
Publish 1B2-300B
almost 2 years ago
pytorch_model-00002-of-00003.bin
977 MB
xet
Publish 1B2-300B
almost 2 years ago
pytorch_model-00003-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
What is a pickle import?
69.2 MB
xet
Publish 1B2-300B
almost 2 years ago
pytorch_model.bin.index.json
Safe
7.02 kB
Publish 1B2-300B
almost 2 years ago
srmsnorm_triton.py
Safe
5.76 kB
Publish 1B2-300B
almost 2 years ago
tokenization_baichuan.py
Safe
9.58 kB
Publish 1B2-300B
almost 2 years ago
tokenizer.model
1.14 MB
xet
Publish 1B2-300B
almost 2 years ago
tokenizer_config.json
Safe
819 Bytes
Publish 1B2-300B
almost 2 years ago
utils.py
Safe
4.39 kB
Publish 1B2-300B
almost 2 years ago