Caiyun-AI/DCFormer-2.8B
Tags: Text Generation, Transformers, PyTorch, English, dcformer, causal-lm, dcmha, custom_code
Paper: arXiv:2405.08553
License: MIT
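The custom_code tag indicates that the repository ships its own modeling code rather than relying on a class built into Transformers, so loading it normally requires opting in to remote code execution. A minimal sketch of that, assuming the repository's config.json registers its class for `AutoModelForCausalLM` (not verified here); `REPO_ID`, `load_kwargs`, and `load` are illustrative names, not part of the repository:

```python
REPO_ID = "Caiyun-AI/DCFormer-2.8B"


def load_kwargs() -> dict:
    """Keyword arguments for loading a repository tagged custom_code."""
    # trust_remote_code=True lets Transformers import and run the modeling
    # file hosted in the repository. Review that file before enabling this.
    return {"trust_remote_code": True}


def load(repo_id: str = REPO_ID):
    """Return (tokenizer, model); downloads the checkpoint on first use."""
    # Imported lazily so this module can be read or tested without
    # transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id, **load_kwargs())
    return tokenizer, model
```

Calling `load()` fetches the full checkpoint, so it is best run on a machine with enough disk space and memory for a multi-gigabyte model.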
Branch: main
Repository size: 5.81 GB
Contributors: 3
History: 10 commits
Latest commit: 51d254e, "fix k_mask", by Hilbertmeng, over 1 year ago
File                        Size           Last commit message      Updated
.gitattributes              1.52 kB        initial commit           over 1 year ago
README.md                   2.42 kB        add paper link           over 1 year ago
config.json                 751 Bytes      upload model and code    over 1 year ago
configuration_dcformer.py   2.51 kB        upload model and code    over 1 year ago
generation_demo.py          1.31 kB        update readme            over 1 year ago
modeling_dcformer.py        32.7 kB        fix k_mask               over 1 year ago
pytorch_model.bin           5.81 GB (xet)  upload model and code    over 1 year ago
tokenizer.json              2.11 MB        upload model and code    over 1 year ago
tokenizer_config.json       264 Bytes      upload model and code    over 1 year ago
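As a rough sanity check on the listing, the size of pytorch_model.bin can be divided by the nominal parameter count to estimate how many bytes each weight occupies. The sketch below assumes the page reports decimal gigabytes and treats the "2.8B" in the model name as the parameter count; both are approximations, and the constant names are illustrative:

```python
# Figures from the file listing. The parameter count is the nominal "2.8B"
# from the model name, not an exact count.
CHECKPOINT_BYTES = 5.81e9  # assumes decimal gigabytes (10**9 bytes)
NOMINAL_PARAMS = 2.8e9

bytes_per_param = CHECKPOINT_BYTES / NOMINAL_PARAMS
print(f"{bytes_per_param:.2f} bytes per parameter")
```

A result close to 2 bytes per parameter would be consistent with weights stored in a 16-bit format, but the listing alone does not confirm the dtype; config.json or the checkpoint itself would settle it.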