Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Caiyun-AI
/
DCFormer-2.8B
like
2
Text Generation
Transformers
PyTorch
English
dcformer
causal-lm
dcmha
custom_code
arxiv:
2405.08553
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
DCFormer-2.8B
11.6 GB
3 contributors
History:
11 commits
SFconvertbot
Adding `safetensors` variant of this model
3fb928a
verified
9 months ago
.gitattributes
1.52 kB
initial commit
over 1 year ago
README.md
2.42 kB
add paper link
over 1 year ago
config.json
751 Bytes
upload model and code
over 1 year ago
configuration_dcformer.py
2.51 kB
upload model and code
over 1 year ago
generation_demo.py
1.31 kB
update readme
over 1 year ago
model.safetensors
5.81 GB
xet
Adding `safetensors` variant of this model
9 months ago
modeling_dcformer.py
32.7 kB
fix k_mask
over 1 year ago
pytorch_model.bin
5.81 GB
xet
upload model and code
over 1 year ago
tokenizer.json
2.11 MB
upload model and code
over 1 year ago
tokenizer_config.json
264 Bytes
upload model and code
over 1 year ago