Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dongyh
/
FANformer-1B
like
5
Text Generation
Transformers
Safetensors
allenai/dolma
English
hf_olmo
custom_code
arxiv:
2502.21309
arxiv:
2410.02675
License:
mit
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
main
FANformer-1B
4.91 GB
1 contributor
History:
17 commits
dongyh
Update README.md
ebd97cc
verified
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
5.89 kB
Update README.md
10 months ago
__init__.py
314 Bytes
Upload 2 files
10 months ago
aliases.py
109 Bytes
Upload 2 files
10 months ago
beam_search.py
46.6 kB
Upload 15 files
10 months ago
checkpoint.py
88.2 kB
Upload 15 files
10 months ago
config.json
1.5 kB
Upload 15 files
10 months ago
config.py
41.7 kB
Upload 15 files
10 months ago
configuration_olmo.py
2.07 kB
first commit
10 months ago
exceptions.py
838 Bytes
Upload 15 files
10 months ago
generation_config.json
115 Bytes
first commit
10 months ago
initialization.py
597 Bytes
Upload 15 files
10 months ago
model.py
81.7 kB
Upload 15 files
10 months ago
model.safetensors
4.91 GB
xet
first commit
10 months ago
modeling_fan.py
11.2 kB
Upload 15 files
10 months ago
optim.py
47.1 kB
Upload 15 files
10 months ago
safetensors_util.py
2.45 kB
Upload 15 files
10 months ago
special_tokens_map.json
293 Bytes
add tokenizer
10 months ago
tokenizer.json
3.57 MB
add tokenizer
10 months ago
tokenizer_config.json
5.4 kB
add tokenizer
10 months ago
torch_util.py
4.75 kB
Upload 15 files
10 months ago
train.py
59.2 kB
Upload 15 files
10 months ago
util.py
33.6 kB
Upload 15 files
10 months ago
version.py
407 Bytes
Upload 15 files
10 months ago