eyad-silx
/
llm
Files and versions
be88aeb
llm
2.56 GB
2 contributors
History:
38 commits
eyad-silx
Upload best_baseline.pt with huggingface_hub
be88aeb
verified
over 1 year ago
assets
Update repository
over 1 year ago
checkpoints
Upload checkpoints/dtat/checkpoint_004000.pt with huggingface_hub
over 1 year ago
config
Update config/baseline_config.py
over 1 year ago
data
Upload train.bin
over 1 year ago
wandb
Update repository
over 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
LICENSE
Safe
1.07 kB
Update repository
over 1 year ago
README.md
Safe
13.6 kB
Update repository
over 1 year ago
bench.py
Safe
4.82 kB
Update repository
over 1 year ago
best_baseline.pt
pickle
Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
155 MB
xet
Upload best_baseline.pt with huggingface_hub
over 1 year ago
configurator.py
Safe
1.76 kB
Update repository
over 1 year ago
model.py
Safe
16.3 kB
Update repository
over 1 year ago
model_baseline.py
Safe
8.16 kB
Update model_baseline.py
over 1 year ago
model_dtat.py
Safe
13.6 kB
Update model_dtat.py
over 1 year ago
model_modified.py
Safe
8.41 kB
Update repository
over 1 year ago
prepare_data.py
Safe
1.15 kB
Update repository
over 1 year ago
resume_training.py
11.3 kB
Upload resume_training.py
over 1 year ago
sample.py
Safe
3.94 kB
Update repository
over 1 year ago
scaling_laws.ipynb
Safe
269 kB
Update repository
over 1 year ago
train.py
Safe
14.9 kB
Update repository
over 1 year ago
train_baseline.py
Safe
6.8 kB
Update train_baseline.py
over 1 year ago
train_dtat.py
Safe
10.8 kB
Update train_dtat.py
over 1 year ago
train_enwik8.py
3.83 kB
Update repository
over 1 year ago
transformer_sizing.ipynb
Safe
14.6 kB
Update repository
over 1 year ago
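The listing flags best_baseline.pt as a pickle file and names three imports it references. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes of a pickle stream with the standard-library `pickletools` module. It is not the scanner the hosting site uses; the function name `list_pickle_imports` is hypothetical, and the handling of `STACK_GLOBAL` is a heuristic that assumes the module and attribute names are the two most recently pushed strings.

```python
import pickletools

# Opcodes that push a text string. STACK_GLOBAL takes its module and
# attribute name from the stack; here we assume (heuristically) that they
# are the two most recently pushed strings.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only parsed, never loaded, so no code from it runs.
    """
    imports = set()
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in _STRING_OPS:
            recent_strings.append(arg)
        elif opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            module, name = recent_strings[-2], recent_strings[-1]
            imports.add(f"{module}.{name}")
    return imports
```

To try it on a local file, read the raw pickle bytes and pass them in, for example `list_pickle_imports(open(path, "rb").read())` with `path` pointing at a plain pickle file. A checkpoint saved as an archive would need its pickle member extracted first; whether that applies to best_baseline.pt is not stated in the listing.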