bertin-project/bertin-base-stepwise
Tags: Fill-Mask, Transformers, PyTorch, JAX, TensorBoard, Joblib, Safetensors, Spanish, roberta
License: cc-by-4.0
refs/pr/1
bertin-base-stepwise/outputs
5.29 GB
4 contributors
History: 37 commits
versae
Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
5227e76
over 4 years ago
checkpoints, over 4 years ago
    Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
config.json, 618 Bytes, over 4 years ago
    Training dump
data_collator.joblib, 1.47 MB, xet, over 4 years ago
    Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
events.out.tfevents.1626172316.underestimate.4022703.3.v2, 27.7 MB, xet, over 4 years ago
    Dataset stats
events.out.tfevents.1627122688.tablespoon.2185269.3.v2, 40 Bytes, xet, over 4 years ago
    Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
events.out.tfevents.1627122817.tablespoon.2191003.3.v2, 149 kB, xet, over 4 years ago
    Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
events.out.tfevents.1627125745.tablespoon.2266135.3.v2, 149 kB, xet, over 4 years ago
    Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
events.out.tfevents.1627128247.tablespoon.2330108.3.v2, 10.2 MB, xet, over 4 years ago
    Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
flax_model.msgpack, 250 MB, xet, over 4 years ago
    Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
optimizer_state.msgpack, 500 MB, xet, over 4 years ago
    Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
training_args.joblib, 1.87 kB, xet, over 4 years ago
    Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
training_state.json, 16 Bytes, over 4 years ago
    Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it]
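
Most commit messages in this listing are progress-bar lines that embed the training step, loss, and accuracy at the time of the commit. The following is a minimal sketch of pulling those values out with a regular expression, assuming the message format seen in this listing stays consistent. The names `PROGRESS_RE` and `parse_progress` are hypothetical helpers for illustration, not part of any library or of this repository.

```python
import re

# Matches the leading part of messages such as:
# "Step... (249000/250000 | Loss: 1.71, Acc: 0.65): 100%|...| 250000/250000 [...]"
# The pattern is an assumption based only on the messages in this listing.
PROGRESS_RE = re.compile(
    r"Step\.\.\. \((?P<step>\d+)/(?P<total>\d+) \| "
    r"Loss: (?P<loss>[0-9.eE+-]+), Acc: (?P<acc>[0-9.eE+-]+)\)"
)


def parse_progress(message):
    """Return step, total, loss, and accuracy from a progress-style message, or None."""
    match = PROGRESS_RE.search(message)
    if match is None:
        # Messages such as "Training dump" carry no metrics.
        return None
    return {
        "step": int(match["step"]),
        "total": int(match["total"]),
        "loss": float(match["loss"]),
        "acc": float(match["acc"]),
    }


final = parse_progress(
    "Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%"
)
earlier = parse_progress(
    "Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%"
)

print(final["step"], final["loss"], final["acc"])
# Loss difference between the two logged steps found in this listing.
print(earlier["loss"] - final["loss"])
```

Messages without metrics, such as "Training dump" or "Dataset stats", return `None`, so callers can filter them out before comparing steps.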