Upload tokenizer
98de2ec
-
last-checkpoint
Training in progress, step 10000, checkpoint
-
runs
Training in progress, step 10000
-
trial-number=0-learning_rate=1.3e-05-warmup_ratio=0.100-num_cycles=7.500
Training in progress, step 10000
-
trial-number=1-learning_rate=2.0e-05-warmup_ratio=0.090-num_cycles=8.500
Training in progress, step 10000
-
trial-number=10-learning_rate=4.4e-04-warmup_ratio=0.020-num_cycles=5.500
Training in progress, step 10000
-
trial-number=11-learning_rate=1.0e-04-warmup_ratio=0.080-num_cycles=3.000
Training in progress, step 10000
-
trial-number=12-learning_rate=2.2e-06-warmup_ratio=0.050-num_cycles=10.000
Training in progress, step 10000
-
trial-number=13-learning_rate=4.2e-06-warmup_ratio=0.100-num_cycles=6.500
Training in progress, step 10000
-
trial-number=14-learning_rate=3.5e-05-warmup_ratio=0.040-num_cycles=1.000
Training in progress, step 10000
-
trial-number=15-learning_rate=2.3e-04-warmup_ratio=0.070-num_cycles=8.500
Training in progress, step 10000
-
trial-number=16-learning_rate=2.7e-05-warmup_ratio=0.020-num_cycles=4.000
Training in progress, step 10000
-
trial-number=17-learning_rate=2.3e-05-warmup_ratio=0.070-num_cycles=5.500
Training in progress, step 10000
-
trial-number=18-learning_rate=2.0e-04-warmup_ratio=0.010-num_cycles=2.500
Training in progress, step 10000
-
trial-number=19-learning_rate=4.1e-05-warmup_ratio=0.100-num_cycles=10.000
Training in progress, step 10000
-
trial-number=2-learning_rate=4.1e-04-warmup_ratio=0.050-num_cycles=5.000
Training in progress, step 10000
-
trial-number=20-learning_rate=4.8e-06-warmup_ratio=0.030-num_cycles=3.000
Training in progress, step 10000
-
trial-number=21-learning_rate=1.6e-06-warmup_ratio=0.080-num_cycles=1.500
Training in progress, step 10000
-
trial-number=22-learning_rate=7.8e-05-warmup_ratio=0.050-num_cycles=7.000
Training in progress, step 10000
-
trial-number=23-learning_rate=5.9e-04-warmup_ratio=0.070-num_cycles=4.500
Training in progress, step 10000
-
trial-number=24-learning_rate=1.2e-05-warmup_ratio=0.020-num_cycles=8.500
Training in progress, step 10000
-
trial-number=3-learning_rate=5.4e-05-warmup_ratio=0.060-num_cycles=7.500
Training in progress, step 10000
-
trial-number=4-learning_rate=2.6e-06-warmup_ratio=0.030-num_cycles=0.500
Training in progress, step 10000
-
trial-number=5-learning_rate=1.3e-06-warmup_ratio=0.080-num_cycles=4.000
Training in progress, step 10000
-
trial-number=6-learning_rate=1.6e-04-warmup_ratio=0.010-num_cycles=9.500
Training in progress, step 10000
-
trial-number=7-learning_rate=7.8e-04-warmup_ratio=0.090-num_cycles=2.000
Training in progress, step 10000
-
trial-number=8-learning_rate=6.7e-06-warmup_ratio=0.040-num_cycles=6.500
Training in progress, step 10000
-
trial-number=9-learning_rate=9.1e-06-warmup_ratio=0.060-num_cycles=2.000
Training in progress, step 10000
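
The run directories above encode each trial's hyperparameters in the directory name, and the page lists them in lexicographic order (0, 1, 10, 11, ...) rather than numeric order. A minimal sketch of recovering the values and sorting by trial number follows. It assumes every name follows the `trial-number=...-learning_rate=...-warmup_ratio=...-num_cycles=...` pattern seen in the listing; `parse_run_name` and `RUN_NAME` are hypothetical helpers written for this illustration, not part of the repository.

```python
import re

# Pattern inferred from the directory names in the listing (an assumption,
# not a documented format).
RUN_NAME = re.compile(
    r"trial-number=(?P<trial>\d+)"
    r"-learning_rate=(?P<learning_rate>[0-9.]+e[+-]?\d+)"
    r"-warmup_ratio=(?P<warmup_ratio>[0-9.]+)"
    r"-num_cycles=(?P<num_cycles>[0-9.]+)"
)


def parse_run_name(name):
    """Split a run directory name into typed hyperparameter values."""
    match = RUN_NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized run name: {name!r}")
    return {
        "trial": int(match["trial"]),
        "learning_rate": float(match["learning_rate"]),
        "warmup_ratio": float(match["warmup_ratio"]),
        "num_cycles": float(match["num_cycles"]),
    }


# Three names copied from the listing, in the order the page shows them.
names = [
    "trial-number=10-learning_rate=4.4e-04-warmup_ratio=0.020-num_cycles=5.500",
    "trial-number=2-learning_rate=4.1e-04-warmup_ratio=0.050-num_cycles=5.000",
    "trial-number=0-learning_rate=1.3e-05-warmup_ratio=0.100-num_cycles=7.500",
]

# Sort numerically by trial number instead of lexicographically by name.
runs = sorted((parse_run_name(n) for n in names), key=lambda r: r["trial"])
for run in runs:
    print(run)
```

Sorting on the parsed integer puts trial 2 before trial 10, which a plain string sort of the directory names does not.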
-
1.52 kB
initial commit
-
7.08 kB
Upload tokenizer
-
1.67 kB
Upload best_hyperparameters.md with huggingface_hub
-
4.78 kB
Training in progress, step 5000
-
177 Bytes
Initial version
-
3.21 kB
Training in progress, step 5000
-
410 MB
Training in progress, step 10000
-
958 Bytes
Training in progress, step 5000
-
23.5 kB
Training in progress, step 5000
-
1.23 kB
Training in progress, step 5000
-
7.42 kB
Training in progress, step 10000
-
7.9 kB
Training in progress, step 5000