Model save
Browse files- README.md +84 -0
- adapter_model.safetensors +1 -1
- experiment_log.txt +289 -0
- nohup.log +289 -0
- runs/Mar29_21-40-02_65d8b3854c39/events.out.tfevents.1774829051.65d8b3854c39.94930.1 +3 -0
README.md
ADDED
|
@@ -0,0 +1,84 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: peft
|
| 3 |
+
license: apache-2.0
|
| 4 |
+
base_model: openai/whisper-base
|
| 5 |
+
tags:
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
metrics:
|
| 8 |
+
- wer
|
| 9 |
+
model-index:
|
| 10 |
+
- name: exp_002_base_lora
|
| 11 |
+
results: []
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
| 15 |
+
should probably proofread and complete it, then remove this comment. -->
|
| 16 |
+
|
| 17 |
+
# exp_002_base_lora
|
| 18 |
+
|
| 19 |
+
This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on an unknown dataset.
|
| 20 |
+
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 0.5781
|
| 22 |
+
- Wer: 54.5829
|
| 23 |
+
- Wer Ortho: 57.7728
|
| 24 |
+
- Cer: 17.2107
|
| 25 |
+
|
| 26 |
+
## Model description
|
| 27 |
+
|
| 28 |
+
More information needed
|
| 29 |
+
|
| 30 |
+
## Intended uses & limitations
|
| 31 |
+
|
| 32 |
+
More information needed
|
| 33 |
+
|
| 34 |
+
## Training and evaluation data
|
| 35 |
+
|
| 36 |
+
More information needed
|
| 37 |
+
|
| 38 |
+
## Training procedure
|
| 39 |
+
|
| 40 |
+
### Training hyperparameters
|
| 41 |
+
|
| 42 |
+
The following hyperparameters were used during training:
|
| 43 |
+
- learning_rate: 0.0001
|
| 44 |
+
- train_batch_size: 32
|
| 45 |
+
- eval_batch_size: 16
|
| 46 |
+
- seed: 42
|
| 47 |
+
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 48 |
+
- lr_scheduler_type: linear
|
| 49 |
+
- lr_scheduler_warmup_steps: 1000
|
| 50 |
+
- training_steps: 10000
|
| 51 |
+
|
| 52 |
+
### Training results
|
| 53 |
+
|
| 54 |
+
| Training Loss | Epoch | Step | Validation Loss | Wer | Wer Ortho | Cer |
|
| 55 |
+
|:-------------:|:-----:|:-----:|:---------------:|:-------:|:---------:|:-------:|
|
| 56 |
+
| 2.364 | 0.05 | 500 | 1.3775 | 89.3018 | 90.4210 | 30.2665 |
|
| 57 |
+
| 1.9822 | 0.1 | 1000 | 1.0647 | 79.1535 | 81.1369 | 25.7959 |
|
| 58 |
+
| 1.773 | 0.15 | 1500 | 0.8929 | 71.8646 | 74.4727 | 22.9622 |
|
| 59 |
+
| 1.6612 | 1.044 | 2000 | 0.8104 | 68.2874 | 70.8852 | 20.9447 |
|
| 60 |
+
| 1.5587 | 1.094 | 2500 | 0.7594 | 65.4784 | 68.2362 | 20.1626 |
|
| 61 |
+
| 1.5063 | 1.144 | 3000 | 0.7234 | 63.3287 | 66.1393 | 19.5334 |
|
| 62 |
+
| 1.4318 | 2.038 | 3500 | 0.6917 | 61.0404 | 64.1920 | 19.0537 |
|
| 63 |
+
| 1.3879 | 2.088 | 4000 | 0.6697 | 59.6591 | 62.7346 | 18.5726 |
|
| 64 |
+
| 1.3831 | 2.138 | 4500 | 0.6524 | 58.5128 | 61.6924 | 17.9122 |
|
| 65 |
+
| 1.3231 | 3.032 | 5000 | 0.6385 | 59.3316 | 62.5270 | 19.6542 |
|
| 66 |
+
| 1.3064 | 3.082 | 5500 | 0.6270 | 58.2777 | 61.2398 | 18.6335 |
|
| 67 |
+
| 1.2766 | 3.132 | 6000 | 0.6143 | 56.9425 | 60.0897 | 18.0292 |
|
| 68 |
+
| 1.255 | 4.026 | 6500 | 0.6063 | 57.1315 | 60.3056 | 18.3886 |
|
| 69 |
+
| 1.2481 | 4.076 | 7000 | 0.5986 | 56.2203 | 59.3298 | 17.9156 |
|
| 70 |
+
| 1.2161 | 4.126 | 7500 | 0.5905 | 55.6115 | 58.8274 | 17.8844 |
|
| 71 |
+
| 1.2109 | 5.02 | 8000 | 0.5860 | 55.6535 | 58.8233 | 17.7709 |
|
| 72 |
+
| 1.2183 | 5.07 | 8500 | 0.5834 | 55.3218 | 58.5202 | 17.4484 |
|
| 73 |
+
| 1.1953 | 5.12 | 9000 | 0.5781 | 54.5829 | 57.7728 | 17.2107 |
|
| 74 |
+
| 1.1721 | 6.014 | 9500 | 0.5760 | 54.8012 | 57.9347 | 17.4963 |
|
| 75 |
+
| 1.1816 | 6.064 | 10000 | 0.5753 | 54.7718 | 57.8641 | 17.3142 |
|
| 76 |
+
|
| 77 |
+
|
| 78 |
+
### Framework versions
|
| 79 |
+
|
| 80 |
+
- PEFT 0.12.0
|
| 81 |
+
- Transformers 4.48.3
|
| 82 |
+
- Pytorch 2.8.0+cu128
|
| 83 |
+
- Datasets 3.6.0
|
| 84 |
+
- Tokenizers 0.21.4
|
adapter_model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 9447288
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b53ff3848589132b36bcfce67569abe3dcf48bcd428b6ba293254c461c55484e
|
| 3 |
size 9447288
|
experiment_log.txt
CHANGED
|
@@ -3821,3 +3821,292 @@ The attention mask is not set and cannot be inferred from input because pad toke
|
|
| 3821 |
|
| 3822 |
|
| 3823 |
[A
|
| 3824 |
|
| 3825 |
|
|
|
|
|
|
|
|
|
|
| 3826 |
0%| | 0/176 [00:00<?, ?it/s]
|
| 3827 |
1%| | 2/176 [00:00<01:03, 2.73it/s]
|
| 3828 |
2%|β | 3/176 [00:01<01:29, 1.93it/s]
|
| 3829 |
2%|β | 4/176 [00:02<01:45, 1.63it/s]
|
| 3830 |
3%|β | 5/176 [00:02<01:53, 1.51it/s]
|
| 3831 |
3%|β | 6/176 [00:03<01:59, 1.43it/s]
|
| 3832 |
4%|β | 7/176 [00:04<02:03, 1.37it/s]
|
| 3833 |
5%|β | 8/176 [00:05<02:03, 1.36it/s]
|
| 3834 |
5%|β | 9/176 [00:06<02:04, 1.34it/s]
|
| 3835 |
6%|β | 10/176 [00:06<02:06, 1.32it/s]
|
| 3836 |
6%|β | 11/176 [00:07<02:05, 1.32it/s]
|
| 3837 |
7%|β | 12/176 [00:08<02:05, 1.31it/s]
|
| 3838 |
7%|β | 13/176 [00:09<02:04, 1.31it/s]
|
| 3839 |
8%|β | 14/176 [00:09<02:01, 1.34it/s]
|
| 3840 |
9%|β | 15/176 [00:10<02:01, 1.32it/s]
|
| 3841 |
9%|β | 16/176 [00:11<01:59, 1.34it/s]
|
| 3842 |
10%|β | 17/176 [00:12<02:00, 1.32it/s]
|
| 3843 |
10%|β | 18/176 [00:12<02:02, 1.29it/s]
|
| 3844 |
11%|β | 19/176 [00:13<02:02, 1.28it/s]
|
| 3845 |
11%|ββ | 20/176 [00:14<01:59, 1.30it/s]
|
| 3846 |
12%|ββ | 21/176 [00:15<01:59, 1.30it/s]
|
| 3847 |
12%|ββ | 22/176 [00:16<01:59, 1.29it/s]
|
| 3848 |
13%|ββ | 23/176 [00:16<01:57, 1.30it/s]
|
| 3849 |
14%|ββ | 24/176 [00:17<01:56, 1.31it/s]
|
| 3850 |
14%|ββ | 25/176 [00:18<01:56, 1.30it/s]
|
| 3851 |
15%|ββ | 26/176 [00:19<01:56, 1.29it/s]
|
| 3852 |
15%|ββ | 27/176 [00:19<01:55, 1.29it/s]
|
| 3853 |
16%|ββ | 28/176 [00:20<01:52, 1.31it/s]
|
| 3854 |
16%|ββ | 29/176 [00:21<01:54, 1.28it/s]
|
| 3855 |
17%|ββ | 30/176 [00:22<01:53, 1.29it/s]
|
| 3856 |
18%|ββ | 31/176 [00:23<01:51, 1.30it/s]
|
| 3857 |
18%|ββ | 32/176 [00:23<01:50, 1.31it/s]
|
| 3858 |
19%|ββ | 33/176 [00:24<01:49, 1.31it/s]
|
| 3859 |
19%|ββ | 34/176 [00:25<01:49, 1.29it/s]
|
| 3860 |
20%|ββ | 35/176 [00:26<01:47, 1.31it/s]
|
| 3861 |
20%|ββ | 36/176 [00:26<01:46, 1.32it/s]
|
| 3862 |
21%|ββ | 37/176 [00:27<01:46, 1.30it/s]
|
| 3863 |
22%|βββ | 38/176 [00:28<01:44, 1.32it/s]
|
| 3864 |
22%|βββ | 39/176 [00:29<01:44, 1.31it/s]
|
| 3865 |
23%|βββ | 40/176 [00:29<01:43, 1.32it/s]
|
| 3866 |
23%|βββ | 41/176 [00:30<01:42, 1.31it/s]
|
| 3867 |
24%|βββ | 42/176 [00:31<01:42, 1.30it/s]
|
| 3868 |
24%|βββ | 43/176 [00:32<01:42, 1.30it/s]
|
| 3869 |
25%|βββ | 44/176 [00:32<01:41, 1.30it/s]
|
| 3870 |
26%|βββ | 45/176 [00:33<01:39, 1.32it/s]
|
| 3871 |
26%|βββ | 46/176 [00:34<01:39, 1.30it/s]
|
| 3872 |
27%|βββ | 47/176 [00:35<01:37, 1.32it/s]
|
| 3873 |
27%|βββ | 48/176 [00:35<01:37, 1.32it/s]
|
| 3874 |
28%|βββ | 49/176 [00:36<01:37, 1.30it/s]
|
| 3875 |
28%|βββ | 50/176 [00:37<01:33, 1.34it/s]
|
| 3876 |
29%|βββ | 51/176 [00:38<01:32, 1.35it/s]
|
| 3877 |
30%|βββ | 52/176 [00:38<01:34, 1.32it/s]
|
| 3878 |
30%|βββ | 53/176 [00:39<01:37, 1.26it/s]
|
| 3879 |
31%|βββ | 54/176 [00:40<01:35, 1.27it/s]
|
| 3880 |
31%|ββββ | 55/176 [00:41<01:35, 1.27it/s]
|
| 3881 |
32%|ββββ | 56/176 [00:42<01:33, 1.28it/s]
|
| 3882 |
32%|ββββ | 57/176 [00:42<01:31, 1.30it/s]
|
| 3883 |
33%|ββββ | 58/176 [00:43<01:29, 1.32it/s]
|
| 3884 |
34%|ββββ | 59/176 [00:44<01:29, 1.31it/s]
|
| 3885 |
34%|ββββ | 60/176 [00:45<01:28, 1.31it/s]
|
| 3886 |
35%|ββββ | 61/176 [00:46<01:36, 1.19it/s]
|
| 3887 |
35%|ββββ | 62/176 [00:46<01:32, 1.23it/s]
|
| 3888 |
36%|ββββ | 63/176 [00:47<01:30, 1.25it/s]
|
| 3889 |
36%|ββββ | 64/176 [00:48<01:27, 1.27it/s]
|
| 3890 |
37%|ββββ | 65/176 [00:49<01:26, 1.29it/s]
|
| 3891 |
38%|ββββ | 66/176 [00:49<01:23, 1.32it/s]
|
| 3892 |
38%|ββββ | 67/176 [00:50<01:23, 1.30it/s]
|
| 3893 |
39%|ββββ | 68/176 [00:51<01:23, 1.29it/s]
|
| 3894 |
39%|ββββ | 69/176 [00:52<01:22, 1.29it/s]
|
| 3895 |
40%|ββββ | 70/176 [00:53<01:19, 1.33it/s]
|
| 3896 |
40%|ββββ | 71/176 [00:53<01:19, 1.32it/s]
|
| 3897 |
41%|ββββ | 72/176 [00:54<01:18, 1.32it/s]
|
| 3898 |
41%|βββββ | 73/176 [00:55<01:18, 1.31it/s]
|
| 3899 |
42%|βββββ | 74/176 [00:56<01:16, 1.33it/s]
|
| 3900 |
43%|βββββ | 75/176 [00:56<01:15, 1.33it/s]
|
| 3901 |
43%|βββββ | 76/176 [00:57<01:15, 1.32it/s]
|
| 3902 |
44%|βββββ | 77/176 [00:58<01:15, 1.31it/s]
|
| 3903 |
44%|βββββ | 78/176 [00:59<01:13, 1.33it/s]
|
| 3904 |
45%|βββββ | 79/176 [00:59<01:13, 1.32it/s]
|
| 3905 |
45%|βββββ | 80/176 [01:00<01:12, 1.32it/s]
|
| 3906 |
46%|βββββ | 81/176 [01:01<01:11, 1.33it/s]
|
| 3907 |
47%|βββββ | 82/176 [01:02<01:09, 1.34it/s]
|
| 3908 |
47%|βββββ | 83/176 [01:02<01:10, 1.32it/s]
|
| 3909 |
48%|βββββ | 84/176 [01:03<01:10, 1.31it/s]
|
| 3910 |
48%|βββββ | 85/176 [01:04<01:08, 1.33it/s]
|
| 3911 |
49%|βββββ | 86/176 [01:05<01:05, 1.36it/s]
|
| 3912 |
49%|βββββ | 87/176 [01:05<01:06, 1.34it/s]
|
| 3913 |
50%|βββββ | 88/176 [01:06<01:06, 1.33it/s]
|
| 3914 |
51%|βββββ | 89/176 [01:07<01:05, 1.33it/s]
|
| 3915 |
51%|βββββ | 90/176 [01:08<01:04, 1.33it/s]
|
| 3916 |
52%|ββββββ | 91/176 [01:08<01:02, 1.35it/s]
|
| 3917 |
52%|ββββββ | 92/176 [01:09<01:01, 1.36it/s]
|
| 3918 |
53%|ββββββ | 93/176 [01:10<01:07, 1.22it/s]
|
| 3919 |
53%|ββββββ | 94/176 [01:11<01:05, 1.26it/s]
|
| 3920 |
54%|ββββββ | 95/176 [01:13<01:28, 1.09s/it]
|
| 3921 |
55%|ββββββ | 96/176 [01:13<01:17, 1.03it/s]
|
| 3922 |
55%|ββββββ | 97/176 [01:14<01:12, 1.08it/s]
|
| 3923 |
56%|ββββββ | 98/176 [01:15<01:08, 1.15it/s]
|
| 3924 |
56%|ββββββ | 99/176 [01:16<01:05, 1.17it/s]
|
| 3925 |
57%|ββββββ | 100/176 [01:16<01:02, 1.23it/s]
|
| 3926 |
57%|ββββββ | 101/176 [01:17<01:00, 1.25it/s]
|
| 3927 |
58%|ββββββ | 102/176 [01:18<00:58, 1.27it/s]
|
| 3928 |
59%|ββββββ | 103/176 [01:19<00:56, 1.30it/s]
|
| 3929 |
59%|ββββββ | 104/176 [01:19<00:55, 1.30it/s]
|
| 3930 |
60%|ββββββ | 105/176 [01:20<00:54, 1.30it/s]
|
| 3931 |
60%|ββββββ | 106/176 [01:21<00:55, 1.26it/s]
|
| 3932 |
61%|ββββββ | 107/176 [01:22<00:53, 1.29it/s]
|
| 3933 |
61%|βββββββ | 108/176 [01:22<00:52, 1.30it/s]
|
| 3934 |
62%|βββββββ | 109/176 [01:23<00:50, 1.32it/s]
|
| 3935 |
62%|βββββββ | 110/176 [01:24<00:49, 1.33it/s]
|
| 3936 |
63%|βββββββ | 111/176 [01:25<00:48, 1.34it/s]
|
| 3937 |
64%|βββββββ | 112/176 [01:25<00:47, 1.35it/s]
|
| 3938 |
64%|βββββββ | 113/176 [01:26<00:46, 1.35it/s]
|
| 3939 |
65%|βββββββ | 114/176 [01:27<00:46, 1.32it/s]
|
| 3940 |
65%|βββββββ | 115/176 [01:28<00:46, 1.32it/s]
|
| 3941 |
66%|βββββββ | 116/176 [01:28<00:45, 1.33it/s]
|
| 3942 |
66%|βββββββ | 117/176 [01:29<00:44, 1.32it/s]
|
| 3943 |
67%|βββββββ | 118/176 [01:30<00:44, 1.30it/s]
|
| 3944 |
68%|βββββββ | 119/176 [01:31<00:43, 1.30it/s]
|
| 3945 |
68%|βββββββ | 120/176 [01:32<00:43, 1.28it/s]
|
| 3946 |
69%|βββββββ | 121/176 [01:32<00:43, 1.28it/s]
|
| 3947 |
69%|βββββββ | 122/176 [01:33<00:41, 1.29it/s]
|
| 3948 |
70%|βββββββ | 123/176 [01:34<00:40, 1.30it/s]
|
| 3949 |
70%|βββββββ | 124/176 [01:35<00:40, 1.29it/s]
|
| 3950 |
71%|βββββββ | 125/176 [01:35<00:39, 1.30it/s]
|
| 3951 |
72%|ββββββββ | 126/176 [01:36<00:38, 1.30it/s]
|
| 3952 |
72%|ββββββββ | 127/176 [01:37<00:37, 1.31it/s]
|
| 3953 |
73%|ββββββββ | 128/176 [01:38<00:36, 1.31it/s]
|
| 3954 |
73%|ββββββββ | 129/176 [01:38<00:35, 1.31it/s]
|
| 3955 |
74%|ββββββββ | 130/176 [01:39<00:35, 1.30it/s]
|
| 3956 |
74%|ββββββββ | 131/176 [01:40<00:33, 1.32it/s]
|
| 3957 |
75%|ββββββββ | 132/176 [01:41<00:33, 1.31it/s]
|
| 3958 |
76%|ββββββββ | 133/176 [01:41<00:32, 1.32it/s]
|
| 3959 |
76%|ββββββββ | 134/176 [01:42<00:31, 1.31it/s]
|
| 3960 |
77%|ββββββββ | 135/176 [01:43<00:31, 1.30it/s]
|
| 3961 |
77%|ββββββββ | 136/176 [01:44<00:30, 1.30it/s]
|
| 3962 |
78%|ββββββββ | 137/176 [01:45<00:29, 1.32it/s]
|
| 3963 |
78%|ββββββββ | 138/176 [01:45<00:28, 1.31it/s]
|
| 3964 |
79%|ββββββββ | 139/176 [01:46<00:27, 1.33it/s]
|
| 3965 |
80%|ββββββββ | 140/176 [01:47<00:27, 1.33it/s]
|
| 3966 |
80%|ββββββββ | 141/176 [01:48<00:26, 1.31it/s]
|
| 3967 |
81%|ββββββββ | 142/176 [01:48<00:25, 1.32it/s]
|
| 3968 |
81%|βββββββββ | 143/176 [01:49<00:24, 1.34it/s]
|
| 3969 |
82%|βββββββββ | 144/176 [01:50<00:23, 1.34it/s]
|
| 3970 |
82%|βββββββββ | 145/176 [01:51<00:23, 1.34it/s]
|
| 3971 |
83%|βββββββββ | 146/176 [01:51<00:22, 1.32it/s]
|
| 3972 |
84%|βββββββββ | 147/176 [01:52<00:22, 1.31it/s]
|
| 3973 |
84%|βββββββββ | 148/176 [01:53<00:21, 1.31it/s]
|
| 3974 |
85%|βββββββββ | 149/176 [01:54<00:20, 1.30it/s]
|
| 3975 |
85%|βββββββββ | 150/176 [01:54<00:19, 1.30it/s]
|
| 3976 |
86%|βββββββββ | 151/176 [01:55<00:19, 1.31it/s]
|
| 3977 |
86%|βββββββββ | 152/176 [01:56<00:18, 1.30it/s]
|
| 3978 |
87%|βββββββββ | 153/176 [01:57<00:17, 1.29it/s]
|
| 3979 |
88%|βββββββββ | 154/176 [01:57<00:16, 1.32it/s]
|
| 3980 |
88%|βββββββββ | 155/176 [01:58<00:16, 1.31it/s]
|
| 3981 |
89%|βββββββββ | 156/176 [01:59<00:15, 1.31it/s]
|
| 3982 |
89%|βββββββββ | 157/176 [02:00<00:14, 1.35it/s]
|
| 3983 |
90%|βββββββββ | 158/176 [02:00<00:13, 1.33it/s]
|
| 3984 |
90%|βββββββββ | 159/176 [02:01<00:12, 1.31it/s]
|
| 3985 |
91%|βββββββββ | 160/176 [02:02<00:12, 1.32it/s]
|
| 3986 |
91%|ββββββββββ| 161/176 [02:03<00:11, 1.30it/s]
|
| 3987 |
92%|ββββββββββ| 162/176 [02:04<00:10, 1.31it/s]
|
| 3988 |
93%|οΏ½οΏ½βββββββββ| 163/176 [02:04<00:09, 1.32it/s]
|
| 3989 |
93%|ββββββββββ| 164/176 [02:05<00:08, 1.34it/s]
|
| 3990 |
94%|ββββββββββ| 165/176 [02:06<00:08, 1.30it/s]
|
| 3991 |
94%|ββββββββββ| 166/176 [02:07<00:07, 1.27it/s]
|
| 3992 |
95%|ββββββββββ| 167/176 [02:07<00:07, 1.28it/s]
|
| 3993 |
95%|ββββββββββ| 168/176 [02:08<00:06, 1.29it/s]
|
| 3994 |
96%|ββββββββββ| 169/176 [02:09<00:05, 1.32it/s]
|
| 3995 |
97%|ββββββββββ| 170/176 [02:10<00:04, 1.31it/s]
|
| 3996 |
97%|ββββββββββ| 171/176 [02:10<00:03, 1.32it/s]
|
| 3997 |
98%|ββββββββββ| 172/176 [02:11<00:03, 1.32it/s]
|
| 3998 |
98%|ββββββββββ| 173/176 [02:12<00:02, 1.31it/s]
|
| 3999 |
99%|ββββββββββ| 174/176 [02:13<00:01, 1.27it/s]
|
| 4000 |
99%|ββββββββββ| 175/176 [02:14<00:00, 1.27it/s]
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4001 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4002 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4003 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4004 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4005 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4006 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4007 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4008 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4009 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4010 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4011 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4012 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4013 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4014 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4015 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4016 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4017 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4018 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4019 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4020 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
| 4021 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4022 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4023 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4024 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4025 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4026 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4027 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4028 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4029 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4030 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4031 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4032 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4033 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4034 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4035 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4036 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4037 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4038 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4039 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4040 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4041 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB
|
|
|
|
| 4042 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4043 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4044 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4045 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4046 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB
|
|
|
|
| 4047 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4048 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4049 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB
|
|
|
|
| 4050 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B
|
|
|
|
| 3821 |
|
| 3822 |
|
| 3823 |
[A
|
| 3824 |
|
| 3825 |
|
| 3826 |
+
|
| 3827 |
+
Running final evaluation...
|
| 3828 |
+
|
| 3829 |
0%| | 0/176 [00:00<?, ?it/s]
|
| 3830 |
1%| | 2/176 [00:00<01:03, 2.73it/s]
|
| 3831 |
2%|β | 3/176 [00:01<01:29, 1.93it/s]
|
| 3832 |
2%|β | 4/176 [00:02<01:45, 1.63it/s]
|
| 3833 |
3%|β | 5/176 [00:02<01:53, 1.51it/s]
|
| 3834 |
3%|β | 6/176 [00:03<01:59, 1.43it/s]
|
| 3835 |
4%|β | 7/176 [00:04<02:03, 1.37it/s]
|
| 3836 |
5%|β | 8/176 [00:05<02:03, 1.36it/s]
|
| 3837 |
5%|β | 9/176 [00:06<02:04, 1.34it/s]
|
| 3838 |
6%|β | 10/176 [00:06<02:06, 1.32it/s]
|
| 3839 |
6%|β | 11/176 [00:07<02:05, 1.32it/s]
|
| 3840 |
7%|β | 12/176 [00:08<02:05, 1.31it/s]
|
| 3841 |
7%|β | 13/176 [00:09<02:04, 1.31it/s]
|
| 3842 |
8%|β | 14/176 [00:09<02:01, 1.34it/s]
|
| 3843 |
9%|β | 15/176 [00:10<02:01, 1.32it/s]
|
| 3844 |
9%|β | 16/176 [00:11<01:59, 1.34it/s]
|
| 3845 |
10%|β | 17/176 [00:12<02:00, 1.32it/s]
|
| 3846 |
10%|β | 18/176 [00:12<02:02, 1.29it/s]
|
| 3847 |
11%|β | 19/176 [00:13<02:02, 1.28it/s]
|
| 3848 |
11%|ββ | 20/176 [00:14<01:59, 1.30it/s]
|
| 3849 |
12%|ββ | 21/176 [00:15<01:59, 1.30it/s]
|
| 3850 |
12%|ββ | 22/176 [00:16<01:59, 1.29it/s]
|
| 3851 |
13%|ββ | 23/176 [00:16<01:57, 1.30it/s]
|
| 3852 |
14%|ββ | 24/176 [00:17<01:56, 1.31it/s]
|
| 3853 |
14%|ββ | 25/176 [00:18<01:56, 1.30it/s]
|
| 3854 |
15%|ββ | 26/176 [00:19<01:56, 1.29it/s]
|
| 3855 |
15%|ββ | 27/176 [00:19<01:55, 1.29it/s]
|
| 3856 |
16%|ββ | 28/176 [00:20<01:52, 1.31it/s]
|
| 3857 |
16%|ββ | 29/176 [00:21<01:54, 1.28it/s]
|
| 3858 |
17%|ββ | 30/176 [00:22<01:53, 1.29it/s]
|
| 3859 |
18%|ββ | 31/176 [00:23<01:51, 1.30it/s]
|
| 3860 |
18%|ββ | 32/176 [00:23<01:50, 1.31it/s]
|
| 3861 |
19%|ββ | 33/176 [00:24<01:49, 1.31it/s]
|
| 3862 |
19%|ββ | 34/176 [00:25<01:49, 1.29it/s]
|
| 3863 |
20%|ββ | 35/176 [00:26<01:47, 1.31it/s]
|
| 3864 |
20%|ββ | 36/176 [00:26<01:46, 1.32it/s]
|
| 3865 |
21%|ββ | 37/176 [00:27<01:46, 1.30it/s]
|
| 3866 |
22%|βββ | 38/176 [00:28<01:44, 1.32it/s]
|
| 3867 |
22%|βββ | 39/176 [00:29<01:44, 1.31it/s]
|
| 3868 |
23%|βββ | 40/176 [00:29<01:43, 1.32it/s]
|
| 3869 |
23%|βββ | 41/176 [00:30<01:42, 1.31it/s]
|
| 3870 |
24%|βββ | 42/176 [00:31<01:42, 1.30it/s]
|
| 3871 |
24%|βββ | 43/176 [00:32<01:42, 1.30it/s]
|
| 3872 |
25%|βββ | 44/176 [00:32<01:41, 1.30it/s]
|
| 3873 |
26%|βββ | 45/176 [00:33<01:39, 1.32it/s]
|
| 3874 |
26%|βββ | 46/176 [00:34<01:39, 1.30it/s]
|
| 3875 |
27%|βββ | 47/176 [00:35<01:37, 1.32it/s]
|
| 3876 |
27%|βββ | 48/176 [00:35<01:37, 1.32it/s]
|
| 3877 |
28%|βββ | 49/176 [00:36<01:37, 1.30it/s]
|
| 3878 |
28%|βββ | 50/176 [00:37<01:33, 1.34it/s]
|
| 3879 |
29%|βββ | 51/176 [00:38<01:32, 1.35it/s]
|
| 3880 |
30%|βββ | 52/176 [00:38<01:34, 1.32it/s]
|
| 3881 |
30%|βββ | 53/176 [00:39<01:37, 1.26it/s]
|
| 3882 |
31%|βββ | 54/176 [00:40<01:35, 1.27it/s]
|
| 3883 |
31%|ββββ | 55/176 [00:41<01:35, 1.27it/s]
|
| 3884 |
32%|ββββ | 56/176 [00:42<01:33, 1.28it/s]
|
| 3885 |
32%|ββββ | 57/176 [00:42<01:31, 1.30it/s]
|
| 3886 |
33%|ββββ | 58/176 [00:43<01:29, 1.32it/s]
|
| 3887 |
34%|ββββ | 59/176 [00:44<01:29, 1.31it/s]
|
| 3888 |
34%|ββββ | 60/176 [00:45<01:28, 1.31it/s]
|
| 3889 |
35%|ββββ | 61/176 [00:46<01:36, 1.19it/s]
|
| 3890 |
35%|ββββ | 62/176 [00:46<01:32, 1.23it/s]
|
| 3891 |
36%|ββββ | 63/176 [00:47<01:30, 1.25it/s]
|
| 3892 |
36%|ββββ | 64/176 [00:48<01:27, 1.27it/s]
|
| 3893 |
37%|ββββ | 65/176 [00:49<01:26, 1.29it/s]
|
| 3894 |
38%|ββββ | 66/176 [00:49<01:23, 1.32it/s]
|
| 3895 |
38%|ββββ | 67/176 [00:50<01:23, 1.30it/s]
|
| 3896 |
39%|ββββ | 68/176 [00:51<01:23, 1.29it/s]
|
| 3897 |
39%|ββββ | 69/176 [00:52<01:22, 1.29it/s]
|
| 3898 |
40%|ββββ | 70/176 [00:53<01:19, 1.33it/s]
|
| 3899 |
40%|ββββ | 71/176 [00:53<01:19, 1.32it/s]
|
| 3900 |
41%|ββββ | 72/176 [00:54<01:18, 1.32it/s]
|
| 3901 |
41%|βββββ | 73/176 [00:55<01:18, 1.31it/s]
|
| 3902 |
42%|βββββ | 74/176 [00:56<01:16, 1.33it/s]
|
| 3903 |
43%|βββββ | 75/176 [00:56<01:15, 1.33it/s]
|
| 3904 |
43%|βββββ | 76/176 [00:57<01:15, 1.32it/s]
|
| 3905 |
44%|βββββ | 77/176 [00:58<01:15, 1.31it/s]
|
| 3906 |
44%|βββββ | 78/176 [00:59<01:13, 1.33it/s]
|
| 3907 |
45%|βββββ | 79/176 [00:59<01:13, 1.32it/s]
|
| 3908 |
45%|βββββ | 80/176 [01:00<01:12, 1.32it/s]
|
| 3909 |
46%|βββββ | 81/176 [01:01<01:11, 1.33it/s]
|
| 3910 |
47%|βββββ | 82/176 [01:02<01:09, 1.34it/s]
|
| 3911 |
47%|βββββ | 83/176 [01:02<01:10, 1.32it/s]
|
| 3912 |
48%|βββββ | 84/176 [01:03<01:10, 1.31it/s]
|
| 3913 |
48%|βββββ | 85/176 [01:04<01:08, 1.33it/s]
|
| 3914 |
49%|βββββ | 86/176 [01:05<01:05, 1.36it/s]
|
| 3915 |
49%|βββββ | 87/176 [01:05<01:06, 1.34it/s]
|
| 3916 |
50%|βββββ | 88/176 [01:06<01:06, 1.33it/s]
|
| 3917 |
51%|βββββ | 89/176 [01:07<01:05, 1.33it/s]
|
| 3918 |
51%|βββββ | 90/176 [01:08<01:04, 1.33it/s]
|
| 3919 |
52%|ββββββ | 91/176 [01:08<01:02, 1.35it/s]
|
| 3920 |
52%|ββββββ | 92/176 [01:09<01:01, 1.36it/s]
|
| 3921 |
53%|ββββββ | 93/176 [01:10<01:07, 1.22it/s]
|
| 3922 |
53%|ββββββ | 94/176 [01:11<01:05, 1.26it/s]
|
| 3923 |
54%|ββββββ | 95/176 [01:13<01:28, 1.09s/it]
|
| 3924 |
55%|ββββββ | 96/176 [01:13<01:17, 1.03it/s]
|
| 3925 |
55%|ββββββ | 97/176 [01:14<01:12, 1.08it/s]
|
| 3926 |
56%|ββββββ | 98/176 [01:15<01:08, 1.15it/s]
|
| 3927 |
56%|ββββββ | 99/176 [01:16<01:05, 1.17it/s]
|
| 3928 |
57%|ββββββ | 100/176 [01:16<01:02, 1.23it/s]
|
| 3929 |
57%|ββββββ | 101/176 [01:17<01:00, 1.25it/s]
|
| 3930 |
58%|ββββββ | 102/176 [01:18<00:58, 1.27it/s]
|
| 3931 |
59%|ββββββ | 103/176 [01:19<00:56, 1.30it/s]
|
| 3932 |
59%|ββββββ | 104/176 [01:19<00:55, 1.30it/s]
|
| 3933 |
60%|ββββββ | 105/176 [01:20<00:54, 1.30it/s]
|
| 3934 |
60%|ββββββ | 106/176 [01:21<00:55, 1.26it/s]
|
| 3935 |
61%|ββββββ | 107/176 [01:22<00:53, 1.29it/s]
|
| 3936 |
61%|βββββββ | 108/176 [01:22<00:52, 1.30it/s]
|
| 3937 |
62%|βββββββ | 109/176 [01:23<00:50, 1.32it/s]
|
| 3938 |
62%|βββββββ | 110/176 [01:24<00:49, 1.33it/s]
|
| 3939 |
63%|βββββββ | 111/176 [01:25<00:48, 1.34it/s]
|
| 3940 |
64%|βββββββ | 112/176 [01:25<00:47, 1.35it/s]
|
| 3941 |
64%|βββββββ | 113/176 [01:26<00:46, 1.35it/s]
|
| 3942 |
65%|βββββββ | 114/176 [01:27<00:46, 1.32it/s]
|
| 3943 |
65%|βββββββ | 115/176 [01:28<00:46, 1.32it/s]
|
| 3944 |
66%|βββββββ | 116/176 [01:28<00:45, 1.33it/s]
|
| 3945 |
66%|βββββββ | 117/176 [01:29<00:44, 1.32it/s]
|
| 3946 |
67%|βββββββ | 118/176 [01:30<00:44, 1.30it/s]
|
| 3947 |
68%|βββββββ | 119/176 [01:31<00:43, 1.30it/s]
|
| 3948 |
68%|βββββββ | 120/176 [01:32<00:43, 1.28it/s]
|
| 3949 |
69%|βββββββ | 121/176 [01:32<00:43, 1.28it/s]
|
| 3950 |
69%|βββββββ | 122/176 [01:33<00:41, 1.29it/s]
|
| 3951 |
70%|βββββββ | 123/176 [01:34<00:40, 1.30it/s]
|
| 3952 |
70%|βββββββ | 124/176 [01:35<00:40, 1.29it/s]
|
| 3953 |
71%|βββββββ | 125/176 [01:35<00:39, 1.30it/s]
|
| 3954 |
72%|ββββββββ | 126/176 [01:36<00:38, 1.30it/s]
|
| 3955 |
72%|ββββββββ | 127/176 [01:37<00:37, 1.31it/s]
|
| 3956 |
73%|ββββββββ | 128/176 [01:38<00:36, 1.31it/s]
|
| 3957 |
73%|ββββββββ | 129/176 [01:38<00:35, 1.31it/s]
|
| 3958 |
74%|ββββββββ | 130/176 [01:39<00:35, 1.30it/s]
|
| 3959 |
74%|ββββββββ | 131/176 [01:40<00:33, 1.32it/s]
|
| 3960 |
75%|ββββββββ | 132/176 [01:41<00:33, 1.31it/s]
|
| 3961 |
76%|ββββββββ | 133/176 [01:41<00:32, 1.32it/s]
|
| 3962 |
76%|ββββββββ | 134/176 [01:42<00:31, 1.31it/s]
|
| 3963 |
77%|ββββββββ | 135/176 [01:43<00:31, 1.30it/s]
|
| 3964 |
77%|ββββββββ | 136/176 [01:44<00:30, 1.30it/s]
|
| 3965 |
78%|ββββββββ | 137/176 [01:45<00:29, 1.32it/s]
|
| 3966 |
78%|ββββββββ | 138/176 [01:45<00:28, 1.31it/s]
|
| 3967 |
79%|ββββββββ | 139/176 [01:46<00:27, 1.33it/s]
|
| 3968 |
80%|ββββββββ | 140/176 [01:47<00:27, 1.33it/s]
|
| 3969 |
80%|ββββββββ | 141/176 [01:48<00:26, 1.31it/s]
|
| 3970 |
81%|ββββββββ | 142/176 [01:48<00:25, 1.32it/s]
|
| 3971 |
81%|βββββββββ | 143/176 [01:49<00:24, 1.34it/s]
|
| 3972 |
82%|βββββββββ | 144/176 [01:50<00:23, 1.34it/s]
|
| 3973 |
82%|βββββββββ | 145/176 [01:51<00:23, 1.34it/s]
|
| 3974 |
83%|βββββββββ | 146/176 [01:51<00:22, 1.32it/s]
|
| 3975 |
84%|βββββββββ | 147/176 [01:52<00:22, 1.31it/s]
|
| 3976 |
84%|βββββββββ | 148/176 [01:53<00:21, 1.31it/s]
|
| 3977 |
85%|βββββββββ | 149/176 [01:54<00:20, 1.30it/s]
|
| 3978 |
85%|βββββββββ | 150/176 [01:54<00:19, 1.30it/s]
|
| 3979 |
86%|βββββββββ | 151/176 [01:55<00:19, 1.31it/s]
|
| 3980 |
86%|βββββββββ | 152/176 [01:56<00:18, 1.30it/s]
|
| 3981 |
87%|βββββββββ | 153/176 [01:57<00:17, 1.29it/s]
|
| 3982 |
88%|βββββββββ | 154/176 [01:57<00:16, 1.32it/s]
|
| 3983 |
88%|βββββββββ | 155/176 [01:58<00:16, 1.31it/s]
|
| 3984 |
89%|βββββββββ | 156/176 [01:59<00:15, 1.31it/s]
|
| 3985 |
89%|βββββββββ | 157/176 [02:00<00:14, 1.35it/s]
|
| 3986 |
90%|βββββββββ | 158/176 [02:00<00:13, 1.33it/s]
|
| 3987 |
90%|βββββββββ | 159/176 [02:01<00:12, 1.31it/s]
|
| 3988 |
91%|βββββββββ | 160/176 [02:02<00:12, 1.32it/s]
|
| 3989 |
91%|ββββββββββ| 161/176 [02:03<00:11, 1.30it/s]
|
| 3990 |
92%|ββββββββββ| 162/176 [02:04<00:10, 1.31it/s]
|
| 3991 |
93%|οΏ½οΏ½βββββββββ| 163/176 [02:04<00:09, 1.32it/s]
|
| 3992 |
93%|ββββββββββ| 164/176 [02:05<00:08, 1.34it/s]
|
| 3993 |
94%|ββββββββββ| 165/176 [02:06<00:08, 1.30it/s]
|
| 3994 |
94%|ββββββββββ| 166/176 [02:07<00:07, 1.27it/s]
|
| 3995 |
95%|ββββββββββ| 167/176 [02:07<00:07, 1.28it/s]
|
| 3996 |
95%|ββββββββββ| 168/176 [02:08<00:06, 1.29it/s]
|
| 3997 |
96%|ββββββββββ| 169/176 [02:09<00:05, 1.32it/s]
|
| 3998 |
97%|ββββββββββ| 170/176 [02:10<00:04, 1.31it/s]
|
| 3999 |
97%|ββββββββββ| 171/176 [02:10<00:03, 1.32it/s]
|
| 4000 |
98%|ββββββββββ| 172/176 [02:11<00:03, 1.32it/s]
|
| 4001 |
98%|ββββββββββ| 173/176 [02:12<00:02, 1.31it/s]
|
| 4002 |
99%|ββββββββββ| 174/176 [02:13<00:01, 1.27it/s]
|
| 4003 |
99%|ββββββββββ| 175/176 [02:14<00:00, 1.27it/s]
|
| 4004 |
+
|
| 4005 |
+
Final Evaluation Results:
|
| 4006 |
+
eval_loss: 0.5781
|
| 4007 |
+
eval_wer: 54.5829
|
| 4008 |
+
eval_wer_ortho: 57.7728
|
| 4009 |
+
eval_cer: 17.2107
|
| 4010 |
+
eval_runtime: 194.9835
|
| 4011 |
+
eval_samples_per_second: 14.3650
|
| 4012 |
+
eval_steps_per_second: 0.9030
|
| 4013 |
+
epoch: 6.0640
|
| 4014 |
+
|
| 4015 |
+
Saving final model to /workspace/experiments/exp_002_base_lora...
|
| 4016 |
+
|
| 4017 |
+
|
| 4018 |
+
|
| 4019 |
+
|
| 4020 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4021 |
+
|
| 4022 |
+
|
| 4023 |
+
|
| 4024 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4025 |
+
|
| 4026 |
+
|
| 4027 |
+
|
| 4028 |
+
|
| 4029 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4030 |
+
|
| 4031 |
+
|
| 4032 |
+
|
| 4033 |
+
|
| 4034 |
+
|
| 4035 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4036 |
+
|
| 4037 |
+
|
| 4038 |
+
|
| 4039 |
+
|
| 4040 |
+
|
| 4041 |
+
|
| 4042 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4043 |
+
|
| 4044 |
+
|
| 4045 |
+
|
| 4046 |
+
|
| 4047 |
+
|
| 4048 |
+
|
| 4049 |
+
|
| 4050 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4051 |
+
|
| 4052 |
+
|
| 4053 |
+
|
| 4054 |
+
|
| 4055 |
+
|
| 4056 |
+
|
| 4057 |
+
|
| 4058 |
+
|
| 4059 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4060 |
+
|
| 4061 |
+
|
| 4062 |
+
|
| 4063 |
+
|
| 4064 |
+
|
| 4065 |
+
|
| 4066 |
+
|
| 4067 |
+
|
| 4068 |
+
|
| 4069 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4070 |
+
|
| 4071 |
+
|
| 4072 |
+
|
| 4073 |
+
|
| 4074 |
+
|
| 4075 |
+
|
| 4076 |
+
|
| 4077 |
+
|
| 4078 |
+
|
| 4079 |
+
|
| 4080 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4081 |
+
|
| 4082 |
+
|
| 4083 |
+
|
| 4084 |
+
|
| 4085 |
+
|
| 4086 |
+
|
| 4087 |
+
|
| 4088 |
+
|
| 4089 |
+
|
| 4090 |
+
|
| 4091 |
+
|
| 4092 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4093 |
+
|
| 4094 |
+
|
| 4095 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4096 |
+
|
| 4097 |
+
|
| 4098 |
+
|
| 4099 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4100 |
+
|
| 4101 |
+
|
| 4102 |
+
|
| 4103 |
+
|
| 4104 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4105 |
+
|
| 4106 |
+
|
| 4107 |
+
|
| 4108 |
+
|
| 4109 |
+
|
| 4110 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4111 |
+
|
| 4112 |
+
|
| 4113 |
+
|
| 4114 |
+
|
| 4115 |
+
|
| 4116 |
+
|
| 4117 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4118 |
+
|
| 4119 |
+
|
| 4120 |
+
|
| 4121 |
+
|
| 4122 |
+
|
| 4123 |
+
|
| 4124 |
+
|
| 4125 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4126 |
+
|
| 4127 |
+
|
| 4128 |
+
|
| 4129 |
+
|
| 4130 |
+
|
| 4131 |
+
|
| 4132 |
+
|
| 4133 |
+
|
| 4134 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4135 |
+
|
| 4136 |
+
|
| 4137 |
+
|
| 4138 |
+
|
| 4139 |
+
|
| 4140 |
+
|
| 4141 |
+
|
| 4142 |
+
|
| 4143 |
+
|
| 4144 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4145 |
+
|
| 4146 |
+
|
| 4147 |
+
|
| 4148 |
+
|
| 4149 |
+
|
| 4150 |
+
|
| 4151 |
+
|
| 4152 |
+
|
| 4153 |
+
|
| 4154 |
+
|
| 4155 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4156 |
+
|
| 4157 |
+
|
| 4158 |
+
|
| 4159 |
+
|
| 4160 |
+
|
| 4161 |
+
|
| 4162 |
+
|
| 4163 |
+
|
| 4164 |
+
|
| 4165 |
+
|
| 4166 |
+
|
| 4167 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4168 |
+
|
| 4169 |
+
|
| 4170 |
+
|
| 4171 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4172 |
+
|
| 4173 |
+
|
| 4174 |
+
|
| 4175 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4176 |
+
|
| 4177 |
+
|
| 4178 |
+
|
| 4179 |
+
|
| 4180 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4181 |
+
|
| 4182 |
+
|
| 4183 |
+
|
| 4184 |
+
|
| 4185 |
+
|
| 4186 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4187 |
+
|
| 4188 |
+
|
| 4189 |
+
|
| 4190 |
+
|
| 4191 |
+
|
| 4192 |
+
|
| 4193 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4194 |
+
|
| 4195 |
+
|
| 4196 |
+
|
| 4197 |
+
|
| 4198 |
+
|
| 4199 |
+
|
| 4200 |
+
|
| 4201 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4202 |
+
|
| 4203 |
+
|
| 4204 |
+
|
| 4205 |
+
|
| 4206 |
+
|
| 4207 |
+
|
| 4208 |
+
|
| 4209 |
+
|
| 4210 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4211 |
+
|
| 4212 |
+
|
| 4213 |
+
|
| 4214 |
+
|
| 4215 |
+
|
| 4216 |
+
|
| 4217 |
+
|
| 4218 |
+
|
| 4219 |
+
|
| 4220 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4221 |
+
|
| 4222 |
+
|
| 4223 |
+
|
| 4224 |
+
|
| 4225 |
+
|
| 4226 |
+
|
| 4227 |
+
|
| 4228 |
+
|
| 4229 |
+
|
| 4230 |
+
|
| 4231 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4232 |
+
|
| 4233 |
+
|
| 4234 |
+
|
| 4235 |
+
|
| 4236 |
+
|
| 4237 |
+
|
| 4238 |
+
|
| 4239 |
+
|
| 4240 |
+
|
| 4241 |
+
|
| 4242 |
+
|
| 4243 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4244 |
+
|
| 4245 |
+
|
| 4246 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4247 |
+
|
| 4248 |
+
|
| 4249 |
+
|
| 4250 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4251 |
+
|
| 4252 |
+
|
| 4253 |
+
|
| 4254 |
+
|
| 4255 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4256 |
+
|
| 4257 |
+
|
| 4258 |
+
|
| 4259 |
+
|
| 4260 |
+
|
| 4261 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4262 |
+
|
| 4263 |
+
|
| 4264 |
+
|
| 4265 |
+
|
| 4266 |
+
|
| 4267 |
+
|
| 4268 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4269 |
+
|
| 4270 |
+
|
| 4271 |
+
|
| 4272 |
+
|
| 4273 |
+
|
| 4274 |
+
|
| 4275 |
+
|
| 4276 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4277 |
+
|
| 4278 |
+
|
| 4279 |
+
|
| 4280 |
+
|
| 4281 |
+
|
| 4282 |
+
|
| 4283 |
+
|
| 4284 |
+
|
| 4285 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4286 |
+
|
| 4287 |
+
|
| 4288 |
+
|
| 4289 |
+
|
| 4290 |
+
|
| 4291 |
+
|
| 4292 |
+
|
| 4293 |
+
|
| 4294 |
+
|
| 4295 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4296 |
+
|
| 4297 |
+
|
| 4298 |
+
|
| 4299 |
+
|
| 4300 |
+
|
| 4301 |
+
|
| 4302 |
+
|
| 4303 |
+
|
| 4304 |
+
|
| 4305 |
+
|
| 4306 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4307 |
+
|
| 4308 |
+
|
| 4309 |
+
|
| 4310 |
+
|
| 4311 |
+
|
| 4312 |
+
|
| 4313 |
+
|
| 4314 |
+
|
| 4315 |
+
|
| 4316 |
+
|
| 4317 |
+
|
| 4318 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4319 |
+
|
| 4320 |
+
|
| 4321 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB
|
| 4322 |
+
|
| 4323 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4324 |
+
|
| 4325 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4326 |
+
|
| 4327 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4328 |
+
|
| 4329 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4330 |
+
|
| 4331 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB
|
| 4332 |
+
|
| 4333 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4334 |
+
|
| 4335 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4336 |
+
|
| 4337 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB
|
| 4338 |
+
|
| 4339 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B
|
nohup.log
CHANGED
|
@@ -3828,3 +3828,292 @@ The attention mask is not set and cannot be inferred from input because pad toke
|
|
| 3828 |
|
| 3829 |
|
| 3830 |
[A
|
| 3831 |
|
| 3832 |
|
|
|
|
|
|
|
|
|
|
| 3833 |
0%| | 0/176 [00:00<?, ?it/s]
|
| 3834 |
1%| | 2/176 [00:00<01:03, 2.73it/s]
|
| 3835 |
2%|β | 3/176 [00:01<01:29, 1.93it/s]
|
| 3836 |
2%|β | 4/176 [00:02<01:45, 1.63it/s]
|
| 3837 |
3%|β | 5/176 [00:02<01:53, 1.51it/s]
|
| 3838 |
3%|β | 6/176 [00:03<01:59, 1.43it/s]
|
| 3839 |
4%|β | 7/176 [00:04<02:03, 1.37it/s]
|
| 3840 |
5%|β | 8/176 [00:05<02:03, 1.36it/s]
|
| 3841 |
5%|β | 9/176 [00:06<02:04, 1.34it/s]
|
| 3842 |
6%|β | 10/176 [00:06<02:06, 1.32it/s]
|
| 3843 |
6%|β | 11/176 [00:07<02:05, 1.32it/s]
|
| 3844 |
7%|β | 12/176 [00:08<02:05, 1.31it/s]
|
| 3845 |
7%|β | 13/176 [00:09<02:04, 1.31it/s]
|
| 3846 |
8%|β | 14/176 [00:09<02:01, 1.34it/s]
|
| 3847 |
9%|β | 15/176 [00:10<02:01, 1.32it/s]
|
| 3848 |
9%|β | 16/176 [00:11<01:59, 1.34it/s]
|
| 3849 |
10%|β | 17/176 [00:12<02:00, 1.32it/s]
|
| 3850 |
10%|β | 18/176 [00:12<02:02, 1.29it/s]
|
| 3851 |
11%|β | 19/176 [00:13<02:02, 1.28it/s]
|
| 3852 |
11%|ββ | 20/176 [00:14<01:59, 1.30it/s]
|
| 3853 |
12%|ββ | 21/176 [00:15<01:59, 1.30it/s]
|
| 3854 |
12%|ββ | 22/176 [00:16<01:59, 1.29it/s]
|
| 3855 |
13%|ββ | 23/176 [00:16<01:57, 1.30it/s]
|
| 3856 |
14%|ββ | 24/176 [00:17<01:56, 1.31it/s]
|
| 3857 |
14%|ββ | 25/176 [00:18<01:56, 1.30it/s]
|
| 3858 |
15%|ββ | 26/176 [00:19<01:56, 1.29it/s]
|
| 3859 |
15%|ββ | 27/176 [00:19<01:55, 1.29it/s]
|
| 3860 |
16%|ββ | 28/176 [00:20<01:52, 1.31it/s]
|
| 3861 |
16%|ββ | 29/176 [00:21<01:54, 1.28it/s]
|
| 3862 |
17%|ββ | 30/176 [00:22<01:53, 1.29it/s]
|
| 3863 |
18%|ββ | 31/176 [00:23<01:51, 1.30it/s]
|
| 3864 |
18%|ββ | 32/176 [00:23<01:50, 1.31it/s]
|
| 3865 |
19%|ββ | 33/176 [00:24<01:49, 1.31it/s]
|
| 3866 |
19%|ββ | 34/176 [00:25<01:49, 1.29it/s]
|
| 3867 |
20%|ββ | 35/176 [00:26<01:47, 1.31it/s]
|
| 3868 |
20%|ββ | 36/176 [00:26<01:46, 1.32it/s]
|
| 3869 |
21%|ββ | 37/176 [00:27<01:46, 1.30it/s]
|
| 3870 |
22%|βββ | 38/176 [00:28<01:44, 1.32it/s]
|
| 3871 |
22%|βββ | 39/176 [00:29<01:44, 1.31it/s]
|
| 3872 |
23%|βββ | 40/176 [00:29<01:43, 1.32it/s]
|
| 3873 |
23%|βββ | 41/176 [00:30<01:42, 1.31it/s]
|
| 3874 |
24%|βββ | 42/176 [00:31<01:42, 1.30it/s]
|
| 3875 |
24%|βββ | 43/176 [00:32<01:42, 1.30it/s]
|
| 3876 |
25%|βββ | 44/176 [00:32<01:41, 1.30it/s]
|
| 3877 |
26%|βββ | 45/176 [00:33<01:39, 1.32it/s]
|
| 3878 |
26%|βββ | 46/176 [00:34<01:39, 1.30it/s]
|
| 3879 |
27%|βββ | 47/176 [00:35<01:37, 1.32it/s]
|
| 3880 |
27%|βββ | 48/176 [00:35<01:37, 1.32it/s]
|
| 3881 |
28%|βββ | 49/176 [00:36<01:37, 1.30it/s]
|
| 3882 |
28%|βββ | 50/176 [00:37<01:33, 1.34it/s]
|
| 3883 |
29%|βββ | 51/176 [00:38<01:32, 1.35it/s]
|
| 3884 |
30%|βββ | 52/176 [00:38<01:34, 1.32it/s]
|
| 3885 |
30%|βββ | 53/176 [00:39<01:37, 1.26it/s]
|
| 3886 |
31%|βββ | 54/176 [00:40<01:35, 1.27it/s]
|
| 3887 |
31%|ββββ | 55/176 [00:41<01:35, 1.27it/s]
|
| 3888 |
32%|ββββ | 56/176 [00:42<01:33, 1.28it/s]
|
| 3889 |
32%|ββββ | 57/176 [00:42<01:31, 1.30it/s]
|
| 3890 |
33%|ββββ | 58/176 [00:43<01:29, 1.32it/s]
|
| 3891 |
34%|ββββ | 59/176 [00:44<01:29, 1.31it/s]
|
| 3892 |
34%|ββββ | 60/176 [00:45<01:28, 1.31it/s]
|
| 3893 |
35%|ββββ | 61/176 [00:46<01:36, 1.19it/s]
|
| 3894 |
35%|ββββ | 62/176 [00:46<01:32, 1.23it/s]
|
| 3895 |
36%|ββββ | 63/176 [00:47<01:30, 1.25it/s]
|
| 3896 |
36%|ββββ | 64/176 [00:48<01:27, 1.27it/s]
|
| 3897 |
37%|ββββ | 65/176 [00:49<01:26, 1.29it/s]
|
| 3898 |
38%|ββββ | 66/176 [00:49<01:23, 1.32it/s]
|
| 3899 |
38%|ββββ | 67/176 [00:50<01:23, 1.30it/s]
|
| 3900 |
39%|ββββ | 68/176 [00:51<01:23, 1.29it/s]
|
| 3901 |
39%|ββββ | 69/176 [00:52<01:22, 1.29it/s]
|
| 3902 |
40%|ββββ | 70/176 [00:53<01:19, 1.33it/s]
|
| 3903 |
40%|ββββ | 71/176 [00:53<01:19, 1.32it/s]
|
| 3904 |
41%|ββββ | 72/176 [00:54<01:18, 1.32it/s]
|
| 3905 |
41%|βββββ | 73/176 [00:55<01:18, 1.31it/s]
|
| 3906 |
42%|βββββ | 74/176 [00:56<01:16, 1.33it/s]
|
| 3907 |
43%|βββββ | 75/176 [00:56<01:15, 1.33it/s]
|
| 3908 |
43%|βββββ | 76/176 [00:57<01:15, 1.32it/s]
|
| 3909 |
44%|βββββ | 77/176 [00:58<01:15, 1.31it/s]
|
| 3910 |
44%|βββββ | 78/176 [00:59<01:13, 1.33it/s]
|
| 3911 |
45%|βββββ | 79/176 [00:59<01:13, 1.32it/s]
|
| 3912 |
45%|βββββ | 80/176 [01:00<01:12, 1.32it/s]
|
| 3913 |
46%|βββββ | 81/176 [01:01<01:11, 1.33it/s]
|
| 3914 |
47%|βββββ | 82/176 [01:02<01:09, 1.34it/s]
|
| 3915 |
47%|βββββ | 83/176 [01:02<01:10, 1.32it/s]
|
| 3916 |
48%|βββββ | 84/176 [01:03<01:10, 1.31it/s]
|
| 3917 |
48%|βββββ | 85/176 [01:04<01:08, 1.33it/s]
|
| 3918 |
49%|βββββ | 86/176 [01:05<01:05, 1.36it/s]
|
| 3919 |
49%|βββββ | 87/176 [01:05<01:06, 1.34it/s]
|
| 3920 |
50%|βββββ | 88/176 [01:06<01:06, 1.33it/s]
|
| 3921 |
51%|βββββ | 89/176 [01:07<01:05, 1.33it/s]
|
| 3922 |
51%|βββββ | 90/176 [01:08<01:04, 1.33it/s]
|
| 3923 |
52%|ββββββ | 91/176 [01:08<01:02, 1.35it/s]
|
| 3924 |
52%|ββββββ | 92/176 [01:09<01:01, 1.36it/s]
|
| 3925 |
53%|ββββββ | 93/176 [01:10<01:07, 1.22it/s]
|
| 3926 |
53%|ββββββ | 94/176 [01:11<01:05, 1.26it/s]
|
| 3927 |
54%|ββββββ | 95/176 [01:13<01:28, 1.09s/it]
|
| 3928 |
55%|ββββββ | 96/176 [01:13<01:17, 1.03it/s]
|
| 3929 |
55%|ββββββ | 97/176 [01:14<01:12, 1.08it/s]
|
| 3930 |
56%|ββββββ | 98/176 [01:15<01:08, 1.15it/s]
|
| 3931 |
56%|ββββββ | 99/176 [01:16<01:05, 1.17it/s]
|
| 3932 |
57%|ββββββ | 100/176 [01:16<01:02, 1.23it/s]
|
| 3933 |
57%|ββββββ | 101/176 [01:17<01:00, 1.25it/s]
|
| 3934 |
58%|ββββββ | 102/176 [01:18<00:58, 1.27it/s]
|
| 3935 |
59%|ββββββ | 103/176 [01:19<00:56, 1.30it/s]
|
| 3936 |
59%|ββββββ | 104/176 [01:19<00:55, 1.30it/s]
|
| 3937 |
60%|ββββββ | 105/176 [01:20<00:54, 1.30it/s]
|
| 3938 |
60%|ββββββ | 106/176 [01:21<00:55, 1.26it/s]
|
| 3939 |
61%|ββββββ | 107/176 [01:22<00:53, 1.29it/s]
|
| 3940 |
61%|βββββββ | 108/176 [01:22<00:52, 1.30it/s]
|
| 3941 |
62%|βββββββ | 109/176 [01:23<00:50, 1.32it/s]
|
| 3942 |
62%|βββββββ | 110/176 [01:24<00:49, 1.33it/s]
|
| 3943 |
63%|βββββββ | 111/176 [01:25<00:48, 1.34it/s]
|
| 3944 |
64%|βββββββ | 112/176 [01:25<00:47, 1.35it/s]
|
| 3945 |
64%|βββββββ | 113/176 [01:26<00:46, 1.35it/s]
|
| 3946 |
65%|βββββββ | 114/176 [01:27<00:46, 1.32it/s]
|
| 3947 |
65%|βββββββ | 115/176 [01:28<00:46, 1.32it/s]
|
| 3948 |
66%|βββββββ | 116/176 [01:28<00:45, 1.33it/s]
|
| 3949 |
66%|βββββββ | 117/176 [01:29<00:44, 1.32it/s]
|
| 3950 |
67%|βββββββ | 118/176 [01:30<00:44, 1.30it/s]
|
| 3951 |
68%|βββββββ | 119/176 [01:31<00:43, 1.30it/s]
|
| 3952 |
68%|βββββββ | 120/176 [01:32<00:43, 1.28it/s]
|
| 3953 |
69%|βββββββ | 121/176 [01:32<00:43, 1.28it/s]
|
| 3954 |
69%|βββββββ | 122/176 [01:33<00:41, 1.29it/s]
|
| 3955 |
70%|βββββββ | 123/176 [01:34<00:40, 1.30it/s]
|
| 3956 |
70%|βββββββ | 124/176 [01:35<00:40, 1.29it/s]
|
| 3957 |
71%|βββββββ | 125/176 [01:35<00:39, 1.30it/s]
|
| 3958 |
72%|ββββββββ | 126/176 [01:36<00:38, 1.30it/s]
|
| 3959 |
72%|ββββββββ | 127/176 [01:37<00:37, 1.31it/s]
|
| 3960 |
73%|ββββββββ | 128/176 [01:38<00:36, 1.31it/s]
|
| 3961 |
73%|ββββββββ | 129/176 [01:38<00:35, 1.31it/s]
|
| 3962 |
74%|ββββββββ | 130/176 [01:39<00:35, 1.30it/s]
|
| 3963 |
74%|ββββββββ | 131/176 [01:40<00:33, 1.32it/s]
|
| 3964 |
75%|ββββββββ | 132/176 [01:41<00:33, 1.31it/s]
|
| 3965 |
76%|ββββββββ | 133/176 [01:41<00:32, 1.32it/s]
|
| 3966 |
76%|ββββββββ | 134/176 [01:42<00:31, 1.31it/s]
|
| 3967 |
77%|ββββββββ | 135/176 [01:43<00:31, 1.30it/s]
|
| 3968 |
77%|ββββββββ | 136/176 [01:44<00:30, 1.30it/s]
|
| 3969 |
78%|ββββββββ | 137/176 [01:45<00:29, 1.32it/s]
|
| 3970 |
78%|ββββββββ | 138/176 [01:45<00:28, 1.31it/s]
|
| 3971 |
79%|ββββββββ | 139/176 [01:46<00:27, 1.33it/s]
|
| 3972 |
80%|ββββββββ | 140/176 [01:47<00:27, 1.33it/s]
|
| 3973 |
80%|ββββββββ | 141/176 [01:48<00:26, 1.31it/s]
|
| 3974 |
81%|ββββββββ | 142/176 [01:48<00:25, 1.32it/s]
|
| 3975 |
81%|βββββββββ | 143/176 [01:49<00:24, 1.34it/s]
|
| 3976 |
82%|βββββββββ | 144/176 [01:50<00:23, 1.34it/s]
|
| 3977 |
82%|βββββββββ | 145/176 [01:51<00:23, 1.34it/s]
|
| 3978 |
83%|βββββββββ | 146/176 [01:51<00:22, 1.32it/s]
|
| 3979 |
84%|βββββββββ | 147/176 [01:52<00:22, 1.31it/s]
|
| 3980 |
84%|βββββββββ | 148/176 [01:53<00:21, 1.31it/s]
|
| 3981 |
85%|βββββββββ | 149/176 [01:54<00:20, 1.30it/s]
|
| 3982 |
85%|βββββββββ | 150/176 [01:54<00:19, 1.30it/s]
|
| 3983 |
86%|βββββββββ | 151/176 [01:55<00:19, 1.31it/s]
|
| 3984 |
86%|βββββββββ | 152/176 [01:56<00:18, 1.30it/s]
|
| 3985 |
87%|βββββββββ | 153/176 [01:57<00:17, 1.29it/s]
|
| 3986 |
88%|βββββββββ | 154/176 [01:57<00:16, 1.32it/s]
|
| 3987 |
88%|βββββββββ | 155/176 [01:58<00:16, 1.31it/s]
|
| 3988 |
89%|βββββββββ | 156/176 [01:59<00:15, 1.31it/s]
|
| 3989 |
89%|βββββββββ | 157/176 [02:00<00:14, 1.35it/s]
|
| 3990 |
90%|βββββββββ | 158/176 [02:00<00:13, 1.33it/s]
|
| 3991 |
90%|βββββββββ | 159/176 [02:01<00:12, 1.31it/s]
|
| 3992 |
91%|βββββββββ | 160/176 [02:02<00:12, 1.32it/s]
|
| 3993 |
91%|ββββββββββ| 161/176 [02:03<00:11, 1.30it/s]
|
| 3994 |
92%|ββββββββββ| 162/176 [02:04<00:10, 1.31it/s]
|
| 3995 |
93%|οΏ½οΏ½βββββββββ| 163/176 [02:04<00:09, 1.32it/s]
|
| 3996 |
93%|ββββββββββ| 164/176 [02:05<00:08, 1.34it/s]
|
| 3997 |
94%|ββββββββββ| 165/176 [02:06<00:08, 1.30it/s]
|
| 3998 |
94%|ββββββββββ| 166/176 [02:07<00:07, 1.27it/s]
|
| 3999 |
95%|ββββββββββ| 167/176 [02:07<00:07, 1.28it/s]
|
| 4000 |
95%|ββββββββββ| 168/176 [02:08<00:06, 1.29it/s]
|
| 4001 |
96%|ββββββββββ| 169/176 [02:09<00:05, 1.32it/s]
|
| 4002 |
97%|ββββββββββ| 170/176 [02:10<00:04, 1.31it/s]
|
| 4003 |
97%|ββββββββββ| 171/176 [02:10<00:03, 1.32it/s]
|
| 4004 |
98%|ββββββββββ| 172/176 [02:11<00:03, 1.32it/s]
|
| 4005 |
98%|ββββββββββ| 173/176 [02:12<00:02, 1.31it/s]
|
| 4006 |
99%|ββββββββββ| 174/176 [02:13<00:01, 1.27it/s]
|
| 4007 |
99%|ββββββββββ| 175/176 [02:14<00:00, 1.27it/s]
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4008 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4009 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4010 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4011 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4012 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4013 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4014 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4015 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4016 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4017 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4018 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4019 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4020 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4021 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4022 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4023 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4024 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4025 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4026 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4027 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
| 4028 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4029 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4030 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4031 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4032 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4033 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4034 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4035 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4036 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4037 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4038 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
|
|
|
|
|
|
|
|
|
| 4039 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4040 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4041 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4042 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4043 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4044 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4045 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4046 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4047 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
|
|
|
|
|
|
| 4048 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB
|
|
|
|
| 4049 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4050 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4051 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4052 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4053 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB
|
|
|
|
| 4054 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
|
|
|
| 4055 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
|
|
|
| 4056 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB
|
|
|
|
| 4057 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B
|
|
|
|
| 3828 |
|
| 3829 |
|
| 3830 |
[A
|
| 3831 |
|
| 3832 |
|
| 3833 |
+
|
| 3834 |
+
Running final evaluation...
|
| 3835 |
+
|
| 3836 |
0%| | 0/176 [00:00<?, ?it/s]
|
| 3837 |
1%| | 2/176 [00:00<01:03, 2.73it/s]
|
| 3838 |
2%|β | 3/176 [00:01<01:29, 1.93it/s]
|
| 3839 |
2%|β | 4/176 [00:02<01:45, 1.63it/s]
|
| 3840 |
3%|β | 5/176 [00:02<01:53, 1.51it/s]
|
| 3841 |
3%|β | 6/176 [00:03<01:59, 1.43it/s]
|
| 3842 |
4%|β | 7/176 [00:04<02:03, 1.37it/s]
|
| 3843 |
5%|β | 8/176 [00:05<02:03, 1.36it/s]
|
| 3844 |
5%|β | 9/176 [00:06<02:04, 1.34it/s]
|
| 3845 |
6%|β | 10/176 [00:06<02:06, 1.32it/s]
|
| 3846 |
6%|β | 11/176 [00:07<02:05, 1.32it/s]
|
| 3847 |
7%|β | 12/176 [00:08<02:05, 1.31it/s]
|
| 3848 |
7%|β | 13/176 [00:09<02:04, 1.31it/s]
|
| 3849 |
8%|β | 14/176 [00:09<02:01, 1.34it/s]
|
| 3850 |
9%|β | 15/176 [00:10<02:01, 1.32it/s]
|
| 3851 |
9%|β | 16/176 [00:11<01:59, 1.34it/s]
|
| 3852 |
10%|β | 17/176 [00:12<02:00, 1.32it/s]
|
| 3853 |
10%|β | 18/176 [00:12<02:02, 1.29it/s]
|
| 3854 |
11%|β | 19/176 [00:13<02:02, 1.28it/s]
|
| 3855 |
11%|ββ | 20/176 [00:14<01:59, 1.30it/s]
|
| 3856 |
12%|ββ | 21/176 [00:15<01:59, 1.30it/s]
|
| 3857 |
12%|ββ | 22/176 [00:16<01:59, 1.29it/s]
|
| 3858 |
13%|ββ | 23/176 [00:16<01:57, 1.30it/s]
|
| 3859 |
14%|ββ | 24/176 [00:17<01:56, 1.31it/s]
|
| 3860 |
14%|ββ | 25/176 [00:18<01:56, 1.30it/s]
|
| 3861 |
15%|ββ | 26/176 [00:19<01:56, 1.29it/s]
|
| 3862 |
15%|ββ | 27/176 [00:19<01:55, 1.29it/s]
|
| 3863 |
16%|ββ | 28/176 [00:20<01:52, 1.31it/s]
|
| 3864 |
16%|ββ | 29/176 [00:21<01:54, 1.28it/s]
|
| 3865 |
17%|ββ | 30/176 [00:22<01:53, 1.29it/s]
|
| 3866 |
18%|ββ | 31/176 [00:23<01:51, 1.30it/s]
|
| 3867 |
18%|ββ | 32/176 [00:23<01:50, 1.31it/s]
|
| 3868 |
19%|ββ | 33/176 [00:24<01:49, 1.31it/s]
|
| 3869 |
19%|ββ | 34/176 [00:25<01:49, 1.29it/s]
|
| 3870 |
20%|ββ | 35/176 [00:26<01:47, 1.31it/s]
|
| 3871 |
20%|ββ | 36/176 [00:26<01:46, 1.32it/s]
|
| 3872 |
21%|ββ | 37/176 [00:27<01:46, 1.30it/s]
|
| 3873 |
22%|βββ | 38/176 [00:28<01:44, 1.32it/s]
|
| 3874 |
22%|βββ | 39/176 [00:29<01:44, 1.31it/s]
|
| 3875 |
23%|βββ | 40/176 [00:29<01:43, 1.32it/s]
|
| 3876 |
23%|βββ | 41/176 [00:30<01:42, 1.31it/s]
|
| 3877 |
24%|βββ | 42/176 [00:31<01:42, 1.30it/s]
|
| 3878 |
24%|βββ | 43/176 [00:32<01:42, 1.30it/s]
|
| 3879 |
25%|βββ | 44/176 [00:32<01:41, 1.30it/s]
|
| 3880 |
26%|βββ | 45/176 [00:33<01:39, 1.32it/s]
|
| 3881 |
26%|βββ | 46/176 [00:34<01:39, 1.30it/s]
|
| 3882 |
27%|βββ | 47/176 [00:35<01:37, 1.32it/s]
|
| 3883 |
27%|βββ | 48/176 [00:35<01:37, 1.32it/s]
|
| 3884 |
28%|βββ | 49/176 [00:36<01:37, 1.30it/s]
|
| 3885 |
28%|βββ | 50/176 [00:37<01:33, 1.34it/s]
|
| 3886 |
29%|βββ | 51/176 [00:38<01:32, 1.35it/s]
|
| 3887 |
30%|βββ | 52/176 [00:38<01:34, 1.32it/s]
|
| 3888 |
30%|βββ | 53/176 [00:39<01:37, 1.26it/s]
|
| 3889 |
31%|βββ | 54/176 [00:40<01:35, 1.27it/s]
|
| 3890 |
31%|ββββ | 55/176 [00:41<01:35, 1.27it/s]
|
| 3891 |
32%|ββββ | 56/176 [00:42<01:33, 1.28it/s]
|
| 3892 |
32%|ββββ | 57/176 [00:42<01:31, 1.30it/s]
|
| 3893 |
33%|ββββ | 58/176 [00:43<01:29, 1.32it/s]
|
| 3894 |
34%|ββββ | 59/176 [00:44<01:29, 1.31it/s]
|
| 3895 |
34%|ββββ | 60/176 [00:45<01:28, 1.31it/s]
|
| 3896 |
35%|ββββ | 61/176 [00:46<01:36, 1.19it/s]
|
| 3897 |
35%|ββββ | 62/176 [00:46<01:32, 1.23it/s]
|
| 3898 |
36%|ββββ | 63/176 [00:47<01:30, 1.25it/s]
|
| 3899 |
36%|ββββ | 64/176 [00:48<01:27, 1.27it/s]
|
| 3900 |
37%|ββββ | 65/176 [00:49<01:26, 1.29it/s]
|
| 3901 |
38%|ββββ | 66/176 [00:49<01:23, 1.32it/s]
|
| 3902 |
38%|ββββ | 67/176 [00:50<01:23, 1.30it/s]
|
| 3903 |
39%|ββββ | 68/176 [00:51<01:23, 1.29it/s]
|
| 3904 |
39%|ββββ | 69/176 [00:52<01:22, 1.29it/s]
|
| 3905 |
40%|ββββ | 70/176 [00:53<01:19, 1.33it/s]
|
| 3906 |
40%|ββββ | 71/176 [00:53<01:19, 1.32it/s]
|
| 3907 |
41%|ββββ | 72/176 [00:54<01:18, 1.32it/s]
|
| 3908 |
41%|βββββ | 73/176 [00:55<01:18, 1.31it/s]
|
| 3909 |
42%|βββββ | 74/176 [00:56<01:16, 1.33it/s]
|
| 3910 |
43%|βββββ | 75/176 [00:56<01:15, 1.33it/s]
|
| 3911 |
43%|βββββ | 76/176 [00:57<01:15, 1.32it/s]
|
| 3912 |
44%|βββββ | 77/176 [00:58<01:15, 1.31it/s]
|
| 3913 |
44%|βββββ | 78/176 [00:59<01:13, 1.33it/s]
|
| 3914 |
45%|βββββ | 79/176 [00:59<01:13, 1.32it/s]
|
| 3915 |
45%|βββββ | 80/176 [01:00<01:12, 1.32it/s]
|
| 3916 |
46%|βββββ | 81/176 [01:01<01:11, 1.33it/s]
|
| 3917 |
47%|βββββ | 82/176 [01:02<01:09, 1.34it/s]
|
| 3918 |
47%|βββββ | 83/176 [01:02<01:10, 1.32it/s]
|
| 3919 |
48%|βββββ | 84/176 [01:03<01:10, 1.31it/s]
|
| 3920 |
48%|βββββ | 85/176 [01:04<01:08, 1.33it/s]
|
| 3921 |
49%|βββββ | 86/176 [01:05<01:05, 1.36it/s]
|
| 3922 |
49%|βββββ | 87/176 [01:05<01:06, 1.34it/s]
|
| 3923 |
50%|βββββ | 88/176 [01:06<01:06, 1.33it/s]
|
| 3924 |
51%|βββββ | 89/176 [01:07<01:05, 1.33it/s]
|
| 3925 |
51%|βββββ | 90/176 [01:08<01:04, 1.33it/s]
|
| 3926 |
52%|ββββββ | 91/176 [01:08<01:02, 1.35it/s]
|
| 3927 |
52%|ββββββ | 92/176 [01:09<01:01, 1.36it/s]
|
| 3928 |
53%|ββββββ | 93/176 [01:10<01:07, 1.22it/s]
|
| 3929 |
53%|ββββββ | 94/176 [01:11<01:05, 1.26it/s]
|
| 3930 |
54%|ββββββ | 95/176 [01:13<01:28, 1.09s/it]
|
| 3931 |
55%|ββββββ | 96/176 [01:13<01:17, 1.03it/s]
|
| 3932 |
55%|ββββββ | 97/176 [01:14<01:12, 1.08it/s]
|
| 3933 |
56%|ββββββ | 98/176 [01:15<01:08, 1.15it/s]
|
| 3934 |
56%|ββββββ | 99/176 [01:16<01:05, 1.17it/s]
|
| 3935 |
57%|ββββββ | 100/176 [01:16<01:02, 1.23it/s]
|
| 3936 |
57%|ββββββ | 101/176 [01:17<01:00, 1.25it/s]
|
| 3937 |
58%|ββββββ | 102/176 [01:18<00:58, 1.27it/s]
|
| 3938 |
59%|ββββββ | 103/176 [01:19<00:56, 1.30it/s]
|
| 3939 |
59%|ββββββ | 104/176 [01:19<00:55, 1.30it/s]
|
| 3940 |
60%|ββββββ | 105/176 [01:20<00:54, 1.30it/s]
|
| 3941 |
60%|ββββββ | 106/176 [01:21<00:55, 1.26it/s]
|
| 3942 |
61%|ββββββ | 107/176 [01:22<00:53, 1.29it/s]
|
| 3943 |
61%|βββββββ | 108/176 [01:22<00:52, 1.30it/s]
|
| 3944 |
62%|βββββββ | 109/176 [01:23<00:50, 1.32it/s]
|
| 3945 |
62%|βββββββ | 110/176 [01:24<00:49, 1.33it/s]
|
| 3946 |
63%|βββββββ | 111/176 [01:25<00:48, 1.34it/s]
|
| 3947 |
64%|βββββββ | 112/176 [01:25<00:47, 1.35it/s]
|
| 3948 |
64%|βββββββ | 113/176 [01:26<00:46, 1.35it/s]
|
| 3949 |
65%|βββββββ | 114/176 [01:27<00:46, 1.32it/s]
|
| 3950 |
65%|βββββββ | 115/176 [01:28<00:46, 1.32it/s]
|
| 3951 |
66%|βββββββ | 116/176 [01:28<00:45, 1.33it/s]
|
| 3952 |
66%|βββββββ | 117/176 [01:29<00:44, 1.32it/s]
|
| 3953 |
67%|βββββββ | 118/176 [01:30<00:44, 1.30it/s]
|
| 3954 |
68%|βββββββ | 119/176 [01:31<00:43, 1.30it/s]
|
| 3955 |
68%|βββββββ | 120/176 [01:32<00:43, 1.28it/s]
|
| 3956 |
69%|βββββββ | 121/176 [01:32<00:43, 1.28it/s]
|
| 3957 |
69%|βββββββ | 122/176 [01:33<00:41, 1.29it/s]
|
| 3958 |
70%|βββββββ | 123/176 [01:34<00:40, 1.30it/s]
|
| 3959 |
70%|βββββββ | 124/176 [01:35<00:40, 1.29it/s]
|
| 3960 |
71%|βββββββ | 125/176 [01:35<00:39, 1.30it/s]
|
| 3961 |
72%|ββββββββ | 126/176 [01:36<00:38, 1.30it/s]
|
| 3962 |
72%|ββββββββ | 127/176 [01:37<00:37, 1.31it/s]
|
| 3963 |
73%|ββββββββ | 128/176 [01:38<00:36, 1.31it/s]
|
| 3964 |
73%|ββββββββ | 129/176 [01:38<00:35, 1.31it/s]
|
| 3965 |
74%|ββββββββ | 130/176 [01:39<00:35, 1.30it/s]
|
| 3966 |
74%|ββββββββ | 131/176 [01:40<00:33, 1.32it/s]
|
| 3967 |
75%|ββββββββ | 132/176 [01:41<00:33, 1.31it/s]
|
| 3968 |
76%|ββββββββ | 133/176 [01:41<00:32, 1.32it/s]
|
| 3969 |
76%|ββββββββ | 134/176 [01:42<00:31, 1.31it/s]
|
| 3970 |
77%|ββββββββ | 135/176 [01:43<00:31, 1.30it/s]
|
| 3971 |
77%|ββββββββ | 136/176 [01:44<00:30, 1.30it/s]
|
| 3972 |
78%|ββββββββ | 137/176 [01:45<00:29, 1.32it/s]
|
| 3973 |
78%|ββββββββ | 138/176 [01:45<00:28, 1.31it/s]
|
| 3974 |
79%|ββββββββ | 139/176 [01:46<00:27, 1.33it/s]
|
| 3975 |
80%|ββββββββ | 140/176 [01:47<00:27, 1.33it/s]
|
| 3976 |
80%|ββββββββ | 141/176 [01:48<00:26, 1.31it/s]
|
| 3977 |
81%|ββββββββ | 142/176 [01:48<00:25, 1.32it/s]
|
| 3978 |
81%|βββββββββ | 143/176 [01:49<00:24, 1.34it/s]
|
| 3979 |
82%|βββββββββ | 144/176 [01:50<00:23, 1.34it/s]
|
| 3980 |
82%|βββββββββ | 145/176 [01:51<00:23, 1.34it/s]
|
| 3981 |
83%|βββββββββ | 146/176 [01:51<00:22, 1.32it/s]
|
| 3982 |
84%|βββββββββ | 147/176 [01:52<00:22, 1.31it/s]
|
| 3983 |
84%|βββββββββ | 148/176 [01:53<00:21, 1.31it/s]
|
| 3984 |
85%|βββββββββ | 149/176 [01:54<00:20, 1.30it/s]
|
| 3985 |
85%|βββββββββ | 150/176 [01:54<00:19, 1.30it/s]
|
| 3986 |
86%|βββββββββ | 151/176 [01:55<00:19, 1.31it/s]
|
| 3987 |
86%|βββββββββ | 152/176 [01:56<00:18, 1.30it/s]
|
| 3988 |
87%|βββββββββ | 153/176 [01:57<00:17, 1.29it/s]
|
| 3989 |
88%|βββββββββ | 154/176 [01:57<00:16, 1.32it/s]
|
| 3990 |
88%|βββββββββ | 155/176 [01:58<00:16, 1.31it/s]
|
| 3991 |
89%|βββββββββ | 156/176 [01:59<00:15, 1.31it/s]
|
| 3992 |
89%|βββββββββ | 157/176 [02:00<00:14, 1.35it/s]
|
| 3993 |
90%|βββββββββ | 158/176 [02:00<00:13, 1.33it/s]
|
| 3994 |
90%|βββββββββ | 159/176 [02:01<00:12, 1.31it/s]
|
| 3995 |
91%|βββββββββ | 160/176 [02:02<00:12, 1.32it/s]
|
| 3996 |
91%|ββββββββββ| 161/176 [02:03<00:11, 1.30it/s]
|
| 3997 |
92%|ββββββββββ| 162/176 [02:04<00:10, 1.31it/s]
|
| 3998 |
93%|οΏ½οΏ½βββββββββ| 163/176 [02:04<00:09, 1.32it/s]
|
| 3999 |
93%|ββββββββββ| 164/176 [02:05<00:08, 1.34it/s]
|
| 4000 |
94%|ββββββββββ| 165/176 [02:06<00:08, 1.30it/s]
|
| 4001 |
94%|ββββββββββ| 166/176 [02:07<00:07, 1.27it/s]
|
| 4002 |
95%|ββββββββββ| 167/176 [02:07<00:07, 1.28it/s]
|
| 4003 |
95%|ββββββββββ| 168/176 [02:08<00:06, 1.29it/s]
|
| 4004 |
96%|ββββββββββ| 169/176 [02:09<00:05, 1.32it/s]
|
| 4005 |
97%|ββββββββββ| 170/176 [02:10<00:04, 1.31it/s]
|
| 4006 |
97%|ββββββββββ| 171/176 [02:10<00:03, 1.32it/s]
|
| 4007 |
98%|ββββββββββ| 172/176 [02:11<00:03, 1.32it/s]
|
| 4008 |
98%|ββββββββββ| 173/176 [02:12<00:02, 1.31it/s]
|
| 4009 |
99%|ββββββββββ| 174/176 [02:13<00:01, 1.27it/s]
|
| 4010 |
99%|ββββββββββ| 175/176 [02:14<00:00, 1.27it/s]
|
| 4011 |
+
|
| 4012 |
+
Final Evaluation Results:
|
| 4013 |
+
eval_loss: 0.5781
|
| 4014 |
+
eval_wer: 54.5829
|
| 4015 |
+
eval_wer_ortho: 57.7728
|
| 4016 |
+
eval_cer: 17.2107
|
| 4017 |
+
eval_runtime: 194.9835
|
| 4018 |
+
eval_samples_per_second: 14.3650
|
| 4019 |
+
eval_steps_per_second: 0.9030
|
| 4020 |
+
epoch: 6.0640
|
| 4021 |
+
|
| 4022 |
+
Saving final model to /workspace/experiments/exp_002_base_lora...
|
| 4023 |
+
|
| 4024 |
+
|
| 4025 |
+
|
| 4026 |
+
|
| 4027 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4028 |
+
|
| 4029 |
+
|
| 4030 |
+
|
| 4031 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4032 |
+
|
| 4033 |
+
|
| 4034 |
+
|
| 4035 |
+
|
| 4036 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4037 |
+
|
| 4038 |
+
|
| 4039 |
+
|
| 4040 |
+
|
| 4041 |
+
|
| 4042 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4043 |
+
|
| 4044 |
+
|
| 4045 |
+
|
| 4046 |
+
|
| 4047 |
+
|
| 4048 |
+
|
| 4049 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4050 |
+
|
| 4051 |
+
|
| 4052 |
+
|
| 4053 |
+
|
| 4054 |
+
|
| 4055 |
+
|
| 4056 |
+
|
| 4057 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4058 |
+
|
| 4059 |
+
|
| 4060 |
+
|
| 4061 |
+
|
| 4062 |
+
|
| 4063 |
+
|
| 4064 |
+
|
| 4065 |
+
|
| 4066 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4067 |
+
|
| 4068 |
+
|
| 4069 |
+
|
| 4070 |
+
|
| 4071 |
+
|
| 4072 |
+
|
| 4073 |
+
|
| 4074 |
+
|
| 4075 |
+
|
| 4076 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4077 |
+
|
| 4078 |
+
|
| 4079 |
+
|
| 4080 |
+
|
| 4081 |
+
|
| 4082 |
+
|
| 4083 |
+
|
| 4084 |
+
|
| 4085 |
+
|
| 4086 |
+
|
| 4087 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4088 |
+
|
| 4089 |
+
|
| 4090 |
+
|
| 4091 |
+
|
| 4092 |
+
|
| 4093 |
+
|
| 4094 |
+
|
| 4095 |
+
|
| 4096 |
+
|
| 4097 |
+
|
| 4098 |
+
|
| 4099 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4100 |
+
|
| 4101 |
+
|
| 4102 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4103 |
+
|
| 4104 |
+
|
| 4105 |
+
|
| 4106 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4107 |
+
|
| 4108 |
+
|
| 4109 |
+
|
| 4110 |
+
|
| 4111 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4112 |
+
|
| 4113 |
+
|
| 4114 |
+
|
| 4115 |
+
|
| 4116 |
+
|
| 4117 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4118 |
+
|
| 4119 |
+
|
| 4120 |
+
|
| 4121 |
+
|
| 4122 |
+
|
| 4123 |
+
|
| 4124 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4125 |
+
|
| 4126 |
+
|
| 4127 |
+
|
| 4128 |
+
|
| 4129 |
+
|
| 4130 |
+
|
| 4131 |
+
|
| 4132 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4133 |
+
|
| 4134 |
+
|
| 4135 |
+
|
| 4136 |
+
|
| 4137 |
+
|
| 4138 |
+
|
| 4139 |
+
|
| 4140 |
+
|
| 4141 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4142 |
+
|
| 4143 |
+
|
| 4144 |
+
|
| 4145 |
+
|
| 4146 |
+
|
| 4147 |
+
|
| 4148 |
+
|
| 4149 |
+
|
| 4150 |
+
|
| 4151 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4152 |
+
|
| 4153 |
+
|
| 4154 |
+
|
| 4155 |
+
|
| 4156 |
+
|
| 4157 |
+
|
| 4158 |
+
|
| 4159 |
+
|
| 4160 |
+
|
| 4161 |
+
|
| 4162 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4163 |
+
|
| 4164 |
+
|
| 4165 |
+
|
| 4166 |
+
|
| 4167 |
+
|
| 4168 |
+
|
| 4169 |
+
|
| 4170 |
+
|
| 4171 |
+
|
| 4172 |
+
|
| 4173 |
+
|
| 4174 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4175 |
+
|
| 4176 |
+
|
| 4177 |
+
|
| 4178 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4179 |
+
|
| 4180 |
+
|
| 4181 |
+
|
| 4182 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4183 |
+
|
| 4184 |
+
|
| 4185 |
+
|
| 4186 |
+
|
| 4187 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4188 |
+
|
| 4189 |
+
|
| 4190 |
+
|
| 4191 |
+
|
| 4192 |
+
|
| 4193 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4194 |
+
|
| 4195 |
+
|
| 4196 |
+
|
| 4197 |
+
|
| 4198 |
+
|
| 4199 |
+
|
| 4200 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4201 |
+
|
| 4202 |
+
|
| 4203 |
+
|
| 4204 |
+
|
| 4205 |
+
|
| 4206 |
+
|
| 4207 |
+
|
| 4208 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4209 |
+
|
| 4210 |
+
|
| 4211 |
+
|
| 4212 |
+
|
| 4213 |
+
|
| 4214 |
+
|
| 4215 |
+
|
| 4216 |
+
|
| 4217 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4218 |
+
|
| 4219 |
+
|
| 4220 |
+
|
| 4221 |
+
|
| 4222 |
+
|
| 4223 |
+
|
| 4224 |
+
|
| 4225 |
+
|
| 4226 |
+
|
| 4227 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4228 |
+
|
| 4229 |
+
|
| 4230 |
+
|
| 4231 |
+
|
| 4232 |
+
|
| 4233 |
+
|
| 4234 |
+
|
| 4235 |
+
|
| 4236 |
+
|
| 4237 |
+
|
| 4238 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4239 |
+
|
| 4240 |
+
|
| 4241 |
+
|
| 4242 |
+
|
| 4243 |
+
|
| 4244 |
+
|
| 4245 |
+
|
| 4246 |
+
|
| 4247 |
+
|
| 4248 |
+
|
| 4249 |
+
|
| 4250 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4251 |
+
|
| 4252 |
+
|
| 4253 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB [A[A
|
| 4254 |
+
|
| 4255 |
+
|
| 4256 |
+
|
| 4257 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A
|
| 4258 |
+
|
| 4259 |
+
|
| 4260 |
+
|
| 4261 |
+
|
| 4262 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A
|
| 4263 |
+
|
| 4264 |
+
|
| 4265 |
+
|
| 4266 |
+
|
| 4267 |
+
|
| 4268 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A
|
| 4269 |
+
|
| 4270 |
+
|
| 4271 |
+
|
| 4272 |
+
|
| 4273 |
+
|
| 4274 |
+
|
| 4275 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A
|
| 4276 |
+
|
| 4277 |
+
|
| 4278 |
+
|
| 4279 |
+
|
| 4280 |
+
|
| 4281 |
+
|
| 4282 |
+
|
| 4283 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB [A[A[A[A[A[A[A
|
| 4284 |
+
|
| 4285 |
+
|
| 4286 |
+
|
| 4287 |
+
|
| 4288 |
+
|
| 4289 |
+
|
| 4290 |
+
|
| 4291 |
+
|
| 4292 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB [A[A[A[A[A[A[A[A
|
| 4293 |
+
|
| 4294 |
+
|
| 4295 |
+
|
| 4296 |
+
|
| 4297 |
+
|
| 4298 |
+
|
| 4299 |
+
|
| 4300 |
+
|
| 4301 |
+
|
| 4302 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB [A[A[A[A[A[A[A[A[A
|
| 4303 |
+
|
| 4304 |
+
|
| 4305 |
+
|
| 4306 |
+
|
| 4307 |
+
|
| 4308 |
+
|
| 4309 |
+
|
| 4310 |
+
|
| 4311 |
+
|
| 4312 |
+
|
| 4313 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB [A[A[A[A[A[A[A[A[A[A
|
| 4314 |
+
|
| 4315 |
+
|
| 4316 |
+
|
| 4317 |
+
|
| 4318 |
+
|
| 4319 |
+
|
| 4320 |
+
|
| 4321 |
+
|
| 4322 |
+
|
| 4323 |
+
|
| 4324 |
+
|
| 4325 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B [A[A[A[A[A[A[A[A[A[A[A
|
| 4326 |
+
|
| 4327 |
+
|
| 4328 |
...se_lora/training_args.bin: 100%|ββββββββββ| 5.91kB / 5.91kB
|
| 4329 |
+
|
| 4330 |
...4567.65d8b3854c39.93630.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4331 |
+
|
| 4332 |
...9876.65d8b3854c39.94606.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4333 |
+
|
| 4334 |
...9767.65d8b3854c39.92682.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4335 |
+
|
| 4336 |
...4970.65d8b3854c39.93828.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4337 |
+
|
| 4338 |
...0403.65d8b3854c39.94930.0: 100%|ββββββββββ| 36.7kB / 36.7kB
|
| 4339 |
+
|
| 4340 |
...4234.65d8b3854c39.93425.0: 100%|ββββββββββ| 6.04kB / 6.04kB
|
| 4341 |
+
|
| 4342 |
...8288.65d8b3854c39.94241.0: 100%|ββββββββββ| 6.90kB / 6.90kB
|
| 4343 |
+
|
| 4344 |
...adapter_model.safetensors: 100%|ββββββββββ| 9.45MB / 9.45MB
|
| 4345 |
+
|
| 4346 |
...9051.65d8b3854c39.94930.1: 100%|ββββββββββ| 506B / 506B
|
runs/Mar29_21-40-02_65d8b3854c39/events.out.tfevents.1774829051.65d8b3854c39.94930.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8b038aac50dbde232cbd38fdac6e3a54bdb6fa57b9134081732ea57c63b8a5af
|
| 3 |
+
size 506
|