Albin Thörn Cleland committed
Commit abd4157 · 1 Parent(s): f51c55a

Cleaning up + gitignore

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. .gitignore +3 -0
  2. .ipynb_checkpoints/README-checkpoint.md +9 -0
  3. .ipynb_checkpoints/pane_output-checkpoint.txt +0 -1254
  4. .ipynb_checkpoints/prepare-train-val-test-checkpoint.py +4 -2
  5. README.md +9 -0
  6. data/depparse/sv_diachronic.dev.in.conllu +2 -2
  7. data/depparse/sv_diachronic.test.in.conllu +2 -2
  8. data/depparse/sv_diachronic.train.in.conllu +2 -2
  9. prepare-train-val-test.py +4 -2
  10. {saved_models_old/depparse → saved_models/depparse/conll17_bm}/sv_diachronic_charlm_parser.pt +2 -2
  11. {saved_models_old/depparse → saved_models/depparse/conll17_bm}/sv_diachronic_charlm_parser_checkpoint.pt +2 -2
  12. saved_models/depparse/sv_diachronic_charlm_parser.pt +2 -2
  13. saved_models/depparse/sv_diachronic_charlm_parser_checkpoint.pt +2 -2
  14. stanza/__pycache__/__init__.cpython-312.pyc +0 -0
  15. stanza/__pycache__/_version.cpython-312.pyc +0 -0
  16. stanza/models/__pycache__/__init__.cpython-312.pyc +0 -0
  17. stanza/models/__pycache__/_training_logging.cpython-312.pyc +0 -0
  18. stanza/models/__pycache__/parser.cpython-312.pyc +0 -0
  19. stanza/models/__pycache__/tagger.cpython-312.pyc +0 -0
  20. stanza/models/classifiers/__pycache__/__init__.cpython-312.pyc +0 -0
  21. stanza/models/classifiers/__pycache__/base_classifier.cpython-312.pyc +0 -0
  22. stanza/models/classifiers/__pycache__/cnn_classifier.cpython-312.pyc +0 -0
  23. stanza/models/classifiers/__pycache__/config.cpython-312.pyc +0 -0
  24. stanza/models/classifiers/__pycache__/constituency_classifier.cpython-312.pyc +0 -0
  25. stanza/models/classifiers/__pycache__/data.cpython-312.pyc +0 -0
  26. stanza/models/classifiers/__pycache__/trainer.cpython-312.pyc +0 -0
  27. stanza/models/classifiers/__pycache__/utils.cpython-312.pyc +0 -0
  28. stanza/models/common/__pycache__/__init__.cpython-312.pyc +0 -0
  29. stanza/models/common/__pycache__/beam.cpython-312.pyc +0 -0
  30. stanza/models/common/__pycache__/bert_embedding.cpython-312.pyc +0 -0
  31. stanza/models/common/__pycache__/biaffine.cpython-312.pyc +0 -0
  32. stanza/models/common/__pycache__/char_model.cpython-312.pyc +0 -0
  33. stanza/models/common/__pycache__/chuliu_edmonds.cpython-312.pyc +0 -0
  34. stanza/models/common/__pycache__/constant.cpython-312.pyc +0 -0
  35. stanza/models/common/__pycache__/crf.cpython-312.pyc +0 -0
  36. stanza/models/common/__pycache__/data.cpython-312.pyc +0 -0
  37. stanza/models/common/__pycache__/doc.cpython-312.pyc +0 -0
  38. stanza/models/common/__pycache__/dropout.cpython-312.pyc +0 -0
  39. stanza/models/common/__pycache__/exceptions.cpython-312.pyc +0 -0
  40. stanza/models/common/__pycache__/foundation_cache.cpython-312.pyc +0 -0
  41. stanza/models/common/__pycache__/hlstm.cpython-312.pyc +0 -0
  42. stanza/models/common/__pycache__/loss.cpython-312.pyc +0 -0
  43. stanza/models/common/__pycache__/maxout_linear.cpython-312.pyc +0 -0
  44. stanza/models/common/__pycache__/packed_lstm.cpython-312.pyc +0 -0
  45. stanza/models/common/__pycache__/peft_config.cpython-312.pyc +0 -0
  46. stanza/models/common/__pycache__/pretrain.cpython-312.pyc +0 -0
  47. stanza/models/common/__pycache__/relative_attn.cpython-312.pyc +0 -0
  48. stanza/models/common/__pycache__/seq2seq_constant.cpython-312.pyc +0 -0
  49. stanza/models/common/__pycache__/seq2seq_model.cpython-312.pyc +0 -0
  50. stanza/models/common/__pycache__/seq2seq_modules.cpython-312.pyc +0 -0
.gitignore ADDED
@@ -0,0 +1,3 @@
+.ipynb_checkpoints
+data/
+ud/
.ipynb_checkpoints/README-checkpoint.md CHANGED
@@ -20,6 +20,8 @@ python -m stanza.utils.training.run_depparse UD_Swedish-diachronic --wordvec_pre
 
 ## Pretrained vectors
 
+We use the incremental vectors up until 1880 from Hengchen & Tahmasebi 2021.
+
 I first converted the kubhist2 vectors from gensim fasttext .ft to a plain text file with gensim's Python package, then used stanza's converter to .pt:
 
 ```
@@ -29,3 +31,10 @@ pt.load()
 ```
 
 The result is available compressed in `diachronic.pt.xz`.
+
+## References
+
+**Hengchen, Simon & Tahmasebi, Nina. (2021).**
+*A collection of Swedish diachronic word embedding models trained on historical newspaper data.*
+**Journal of Open Humanities Data**, 7(2), 1–7.
+https://doi.org/10.5334/johd.22
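Note: the README text in this diff describes a two-step conversion (gensim fasttext .ft to plain text, then text to a stanza .pt file), but most of its code block is hidden by the hunk boundary. The following is a minimal sketch of what such a conversion could look like, not the author's actual script. It assumes the .ft file was written with gensim's own save() method and that stanza's `Pretrain` class is used for the second step; the function names and the header check are hypothetical additions for illustration.

```python
from pathlib import Path


def fasttext_to_text(ft_path, txt_path):
    """Export a gensim FastText model to word2vec text format.

    Assumption: the .ft file was saved with gensim's save() method.
    """
    from gensim.models.fasttext import FastText  # third-party: gensim

    model = FastText.load(str(ft_path))
    model.wv.save_word2vec_format(str(txt_path), binary=False)


def read_word2vec_header(txt_path):
    """Return (vocab_size, dim) from the first line of a word2vec text file."""
    with Path(txt_path).open(encoding="utf-8") as handle:
        vocab_size, dim = handle.readline().split()
    return int(vocab_size), int(dim)


def text_to_stanza_pretrain(txt_path, pt_path):
    """Build a stanza pretrain .pt file from the text vectors."""
    from stanza.models.common.pretrain import Pretrain  # third-party: stanza

    pretrain = Pretrain(str(pt_path), str(txt_path))
    # load() reads the text vectors and saves pt_path if it does not exist yet
    pretrain.load()
```

With hypothetical file names, the steps would be called in order: `fasttext_to_text("kubhist2.ft", "kubhist2.txt")`, then `read_word2vec_header("kubhist2.txt")` as a quick sanity check on vocabulary size and dimension, then `text_to_stanza_pretrain("kubhist2.txt", "diachronic.pt")`. The third-party imports sit inside the functions so the header check can be used without gensim or stanza installed.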
.ipynb_checkpoints/pane_output-checkpoint.txt DELETED
@@ -1,1254 +0,0 @@
1
- (SyllaMBERT) python -m stanza.utils.training.run_depparse UD_Swedish-diachronic
2
- --batch_size 32 --dropout 0.33
3
- 2025-12-01 16:13:59 INFO: Training program called with:
4
- /mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/training/run
5
- _depparse.py UD_Swedish-diachronic --batch_size 32 --dropout 0.33
6
- 2025-12-01 16:13:59 DEBUG: UD_Swedish-diachronic: sv_diachronic
7
- 2025-12-01 16:13:59 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
8
- es/sv/forward_charlm/conll17.pt for forward charlm
9
- 2025-12-01 16:13:59 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
10
- es/sv/backward_charlm/conll17.pt for backward charlm
11
- 2025-12-01 16:13:59 INFO: Using default pretrain for language, found in /cephyr/
12
- users/cleland/Alvis/stanza_resources/sv/pretrain/conll17.pt To use a different
13
- pretrain, specify --wordvec_pretrain_file
14
- 2025-12-01 16:13:59 INFO: UD_Swedish-diachronic: saved_models/depparse/sv_diachr
15
- onic_charlm_parser.pt does not exist, training new model
16
- 2025-12-01 16:13:59 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
17
- es/sv/forward_charlm/conll17.pt for forward charlm
18
- 2025-12-01 16:13:59 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
19
- es/sv/backward_charlm/conll17.pt for backward charlm
20
- 2025-12-01 16:13:59 INFO: Using default pretrain for language, found in /cephyr/
21
- users/cleland/Alvis/stanza_resources/sv/pretrain/conll17.pt To use a different
22
- pretrain, specify --wordvec_pretrain_file
23
- 2025-12-01 16:13:59 INFO: Running train depparse for UD_Swedish-diachronic with
24
- args ['--wordvec_dir', '/cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain
25
- ', '--train_file', '/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/data/
26
- depparse/sv_diachronic.train.in.conllu', '--eval_file', '/mimer/NOBACKUP/groups/
27
- dionysus/cleland/stanza-digphil/data/depparse/sv_diachronic.dev.in.conllu', '--b
28
- atch_size', '5000', '--lang', 'sv', '--shorthand', 'sv_diachronic', '--mode', 't
29
- rain', '--wordvec_pretrain_file', '/cephyr/users/cleland/Alvis/stanza_resources/
30
- sv/pretrain/conll17.pt', '--charlm', '--charlm_shorthand', 'sv_conll17', '--char
31
- lm_forward_file', '/cephyr/users/cleland/Alvis/stanza_resources/sv/forward_charl
32
- m/conll17.pt', '--charlm_backward_file', '/cephyr/users/cleland/Alvis/stanza_res
33
- ources/sv/backward_charlm/conll17.pt', '--batch_size', '32', '--dropout', '0.33'
34
- ]
35
- 2025-12-01 16:13:59 INFO: Running parser in train mode
36
- 2025-12-01 16:13:59 INFO: Directory saved_models/depparse does not exist; creati
37
- ng...
38
- 2025-12-01 16:13:59 INFO: Using pretrained contextualized char embedding
39
- 2025-12-01 16:13:59 INFO: Loading data with batch size 32...
40
- 2025-12-01 16:14:05 INFO: Train File /mimer/NOBACKUP/groups/dionysus/cleland/sta
41
- nza-digphil/data/depparse/sv_diachronic.train.in.conllu, Data Size: 60141
42
- 2025-12-01 16:14:05 INFO: Original data size: 60141
43
- 2025-12-01 16:14:06 INFO: Augmented data size: 60265
44
- 2025-12-01 16:14:22 WARNING: sv_diachronic is not a known dataset. Examining th
45
- e data to choose which xpos vocab to use
46
- 2025-12-01 16:14:22 INFO: Original length = 60265
47
- 2025-12-01 16:14:22 INFO: Filtered length = 60265
48
- 2025-12-01 16:14:35 WARNING: Chose XPOSDescription(xpos_type=<XPOSType.XPOS: 1>,
49
- sep='|') for the xpos factory for sv_diachronic
50
- 2025-12-01 16:14:42 DEBUG: Loaded pretrain from /cephyr/users/cleland/Alvis/stan
51
- za_resources/sv/pretrain/conll17.pt
52
- 2025-12-01 16:14:54 DEBUG: 34980 batches created.
53
- 2025-12-01 16:14:59 DEBUG: 3869 batches created.
54
- 2025-12-01 16:14:59 INFO: Training parser...
55
- 2025-12-01 16:14:59 DEBUG: Depparse model loading charmodels: /cephyr/users/clel
56
- and/Alvis/stanza_resources/sv/forward_charlm/conll17.pt and /cephyr/users/clelan
57
- d/Alvis/stanza_resources/sv/backward_charlm/conll17.pt
58
- 2025-12-01 16:14:59 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
59
- a_resources/sv/forward_charlm/conll17.pt
60
- 2025-12-01 16:14:59 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
61
- a_resources/sv/backward_charlm/conll17.pt
62
- 2025-12-01 16:14:59 DEBUG: Building Adam with lr=0.003000, betas=(0.9, 0.95), ep
63
- s=0.000001
64
- 2025-12-01 16:15:07 INFO: Finished STEP 20/50000, loss = 7.766887 (0.198 sec
65
- /batch), lr: 0.003000
66
- 2025-12-01 16:15:11 INFO: Finished STEP 40/50000, loss = 5.882998 (0.176 sec/bat
67
- ch), lr: 0.003000
68
- 2025-12-01 16:15:14 INFO: Finished STEP 60/50000, loss = 5.150016 (0.156 sec/bat
69
- ch), lr: 0.003000
70
- ^Z
71
- [1]+ Stopped python -m stanza.utils.training.run_depparse UD_Sw
72
- edish-diachronic --batch_size 32 --dropout 0.33
73
- (SyllaMBERT)
74
- (SyllaMBERT)
75
- (SyllaMBERT)
76
- (SyllaMBERT)
77
- (SyllaMBERT)
78
- (SyllaMBERT)
79
- (SyllaMBERT)
80
- (SyllaMBERT) python -m stanza.utils.training.run_depparse UD_Swedish-diachronic
81
- --batch_size 32 --dropout 0.33
82
- 2025-12-01 16:16:19 INFO: Training program called with:
83
- /mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/training/run
84
- _depparse.py UD_Swedish-diachronic --batch_size 32 --dropout 0.33
85
- 2025-12-01 16:16:19 DEBUG: UD_Swedish-diachronic: sv_diachronic
86
- 2025-12-01 16:16:19 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
87
- es/sv/forward_charlm/conll17.pt for forward charlm
88
- 2025-12-01 16:16:19 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
89
- es/sv/backward_charlm/conll17.pt for backward charlm
90
- 2025-12-01 16:16:19 INFO: Default pretrain should be /cephyr/users/cleland/Alvis
91
- /stanza_resources/sv/pretrain/conll17.pt Attempting to download
92
- 2025-12-01 16:16:19 DEBUG: Downloading resource file from https://raw.githubuser
93
- content.com/stanfordnlp/stanza-resources/main/resources_1.11.0.json
94
- Downloading https://raw.githubusercontent.com/stanfordnlp/stanza-resources/main/
95
- resources_1.11.0.json: 435kB [00:00, 115MB/s]
96
- 2025-12-01 16:16:19 INFO: Downloaded file to /cephyr/users/cleland/Alvis/stanza_
97
- resources/resources.json
98
- 2025-12-01 16:16:19 DEBUG: Processing parameter "processors"...
99
- 2025-12-01 16:16:19 DEBUG: Found pretrain: conll17.
100
- 2025-12-01 16:16:19 DEBUG: Found dependencies [] for processor pretrain model co
101
- nll17
102
- 2025-12-01 16:16:19 INFO: Downloading these customized packages for language: sv
103
- (Swedish)...
104
- =======================
105
- | Processor | Package |
106
- -----------------------
107
- | pretrain | conll17 |
108
- =======================
109
-
110
- Downloading https://huggingface.co/stanfordnlp/stanza-sv/resolve/v1.11.0/models/
111
- pretrain/conll17.pt: 100%|█| 107M/107M [00:00
112
- 2025-12-01 16:16:20 INFO: Downloaded file to /cephyr/users/cleland/Alvis/stanza_
113
- resources/sv/pretrain/conll17.pt
114
- 2025-12-01 16:16:20 INFO: Finished downloading models and saved to /cephyr/users
115
- /cleland/Alvis/stanza_resources
116
- 2025-12-01 16:16:20 INFO: Using default pretrain for language, found in /cephyr/
117
- users/cleland/Alvis/stanza_resources/sv/pretrain/conll17.pt To use a different
118
- pretrain, specify --wordvec_pretrain_file
119
- 2025-12-01 16:16:20 INFO: UD_Swedish-diachronic: saved_models/depparse/sv_diachr
120
- onic_charlm_parser.pt does not exist, training new model
121
- 2025-12-01 16:16:20 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
122
- es/sv/forward_charlm/conll17.pt for forward charlm
123
- 2025-12-01 16:16:20 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
124
- es/sv/backward_charlm/conll17.pt for backward charlm
125
- 2025-12-01 16:16:20 INFO: Using default pretrain for language, found in /cephyr/
126
- users/cleland/Alvis/stanza_resources/sv/pretrain/conll17.pt To use a different
127
- pretrain, specify --wordvec_pretrain_file
128
- 2025-12-01 16:16:20 INFO: Running train depparse for UD_Swedish-diachronic with
129
- args ['--wordvec_dir', '/cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain
130
- ', '--train_file', '/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/data/
131
- depparse/sv_diachronic.train.in.conllu', '--eval_file', '/mimer/NOBACKUP/groups/
132
- dionysus/cleland/stanza-digphil/data/depparse/sv_diachronic.dev.in.conllu', '--b
133
- atch_size', '5000', '--lang', 'sv', '--shorthand', 'sv_diachronic', '--mode', 't
134
- rain', '--wordvec_pretrain_file', '/cephyr/users/cleland/Alvis/stanza_resources/
135
- sv/pretrain/conll17.pt', '--charlm', '--charlm_shorthand', 'sv_conll17', '--char
136
- lm_forward_file', '/cephyr/users/cleland/Alvis/stanza_resources/sv/forward_charl
137
- m/conll17.pt', '--charlm_backward_file', '/cephyr/users/cleland/Alvis/stanza_res
138
- ources/sv/backward_charlm/conll17.pt', '--batch_size', '32', '--dropout', '0.33'
139
- ]
140
- 2025-12-01 16:16:20 INFO: Running parser in train mode
141
- 2025-12-01 16:16:20 INFO: Using pretrained contextualized char embedding
142
- 2025-12-01 16:16:20 INFO: Loading data with batch size 32...
143
- 2025-12-01 16:16:26 INFO: Train File /mimer/NOBACKUP/groups/dionysus/cle
144
- land/stanza-digphil/data/depparse/sv_diachronic.train.in.conllu, Data Size: 6014
145
- 1
146
- 2025-12-01 16:16:26 INFO: Original data size: 60141
147
- 2025-12-01 16:16:27 INFO: Augmented data size: 60265
148
- 2025-12-01 16:16:42 WARNING: sv_diachronic is not a known dataset. Examining th
149
- e data to choose which xpos vocab to use
150
- 2025-12-01 16:16:42 INFO: Original length = 60265
151
- 2025-12-01 16:16:42 INFO: Filtered length = 60265
152
- 2025-12-01 16:16:55 WARNING: Chose XPOSDescription(xpos_type=<XPOSType.XPOS: 1>,
153
- sep='|') for the xpos factory for sv_diachronic
154
- 2025-12-01 16:17:02 DEBUG: Loaded pretrain from /cephyr/users/cleland/Alvis/stan
155
- za_resources/sv/pretrain/conll17.pt
156
- 2025-12-01 16:17:14 DEBUG: 34980 batches created.
157
- 2025-12-01 16:17:16 DEBUG: 3869 batches created.
158
- 2025-12-01 16:17:16 INFO: Training parser...
159
- 2025-12-01 16:17:16 DEBUG: Depparse model loading charmodels: /cephyr/users/clel
160
- and/Alvis/stanza_resources/sv/forward_charlm/conll17.pt and /cephyr/users/clelan
161
- d/Alvis/stanza_resources/sv/backward_charlm/conll17.pt
162
- 2025-12-01 16:17:16 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
163
- a_resources/sv/forward_charlm/conll17.pt
164
- 2025-12-01 16:17:17 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
165
- a_resources/sv/backward_charlm/conll17.pt
166
- 2025-12-01 16:17:17 DEBUG: Building Adam with lr=0.003000, betas=(0.9, 0.95), ep
167
- s=0.000001
168
- ^CTraceback (most recent call last):
169
- File "<frozen runpy>", line 198, in _run_module_as_main
170
- File "<frozen runpy>", line 88, in _run_code
171
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/trai
172
- ning/run_depparse.py", line 145, in <module>
173
- ^C^C^C^C^Cobject address : 0x14cbf7cef520
174
- object refcount : 3
175
- object type : 0x14ccdb4498a0
176
- object type name: KeyboardInterrupt
177
- object repr : KeyboardInterrupt()
178
- lost sys.stderr
179
- ^CException ignored in atexit callback: <function dump_compile_times at 0x14cb63
180
- 0e1bc0>
181
- Traceback (most recent call last):
182
- File "/mimer/NOBACKUP/groups/dionysus/fresh/SyllaMBERT/lib/python3.12/site-pac
183
- kages/torch/_dynamo/utils.py", line 399, in dump_compile_times
184
- ^C^C File "/mimer/NOBACKUP/groups/dionysus/fresh/SyllaMBERT/lib/python3.12/site
185
- -packages/torch/_dynamo/utils.py", line 385, in compile_times
186
- ^C File "/mimer/NOBACKUP/groups/dionysus/fresh/SyllaMBERT/lib/python3.12/site-p
187
- ackages/torch/_dynamo/utils.py", line 148, in tabulate
188
- import tabulate
189
- File "<frozen importlib._bootstrap>", line 1360, in _find_and_load
190
- File "<frozen importlib._bootstrap>", line 1322, in _find_and_load_unlocked
191
- File "<frozen importlib._bootstrap>", line 1262, in _find_spec
192
- File "<frozen importlib._bootstrap_external>", line 1528, in find_spec
193
- File "<frozen importlib._bootstrap_external>", line 1502, in _get_spec
194
- File "<frozen importlib._bootstrap_external>", line 1620, in find_spec
195
- KeyboardInterrupt:
196
- ^CException ignored in atexit callback: <bound method finalize._exitfunc of <cla
197
- ss 'weakref.finalize'>>
198
- Traceback (most recent call last):
199
- File "/apps/Arch/software/Python/3.12.3-GCCcore-13.3.0/lib/python3.12/weakref.
200
- py", line 666, in _exitfunc
201
- ^C^C^C^C^C^C^C^CKeyboardInterrupt:
202
- ^CException ignored in atexit callback: <function dump_cache_stats at 0x14cc0db9
203
- 3380>
204
- Traceback (most recent call last):
205
- File "/mimer/NOBACKUP/groups/dionysus/fresh/SyllaMBERT/lib/python3.12/site-pac
206
- kages/torch/_subclasses/fake_tensor.py", line 2425, in dump_cache_stats
207
- ^C^C^C^C^C^C^C^CKeyboardInterrupt:
208
- ^CException ignored in atexit callback: <function shutdown at 0x14ccd9f360c0>
209
- Traceback (most recent call last):
210
- File "/apps/Arch/software/Python/3.12.3-GCCcore-13.3.0/lib/python3.12/logging/
211
- __init__.py", line 2245, in shutdown
212
- ^C^CKeyboardInterrupt:
213
-
214
- (SyllaMBERT)
215
- (SyllaMBERT)
216
- (SyllaMBERT)
217
- (SyllaMBERT)
218
- (SyllaMBERT)
219
- (SyllaMBERT)
220
- (SyllaMBERT)
221
- (SyllaMBERT)
222
- (SyllaMBERT)
223
- (SyllaMBERT)
224
- (SyllaMBERT)
225
- (SyllaMBERT)
226
- (SyllaMBERT)
227
- (SyllaMBERT)
228
- (SyllaMBERT)
229
- (SyllaMBERT)
230
- (SyllaMBERT)
231
- (SyllaMBERT)
232
- (SyllaMBERT)
233
- (SyllaMBERT)
234
- (SyllaMBERT)
235
- (SyllaMBERT)
236
- (SyllaMBERT)
237
- (SyllaMBERT)
238
- (SyllaMBERT)
239
- (SyllaMBERT)
240
- (SyllaMBERT)
241
- (SyllaMBERT)
242
- (SyllaMBERT)
243
- (SyllaMBERT)
244
- (SyllaMBERT)
245
- (SyllaMBERT)
246
- (SyllaMBERT)
247
- (SyllaMBERT)
248
- (SyllaMBERT)
249
- (SyllaMBERT)
250
- (SyllaMBERT) python -m stanza.utils.training.run_depparse UD_Swedish-diachronic
251
- --wordvec_pretrain_file /cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain
252
- /diachronic.pt --batch_size 32 --dropout 0.33
253
- 2025-12-01 16:18:36 INFO: Training program called with:
254
- /mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/training/run
255
- _depparse.py UD_Swedish-diachronic --wordvec_pretrain_file /cephyr/users/cleland
256
- /Alvis/stanza_resources/sv/pretrain/diachronic.pt --batch_size 32 --dropout 0.33
257
- 2025-12-01 16:18:36 DEBUG: UD_Swedish-diachronic: sv_diachronic
258
- 2025-12-01 16:18:36 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
259
- es/sv/forward_charlm/conll17.pt for forward charlm
260
- 2025-12-01 16:18:36 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
261
- es/sv/backward_charlm/conll17.pt for backward charlm
262
- 2025-12-01 16:18:36 INFO: UD_Swedish-diachronic: saved_models/depparse/sv_diachr
263
- onic_charlm_parser.pt does not exist, training new model
264
- 2025-12-01 16:18:36 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
265
- es/sv/forward_charlm/conll17.pt for forward charlm
266
- 2025-12-01 16:18:36 INFO: Using model /cephyr/users/cleland/Alvis/stanza_resourc
267
- es/sv/backward_charlm/conll17.pt for backward charlm
268
- 2025-12-01 16:18:36 INFO: Running train depparse for UD_Swedish-diachronic with
269
- args ['--wordvec_dir', '/cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain
270
- ', '--train_file', '/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/data/
271
- depparse/sv_diachronic.train.in.conllu', '--eval_file', '/mimer/NOBACKUP/groups/
272
- dionysus/cleland/stanza-digphil/data/depparse/sv_diachronic.dev.in.conllu', '--b
273
- atch_size', '5000', '--lang', 'sv', '--shorthand', 'sv_diachronic', '--mode', 't
274
- rain', '--charlm', '--charlm_shorthand', 'sv_conll17', '--charlm_forward_file',
275
- '/cephyr/users/cleland/Alvis/stanza_resources/sv/forward_charlm/conll17.pt', '--
276
- charlm_backward_file', '/cephyr/users/cleland/Alvis/stanza_resources/sv/backward
277
- _charlm/conll17.pt', '--wordvec_pretrain_file', '/cephyr/users/cleland/Alvis/sta
278
- nza_resources/sv/pretrain/diachronic.pt', '--batch_size', '32', '--dropout', '0.
279
- 33']
280
- 2025-12-01 16:18:36 INFO: Running parser in train mode
281
- 2025-12-01 16:18:36 INFO: Using pretrained contextualized char embedding
282
- 2025-12-01 16:18:36 INFO: Loading data with batch size 32...
283
- 2025-12-01 16:18:43 INFO: Train File /mimer/NOBACKUP/groups/dionysus/cleland/sta
284
- nza-digphil/data/depparse/sv_diachronic.train.in.conllu, Data Size: 60141
285
- 2025-12-01 16:18:43 INFO: Original data size: 60141
286
- 2025-12-01 16:18:43 INFO: Augmented data size: 60265
287
- 2025-12-01 16:18:59 WARNING: sv_diachronic is not a known dataset. Examining th
288
- e data to choose which xpos vocab to use
289
- 2025-12-01 16:18:59 INFO: Original length = 60265
290
- 2025-12-01 16:18:59 INFO: Filtered length = 60265
291
- 2025-12-01 16:19:12 WARNING: Chose XPOSDescription(xpos_type=<XPOSType.XPOS: 1>,
292
- sep='|') for the xpos factory for sv_diachronic
293
- 2025-12-01 16:19:19 DEBUG: Loaded pretrain from /cephyr/users/cleland/Alvis/stan
294
- za_resources/sv/pretrain/diachronic.pt
295
- 2025-12-01 16:19:31 DEBUG: 34980 batches created.
296
- 2025-12-01 16:19:36 DEBUG: 3869 batches created.
297
- 2025-12-01 16:19:36 INFO: Training parser...
298
- 2025-12-01 16:19:36 DEBUG: Depparse model loading charmodels: /cephyr/users/clel
299
- and/Alvis/stanza_resources/sv/forward_charlm/conll17.pt and /cephyr/users/clelan
300
- d/Alvis/stanza_resources/sv/backward_charlm/conll17.pt
301
- 2025-12-01 16:19:36 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
302
- a_resources/sv/forward_charlm/conll17.pt
303
- 2025-12-01 16:19:36 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
304
- a_resources/sv/backward_charlm/conll17.pt
305
- 2025-12-01 16:19:37 DEBUG: Building Adam with lr=0.003000, betas=(0.9, 0.95), ep
306
- s=0.000001
307
- 2025-12-01 16:19:44 INFO: Finished STEP 20/50000, loss = 7.816687 (0.197 sec/bat
308
- ch), lr: 0.003000
309
- 2025-12-01 16:19:48 INFO: Finished STEP 40/50000, loss = 5.938274 (0.176 sec/bat
310
- ch), lr: 0.003000
311
- 2025-12-01 16:19:51 INFO: Finished STEP 60/50000, loss = 5.364614 (0.155 sec/bat
312
- ch), lr: 0.003000
313
- 2025-12-01 16:19:54 INFO: Finished STEP 80/50000, loss = 5.002423 (0.152 sec/bat
314
- ch), lr: 0.003000
315
- 2025-12-01 16:19:56 INFO: Finished STEP 100/50000, loss = 5.159612 (0.150 sec/ba
316
- tch), lr: 0.003000
317
- 2025-12-01 16:19:56 INFO: Evaluating on dev set...
318
- 2025-12-01 16:21:47 INFO: LAS MLAS BLEX
319
- 2025-12-01 16:21:47 INFO: 35.52 25.51 31.38
320
- 2025-12-01 16:21:48 INFO: step 100: train_loss = 8.221023, dev_score = 0.3552
321
- 2025-12-01 16:21:48 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
322
- rlm_parser.pt
323
- 2025-12-01 16:21:48 INFO: new best model saved.
324
- 2025-12-01 16:21:48 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
325
- rlm_parser_checkpoint.pt
326
- 2025-12-01 16:21:48 INFO: new model checkpoint saved.
327
- 2025-12-01 16:21:51 INFO: Finished STEP 120/50000, loss = 5.322126 (0.132 sec/ba
328
- tch), lr: 0.003000
329
- 2025-12-01 16:21:54 INFO: Finished STEP 140/50000, loss = 8.141600 (0.105 sec/ba
330
- tch), lr: 0.003000
331
- 2025-12-01 16:21:56 INFO: Finished STEP 160/50000, loss = 4.586904 (0.133 sec/ba
332
- tch), lr: 0.003000
333
- 2025-12-01 16:21:59 INFO: Finished STEP 180/50000, loss = 5.173397 (0.122 sec/ba
334
- tch), lr: 0.003000
335
- 2025-12-01 16:22:01 INFO: Finished STEP 200/50000, loss = 4.824401 (0.130 sec/ba
336
- tch), lr: 0.003000
337
- 2025-12-01 16:22:01 INFO: Evaluating on dev set...
338
- 2025-12-01 16:23:57 INFO: LAS MLAS BLEX
339
- 2025-12-01 16:23:57 INFO: 44.81 34.78 39.39
340
- 2025-12-01 16:23:57 INFO: step 200: train_loss = 5.132569, dev_score = 0.4481
341
- 2025-12-01 16:23:57 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
342
- rlm_parser.pt
343
- 2025-12-01 16:23:57 INFO: new best model saved.
344
- 2025-12-01 16:23:58 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
345
- rlm_parser_checkpoint.pt
346
- 2025-12-01 16:23:58 INFO: new model checkpoint saved.
347
- 2025-12-01 16:24:00 INFO: Finished STEP 220/50000, loss = 5.426694 (0.120 sec/ba
348
- tch), lr: 0.003000
349
- 2025-12-01 16:24:02 INFO: Finished STEP 240/50000, loss = 3.989385 (0.115 sec/ba
350
- tch), lr: 0.003000
351
- 2025-12-01 16:24:05 INFO: Finished STEP 260/50000, loss = 5.430014 (0.112 sec/ba
352
- tch), lr: 0.003000
353
- 2025-12-01 16:24:07 INFO: Finished STEP 280/50000, loss = 5.211717 (0.118 sec/ba
354
- tch), lr: 0.003000
355
- 2025-12-01 16:24:09 INFO: Finished STEP 300/50000, loss = 4.077099 (0.112 sec/ba
356
- tch), lr: 0.003000
357
- 2025-12-01 16:24:09 INFO: Evaluating on dev set...
358
- 2025-12-01 16:26:01 INFO: LAS MLAS BLEX
359
- 2025-12-01 16:26:01 INFO: 51.07 41.38 43.75
360
- 2025-12-01 16:26:01 INFO: step 300: train_loss = 4.773311, dev_score = 0.5107
361
- 2025-12-01 16:26:01 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
362
- rlm_parser.pt
363
- 2025-12-01 16:26:01 INFO: new best model saved.
364
- 2025-12-01 16:26:01 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
365
- rlm_parser_checkpoint.pt
366
- 2025-12-01 16:26:01 INFO: new model checkpoint saved.
367
- 2025-12-01 16:26:04 INFO: Finished STEP 320/50000, loss = 4.404383 (0.116 sec/ba
368
- tch), lr: 0.003000
369
- 2025-12-01 16:26:06 INFO: Finished STEP 340/50000, loss = 5.495321 (0.109 sec/ba
370
- tch), lr: 0.003000
371
- 2025-12-01 16:26:08 INFO: Finished STEP 360/50000, loss = 4.231166 (0.097 sec/ba
372
- tch), lr: 0.003000
373
- 2025-12-01 16:26:10 INFO: Finished STEP 380/50000, loss = 3.526903 (0.115 sec/ba
374
- tch), lr: 0.003000
375
- 2025-12-01 16:26:12 INFO: Finished STEP 400/50000, loss = 3.672882 (0.105 sec/ba
376
- tch), lr: 0.003000
377
- 2025-12-01 16:26:12 INFO: Evaluating on dev set...
378
- 2025-12-01 16:28:11 INFO: LAS MLAS BLEX
379
- 2025-12-01 16:28:11 INFO: 54.88 43.48 46.64
380
- 2025-12-01 16:28:11 INFO: step 400: train_loss = 4.541314, dev_score = 0.5488
381
- 2025-12-01 16:28:11 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
382
- rlm_parser.pt
383
- 2025-12-01 16:28:11 INFO: new best model saved.
384
- 2025-12-01 16:28:11 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
385
- rlm_parser_checkpoint.pt
386
- 2025-12-01 16:28:11 INFO: new model checkpoint saved.
387
- 2025-12-01 16:28:14 INFO: Finished STEP 420/50000, loss = 4.930879 (0.108 sec/ba
388
- tch), lr: 0.003000
389
- 2025-12-01 16:28:16 INFO: Finished STEP 440/50000, loss = 3.510082 (0.093 sec/ba
390
- tch), lr: 0.003000
391
- 2025-12-01 16:28:18 INFO: Finished STEP 460/50000, loss = 3.485497 (0.112 sec/ba
392
- tch), lr: 0.003000
393
- 2025-12-01 16:28:20 INFO: Finished STEP 480/50000, loss = 3.652010 (0.106 sec/ba
394
- tch), lr: 0.003000
395
- 2025-12-01 16:28:22 INFO: Finished STEP 500/50000, loss = 3.712685 (0.100 sec/ba
396
- tch), lr: 0.003000
397
- 2025-12-01 16:28:22 INFO: Evaluating on dev set...
398
- 2025-12-01 16:30:16 INFO: LAS MLAS BLEX
399
- 2025-12-01 16:30:16 INFO: 53.93 46.20 48.86
400
- 2025-12-01 16:30:16 INFO: step 500: train_loss = 4.387830, dev_score = 0.5393
401
- 2025-12-01 16:30:17 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
402
- rlm_parser_checkpoint.pt
403
- 2025-12-01 16:30:17 INFO: new model checkpoint saved.
404
- 2025-12-01 16:30:19 INFO: Finished STEP 520/50000, loss = 3.515909 (0.100 sec/ba
405
- tch), lr: 0.003000
406
- 2025-12-01 16:30:21 INFO: Finished STEP 540/50000, loss = 4.955926 (0.105 sec/ba
407
- tch), lr: 0.003000
408
- 2025-12-01 16:30:23 INFO: Finished STEP 560/50000, loss = 3.391292 (0.105 sec/ba
409
- tch), lr: 0.003000
410
- 2025-12-01 16:30:25 INFO: Finished STEP 580/50000, loss = 3.793795 (0.098 sec/ba
411
- tch), lr: 0.003000
412
- 2025-12-01 16:30:27 INFO: Finished STEP 600/50000, loss = 3.982375 (0.090 sec/ba
413
- tch), lr: 0.003000
414
- 2025-12-01 16:30:27 INFO: Evaluating on dev set...
417
- 2025-12-01 16:32:19 INFO: LAS MLAS BLEX
418
- 2025-12-01 16:32:19 INFO: 55.72 45.49 49.33
419
- 2025-12-01 16:32:19 INFO: step 600: train_loss = 4.107662, dev_score = 0.5572
420
- 2025-12-01 16:32:19 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
421
- rlm_parser.pt
422
- 2025-12-01 16:32:19 INFO: new best model saved.
423
- 2025-12-01 16:32:19 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
424
- rlm_parser_checkpoint.pt
425
- 2025-12-01 16:32:19 INFO: new model checkpoint saved.
426
- 2025-12-01 16:32:21 INFO: Finished STEP 620/50000, loss = 4.117659 (0.094 sec/ba
427
- tch), lr: 0.003000
428
- 2025-12-01 16:32:23 INFO: Finished STEP 640/50000, loss = 5.035217 (0.104 sec/ba
429
- tch), lr: 0.003000
430
- 2025-12-01 16:32:25 INFO: Finished STEP 660/50000, loss = 3.393968 (0.097 sec/ba
431
- tch), lr: 0.003000
432
- 2025-12-01 16:32:27 INFO: Finished STEP 680/50000, loss = 3.680551 (0.095 sec/ba
433
- tch), lr: 0.003000
434
- 2025-12-01 16:32:29 INFO: Finished STEP 700/50000, loss = 3.164994 (0.092 sec/ba
435
- tch), lr: 0.003000
436
- 2025-12-01 16:32:29 INFO: Evaluating on dev set...
437
- 2025-12-01 16:34:26 INFO: LAS MLAS BLEX
438
- 2025-12-01 16:34:26 INFO: 58.57 49.48 53.00
439
- 2025-12-01 16:34:26 INFO: step 700: train_loss = 4.103022, dev_score = 0.5857
440
- 2025-12-01 16:34:26 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
441
- rlm_parser.pt
442
- 2025-12-01 16:34:26 INFO: new best model saved.
443
- 2025-12-01 16:34:27 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
444
- rlm_parser_checkpoint.pt
445
- 2025-12-01 16:34:27 INFO: new model checkpoint saved.
446
- 2025-12-01 16:34:29 INFO: Finished STEP 720/50000, loss = 3.945064 (0.098 sec/ba
447
- tch), lr: 0.003000
448
- 2025-12-01 16:34:31 INFO: Finished STEP 740/50000, loss = 3.435038 (0.096 sec/ba
449
- tch), lr: 0.003000
450
- 2025-12-01 16:34:33 INFO: Finished STEP 760/50000, loss = 6.817209 (0.091 sec/ba
451
- tch), lr: 0.003000
452
- 2025-12-01 16:34:35 INFO: Finished STEP 780/50000, loss = 4.048465 (0.096 sec/ba
453
- tch), lr: 0.003000
454
- 2025-12-01 16:34:37 INFO: Finished STEP 800/50000, loss = 3.421274 (0.102 sec/ba
455
- tch), lr: 0.003000
456
- 2025-12-01 16:34:37 INFO: Evaluating on dev set...
457
- 2025-12-01 16:36:27 INFO: LAS MLAS BLEX
458
- 2025-12-01 16:36:27 INFO: 58.47 47.98 51.83
459
- 2025-12-01 16:36:27 INFO: step 800: train_loss = 3.999019, dev_score = 0.5847
460
- 2025-12-01 16:36:27 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
461
- rlm_parser_checkpoint.pt
462
- 2025-12-01 16:36:27 INFO: new model checkpoint saved.
463
- 2025-12-01 16:36:29 INFO: Finished STEP 820/50000, loss = 3.610717 (0.084 sec/ba
464
- tch), lr: 0.003000
465
- 2025-12-01 16:36:31 INFO: Finished STEP 840/50000, loss = 4.302441 (0.104 sec/ba
466
- tch), lr: 0.003000
467
- 2025-12-01 16:36:33 INFO: Finished STEP 860/50000, loss = 4.264128 (0.087 sec/ba
468
- tch), lr: 0.003000
469
- 2025-12-01 16:36:35 INFO: Finished STEP 880/50000, loss = 2.931184 (0.102 sec/ba
470
- tch), lr: 0.003000
471
- 2025-12-01 16:36:37 INFO: Finished STEP 900/50000, loss = 3.759175 (0.095 sec/ba
472
- tch), lr: 0.003000
473
- 2025-12-01 16:36:37 INFO: Evaluating on dev set...
474
- 2025-12-01 16:38:33 INFO: LAS MLAS BLEX
475
- 2025-12-01 16:38:33 INFO: 59.24 49.46 52.65
476
- 2025-12-01 16:38:33 INFO: step 900: train_loss = 4.114415, dev_score = 0.5924
477
- 2025-12-01 16:38:33 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
478
- rlm_parser.pt
479
- 2025-12-01 16:38:33 INFO: new best model saved.
480
- 2025-12-01 16:38:34 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
481
- rlm_parser_checkpoint.pt
482
- 2025-12-01 16:38:34 INFO: new model checkpoint saved.
483
- 2025-12-01 16:38:35 INFO: Finished STEP 920/50000, loss = 4.831368 (0.083 sec/ba
484
- tch), lr: 0.003000
485
- 2025-12-01 16:38:37 INFO: Finished STEP 940/50000, loss = 3.790484 (0.084 sec/ba
486
- tch), lr: 0.003000
487
- 2025-12-01 16:38:39 INFO: Finished STEP 960/50000, loss = 3.031943 (0.088 sec/ba
488
- tch), lr: 0.003000
489
- 2025-12-01 16:38:41 INFO: Finished STEP 980/50000, loss = 4.735695 (0.086 sec/ba
490
- tch), lr: 0.003000
491
- 2025-12-01 16:38:42 INFO: Finished STEP 1000/50000, loss = 4.110371 (0.094 sec/b
492
- atch), lr: 0.003000
493
- 2025-12-01 16:38:42 INFO: Evaluating on dev set...
494
- 2025-12-01 16:40:36 INFO: LAS MLAS BLEX
495
- 2025-12-01 16:40:36 INFO: 59.88 51.88 54.44
496
- 2025-12-01 16:40:36 INFO: step 1000: train_loss = 3.912631, dev_score = 0.5988
497
- 2025-12-01 16:40:36 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
498
- rlm_parser.pt
499
- 2025-12-01 16:40:36 INFO: new best model saved.
500
- 2025-12-01 16:40:37 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
501
- rlm_parser_checkpoint.pt
502
- 2025-12-01 16:40:37 INFO: new model checkpoint saved.
503
- 2025-12-01 16:40:39 INFO: Finished STEP 1020/50000, loss = 3.910941 (0.089 sec/b
504
- atch), lr: 0.003000
505
- 2025-12-01 16:40:40 INFO: Finished STEP 1040/50000, loss = 2.339472 (0.088 sec/b
506
- atch), lr: 0.003000
507
- 2025-12-01 16:40:42 INFO: Finished STEP 1060/50000, loss = 3.969233 (0.090 sec/b
508
- atch), lr: 0.003000
509
- 2025-12-01 16:40:44 INFO: Finished STEP 1080/50000, loss = 3.765004 (0.084 sec/b
510
- atch), lr: 0.003000
511
- 2025-12-01 16:40:46 INFO: Finished STEP 1100/50000, loss = 3.257140 (0.085 sec/b
512
- atch), lr: 0.003000
513
- 2025-12-01 16:40:46 INFO: Evaluating on dev set...
514
- 2025-12-01 16:42:43 INFO: LAS MLAS BLEX
521
- 2025-12-01 16:42:43 INFO: 61.13 52.21 55.48
522
- 2025-12-01 16:42:43 INFO: step 1100: train_loss = 3.874069, dev_score = 0.6113
523
- 2025-12-01 16:42:43 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
524
- rlm_parser.pt
525
- 2025-12-01 16:42:43 INFO: new best model saved.
526
- 2025-12-01 16:42:44 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
527
- rlm_parser_checkpoint.pt
528
- 2025-12-01 16:42:44 INFO: new model checkpoint saved.
529
- 2025-12-01 16:42:45 INFO: Finished STEP 1120/50000, loss = 3.144156 (0.083 sec/b
530
- atch), lr: 0.003000
531
- 2025-12-01 16:42:47 INFO: Finished STEP 1140/50000, loss = 4.174114 (0.088 sec/b
532
- atch), lr: 0.003000
533
- 2025-12-01 16:42:49 INFO: Finished STEP 1160/50000, loss = 4.442658 (0.086 sec/b
534
- atch), lr: 0.003000
535
- 2025-12-01 16:42:51 INFO: Finished STEP 1180/50000, loss = 5.325210 (0.087 sec/b
536
- atch), lr: 0.003000
537
- 2025-12-01 16:42:52 INFO: Finished STEP 1200/50000, loss = 3.087610 (0.078 sec/b
538
- atch), lr: 0.003000
539
- 2025-12-01 16:42:52 INFO: Evaluating on dev set...
540
- 2025-12-01 16:44:48 INFO: LAS MLAS BLEX
541
- 2025-12-01 16:44:48 INFO: 60.97 51.57 54.40
542
- 2025-12-01 16:44:48 INFO: step 1200: train_loss = 4.086701, dev_score = 0.6097
543
- 2025-12-01 16:44:49 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
544
- rlm_parser_checkpoint.pt
545
- 2025-12-01 16:44:49 INFO: new model checkpoint saved.
546
- 2025-12-01 16:44:50 INFO: Finished STEP 1220/50000, loss = 4.038983 (0.090 sec/b
547
- atch), lr: 0.003000
548
- 2025-12-01 16:44:52 INFO: Finished STEP 1240/50000, loss = 3.850602 (0.087 sec/b
549
- atch), lr: 0.003000
550
- 2025-12-01 16:44:54 INFO: Finished STEP 1260/50000, loss = 5.231193 (0.084 sec/b
551
- atch), lr: 0.003000
552
- 2025-12-01 16:44:55 INFO: Finished STEP 1280/50000, loss = 3.209912 (0.080 sec/b
553
- atch), lr: 0.003000
554
- 2025-12-01 16:44:57 INFO: Finished STEP 1300/50000, loss = 5.604947 (0.075 sec/b
555
- atch), lr: 0.003000
556
- 2025-12-01 16:44:57 INFO: Evaluating on dev set...
557
- 2025-12-01 16:46:56 INFO: LAS MLAS BLEX
558
- 2025-12-01 16:46:56 INFO: 59.37 52.73 55.46
559
- 2025-12-01 16:46:56 INFO: step 1300: train_loss = 3.929029, dev_score = 0.5937
560
- 2025-12-01 16:46:57 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
561
- rlm_parser_checkpoint.pt
562
- 2025-12-01 16:46:57 INFO: new model checkpoint saved.
563
- 2025-12-01 16:46:59 INFO: Finished STEP 1320/50000, loss = 4.691919 (0.081 sec/b
564
- atch), lr: 0.003000
565
- 2025-12-01 16:47:00 INFO: Finished STEP 1340/50000, loss = 2.670228 (0.080 sec/b
566
- atch), lr: 0.003000
567
- 2025-12-01 16:47:02 INFO: Finished STEP 1360/50000, loss = 3.275898 (0.080 sec/b
568
- atch), lr: 0.003000
569
- 2025-12-01 16:47:04 INFO: Finished STEP 1380/50000, loss = 3.515644 (0.077 sec/b
570
- atch), lr: 0.003000
571
- 2025-12-01 16:47:05 INFO: Finished STEP 1400/50000, loss = 3.714453 (0.082 sec/b
572
- atch), lr: 0.003000
573
- 2025-12-01 16:47:05 INFO: Evaluating on dev set...
574
- 2025-12-01 16:48:58 INFO: LAS MLAS BLEX
575
- 2025-12-01 16:48:58 INFO: 59.73 51.29 54.32
576
- 2025-12-01 16:48:58 INFO: step 1400: train_loss = 3.981437, dev_score = 0.5973
577
- 2025-12-01 16:48:59 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
578
- rlm_parser_checkpoint.pt
579
- 2025-12-01 16:48:59 INFO: new model checkpoint saved.
580
- 2025-12-01 16:49:00 INFO: Finished STEP 1420/50000, loss = 3.118863 (0.084 sec/b
581
- atch), lr: 0.003000
582
- 2025-12-01 16:49:02 INFO: Finished STEP 1440/50000, loss = 2.629301 (0.084 sec/b
583
- atch), lr: 0.003000
584
- 2025-12-01 16:49:04 INFO: Finished STEP 1460/50000, loss = 3.934369 (0.074 sec/b
585
- atch), lr: 0.003000
586
- 2025-12-01 16:49:05 INFO: Finished STEP 1480/50000, loss = 3.575847 (0.076 sec/b
587
- atch), lr: 0.003000
588
- 2025-12-01 16:49:07 INFO: Finished STEP 1500/50000, loss = 4.148717 (0.077 sec/b
589
- atch), lr: 0.003000
590
- 2025-12-01 16:49:07 INFO: Evaluating on dev set...
591
- 2025-12-01 16:51:05 INFO: LAS MLAS BLEX
592
- 2025-12-01 16:51:05 INFO: 60.71 51.33 54.91
593
- 2025-12-01 16:51:05 INFO: step 1500: train_loss = 3.924345, dev_score = 0.6071
594
- 2025-12-01 16:51:06 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
595
- rlm_parser_checkpoint.pt
596
- 2025-12-01 16:51:06 INFO: new model checkpoint saved.
597
- 2025-12-01 16:51:07 INFO: Finished STEP 1520/50000, loss = 3.567710 (0.077 sec/b
598
- atch), lr: 0.003000
599
- 2025-12-01 16:51:09 INFO: Finished STEP 1540/50000, loss = 3.242884 (0.077 sec/b
600
- atch), lr: 0.003000
601
- 2025-12-01 16:51:11 INFO: Finished STEP 1560/50000, loss = 4.593555 (0.065 sec/b
602
- atch), lr: 0.003000
603
- 2025-12-01 16:51:12 INFO: Finished STEP 1580/50000, loss = 4.356698 (0.079 sec/b
604
- atch), lr: 0.003000
605
- 2025-12-01 16:51:14 INFO: Finished STEP 1600/50000, loss = 5.182871 (0.080 sec/b
606
- atch), lr: 0.003000
607
- 2025-12-01 16:51:14 INFO: Evaluating on dev set...
608
- 2025-12-01 16:53:07 INFO: LAS MLAS BLEX
609
- 2025-12-01 16:53:07 INFO: 62.48 53.80 56.91
610
- 2025-12-01 16:53:07 INFO: step 1600: train_loss = 4.170646, dev_score = 0.6248
611
- 2025-12-01 16:53:07 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
612
- rlm_parser.pt
613
- 2025-12-01 16:53:07 INFO: new best model saved.
614
- 2025-12-01 16:53:08 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
615
- rlm_parser_checkpoint.pt
616
- 2025-12-01 16:53:08 INFO: new model checkpoint saved.
617
- 2025-12-01 16:53:09 INFO: Finished STEP 1620/50000, loss = 4.101035 (0.081 sec/b
618
- atch), lr: 0.003000
619
- 2025-12-01 16:53:11 INFO: Finished STEP 1640/50000, loss = 3.558151 (0.074 sec/b
620
- atch), lr: 0.003000
621
- 2025-12-01 16:53:13 INFO: Finished STEP 1660/50000, loss = 7.025590 (0.083 sec/b
622
- atch), lr: 0.003000
623
- 2025-12-01 16:53:14 INFO: Finished STEP 1680/50000, loss = 4.603199 (0.066 sec/b
624
- atch), lr: 0.003000
625
- 2025-12-01 16:53:16 INFO: Finished STEP 1700/50000, loss = 4.706470 (0.079 sec/b
626
- atch), lr: 0.003000
627
- 2025-12-01 16:53:16 INFO: Evaluating on dev set...
628
- 2025-12-01 16:55:13 INFO: LAS MLAS BLEX
629
- 2025-12-01 16:55:13 INFO: 62.02 52.18 54.94
630
- 2025-12-01 16:55:13 INFO: step 1700: train_loss = 4.060353, dev_score = 0.6202
631
- 2025-12-01 16:55:14 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
632
- rlm_parser_checkpoint.pt
633
- 2025-12-01 16:55:14 INFO: new model checkpoint saved.
634
- 2025-12-01 16:55:16 INFO: Finished STEP 1720/50000, loss = 2.938492 (0.077 sec/b
635
- atch), lr: 0.003000
636
- 2025-12-01 16:55:17 INFO: Finished STEP 1740/50000, loss = 3.804090 (0.076 sec/b
637
- atch), lr: 0.003000
638
- 2025-12-01 16:55:19 INFO: Finished STEP 1760/50000, loss = 3.830248 (0.075 sec/b
639
- atch), lr: 0.003000
640
- 2025-12-01 16:55:20 INFO: Finished STEP 1780/50000, loss = 3.865887 (0.075 sec/b
641
- atch), lr: 0.003000
642
- 2025-12-01 16:55:22 INFO: Finished STEP 1800/50000, loss = 4.167078 (0.073 sec/b
643
- atch), lr: 0.003000
644
- 2025-12-01 16:55:22 INFO: Evaluating on dev set...
645
- 2025-12-01 16:57:17 INFO: LAS MLAS BLEX
646
- 2025-12-01 16:57:17 INFO: 61.16 53.01 55.95
647
- 2025-12-01 16:57:17 INFO: step 1800: train_loss = 3.881957, dev_score = 0.6116
648
- 2025-12-01 16:57:17 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
649
- rlm_parser_checkpoint.pt
650
- 2025-12-01 16:57:17 INFO: new model checkpoint saved.
651
- 2025-12-01 16:57:19 INFO: Finished STEP 1820/50000, loss = 3.780247 (0.078 sec/b
652
- atch), lr: 0.003000
653
- 2025-12-01 16:57:20 INFO: Finished STEP 1840/50000, loss = 2.951214 (0.066 sec/b
654
- atch), lr: 0.003000
655
- 2025-12-01 16:57:22 INFO: Finished STEP 1860/50000, loss = 3.275378 (0.065 sec/b
656
- atch), lr: 0.003000
657
- 2025-12-01 16:57:23 INFO: Finished STEP 1880/50000, loss = 2.737443 (0.074 sec/b
658
- atch), lr: 0.003000
659
- 2025-12-01 16:57:25 INFO: Finished STEP 1900/50000, loss = 4.514575 (0.079 sec/b
660
- atch), lr: 0.003000
661
- 2025-12-01 16:57:25 INFO: Evaluating on dev set...
662
- 2025-12-01 16:59:24 INFO: LAS MLAS BLEX
663
- 2025-12-01 16:59:24 INFO: 62.05 52.61 54.91
664
- 2025-12-01 16:59:24 INFO: step 1900: train_loss = 3.984699, dev_score = 0.6205
665
- 2025-12-01 16:59:25 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
666
- rlm_parser_checkpoint.pt
667
- 2025-12-01 16:59:25 INFO: new model checkpoint saved.
668
- 2025-12-01 16:59:26 INFO: Finished STEP 1920/50000, loss = 4.856260 (0.083 sec/b
669
- atch), lr: 0.003000
670
- 2025-12-01 16:59:28 INFO: Finished STEP 1940/50000, loss = 3.227711 (0.069 sec/b
671
- atch), lr: 0.003000
672
- 2025-12-01 16:59:29 INFO: Finished STEP 1960/50000, loss = 3.943364 (0.069 sec/b
673
- atch), lr: 0.003000
674
- 2025-12-01 16:59:31 INFO: Finished STEP 1980/50000, loss = 3.673366 (0.078 sec/b
675
- atch), lr: 0.003000
676
- 2025-12-01 16:59:32 INFO: Finished STEP 2000/50000, loss = 4.965534 (0.072 sec/b
677
- atch), lr: 0.003000
678
- 2025-12-01 16:59:32 INFO: Evaluating on dev set...
679
- 2025-12-01 17:01:25 INFO: LAS MLAS BLEX
680
- 2025-12-01 17:01:25 INFO: 61.96 52.97 56.59
681
- 2025-12-01 17:01:25 INFO: step 2000: train_loss = 3.946571, dev_score = 0.6196
682
- 2025-12-01 17:01:26 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
683
- rlm_parser_checkpoint.pt
684
- 2025-12-01 17:01:26 INFO: new model checkpoint saved.
685
- 2025-12-01 17:01:28 INFO: Finished STEP 2020/50000, loss = 3.939468 (0.071 sec/b
686
- atch), lr: 0.003000
687
- 2025-12-01 17:01:29 INFO: Finished STEP 2040/50000, loss = 4.461051 (0.069 sec/b
688
- atch), lr: 0.003000
689
- 2025-12-01 17:01:31 INFO: Finished STEP 2060/50000, loss = 2.162975 (0.075 sec/b
690
- atch), lr: 0.003000
691
- 2025-12-01 17:01:32 INFO: Finished STEP 2080/50000, loss = 3.416358 (0.074 sec/b
692
- atch), lr: 0.003000
693
- 2025-12-01 17:01:33 INFO: Finished STEP 2100/50000, loss = 4.449632 (0.076 sec/b
694
- atch), lr: 0.003000
695
- 2025-12-01 17:01:33 INFO: Evaluating on dev set...
696
- 2025-12-01 17:03:32 INFO: LAS MLAS BLEX
697
- 2025-12-01 17:03:32 INFO: 62.67 53.52 56.73
698
- 2025-12-01 17:03:32 INFO: step 2100: train_loss = 3.860987, dev_score = 0.6267
699
- 2025-12-01 17:03:32 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
700
- rlm_parser.pt
701
- 2025-12-01 17:03:32 INFO: new best model saved.
702
- 2025-12-01 17:03:32 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
703
- rlm_parser_checkpoint.pt
704
- 2025-12-01 17:03:32 INFO: new model checkpoint saved.
705
- 2025-12-01 17:03:34 INFO: Finished STEP 2120/50000, loss = 2.672351 (0.070 sec/b
706
- atch), lr: 0.003000
707
- 2025-12-01 17:03:35 INFO: Finished STEP 2140/50000, loss = 4.442038 (0.078 sec/b
708
- atch), lr: 0.003000
709
- 2025-12-01 17:03:37 INFO: Finished STEP 2160/50000, loss = 2.905113 (0.067 sec/b
710
- atch), lr: 0.003000
711
- 2025-12-01 17:03:38 INFO: Finished STEP 2180/50000, loss = 3.549378 (0.071 sec/b
712
- atch), lr: 0.003000
713
- 2025-12-01 17:03:40 INFO: Finished STEP 2200/50000, loss = 3.567661 (0.076 sec/b
714
- atch), lr: 0.003000
715
- 2025-12-01 17:03:40 INFO: Evaluating on dev set...
716
- 2025-12-01 17:05:34 INFO: LAS MLAS BLEX
717
- 2025-12-01 17:05:34 INFO: 60.38 51.57 54.44
718
- 2025-12-01 17:05:34 INFO: step 2200: train_loss = 3.920414, dev_score = 0.6038
719
- 2025-12-01 17:05:34 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
720
- rlm_parser_checkpoint.pt
721
- 2025-12-01 17:05:34 INFO: new model checkpoint saved.
722
- 2025-12-01 17:05:36 INFO: Finished STEP 2220/50000, loss = 4.313206 (0.070 sec/b
723
- atch), lr: 0.003000
724
- 2025-12-01 17:05:37 INFO: Finished STEP 2240/50000, loss = 3.860752 (0.066 sec/b
725
- atch), lr: 0.003000
726
- 2025-12-01 17:05:39 INFO: Finished STEP 2260/50000, loss = 3.410686 (0.072 sec/b
727
- atch), lr: 0.003000
728
- 2025-12-01 17:05:40 INFO: Finished STEP 2280/50000, loss = 3.162115 (0.075 sec/b
729
- atch), lr: 0.003000
730
- 2025-12-01 17:05:42 INFO: Finished STEP 2300/50000, loss = 4.265501 (0.070 sec/b
731
- atch), lr: 0.003000
732
- 2025-12-01 17:05:42 INFO: Evaluating on dev set...
733
- 2025-12-01 17:07:40 INFO: LAS MLAS BLEX
734
- 2025-12-01 17:07:40 INFO: 61.01 52.13 55.29
735
- 2025-12-01 17:07:40 INFO: step 2300: train_loss = 3.963177, dev_score = 0.6101
736
- 2025-12-01 17:07:41 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
737
- rlm_parser_checkpoint.pt
738
- 2025-12-01 17:07:41 INFO: new model checkpoint saved.
739
- 2025-12-01 17:07:42 INFO: Finished STEP 2320/50000, loss = 3.519798 (0.073 sec/b
740
- atch), lr: 0.003000
741
- 2025-12-01 17:07:44 INFO: Finished STEP 2340/50000, loss = 2.976451 (0.076 sec/b
742
- atch), lr: 0.003000
743
- 2025-12-01 17:07:45 INFO: Finished STEP 2360/50000, loss = 4.665425 (0.067 sec/b
744
- atch), lr: 0.003000
745
- 2025-12-01 17:07:46 INFO: Finished STEP 2380/50000, loss = 2.257787 (0.073 sec/b
746
- atch), lr: 0.003000
747
- 2025-12-01 17:07:48 INFO: Finished STEP 2400/50000, loss = 5.604243 (0.059 sec/b
748
- atch), lr: 0.003000
749
- 2025-12-01 17:07:48 INFO: Evaluating on dev set...
750
- 2025-12-01 17:09:43 INFO: LAS MLAS BLEX
751
- 2025-12-01 17:09:43 INFO: 61.79 52.79 55.99
752
- 2025-12-01 17:09:43 INFO: step 2400: train_loss = 3.984401, dev_score = 0.6179
753
- 2025-12-01 17:09:44 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
754
- rlm_parser_checkpoint.pt
755
- 2025-12-01 17:09:44 INFO: new model checkpoint saved.
756
- 2025-12-01 17:09:45 INFO: Finished STEP 2420/50000, loss = 5.101899 (0.066 sec/b
757
- atch), lr: 0.003000
758
- 2025-12-01 17:09:46 INFO: Finished STEP 2440/50000, loss = 4.179480 (0.073 sec/b
759
- atch), lr: 0.003000
760
- 2025-12-01 17:09:48 INFO: Finished STEP 2460/50000, loss = 5.012446 (0.067 sec/b
761
- atch), lr: 0.003000
762
- 2025-12-01 17:09:49 INFO: Finished STEP 2480/50000, loss = 2.745690 (0.064 sec/b
763
- atch), lr: 0.003000
764
- 2025-12-01 17:09:51 INFO: Finished STEP 2500/50000, loss = 3.649034 (0.063 sec/b
765
- atch), lr: 0.003000
766
- 2025-12-01 17:09:51 INFO: Evaluating on dev set...
767
- 2025-12-01 17:11:49 INFO: LAS MLAS BLEX
768
- 2025-12-01 17:11:49 INFO: 61.16 51.55 55.46
769
- 2025-12-01 17:11:49 INFO: step 2500: train_loss = 3.964052, dev_score = 0.6116
770
- 2025-12-01 17:11:50 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
771
- rlm_parser_checkpoint.pt
772
- 2025-12-01 17:11:50 INFO: new model checkpoint saved.
773
- 2025-12-01 17:11:51 INFO: Finished STEP 2520/50000, loss = 2.673005 (0.063 sec/b
774
- atch), lr: 0.003000
775
- 2025-12-01 17:11:53 INFO: Finished STEP 2540/50000, loss = 3.688977 (0.067 sec/b
776
- atch), lr: 0.003000
777
- 2025-12-01 17:11:54 INFO: Finished STEP 2560/50000, loss = 4.394986 (0.068 sec/b
778
- atch), lr: 0.003000
779
- 2025-12-01 17:11:56 INFO: Finished STEP 2580/50000, loss = 3.482469 (0.065 sec/b
780
- atch), lr: 0.003000
781
- 2025-12-01 17:11:57 INFO: Finished STEP 2600/50000, loss = 3.764716 (0.071 sec/b
782
- atch), lr: 0.003000
783
- 2025-12-01 17:11:57 INFO: Evaluating on dev set...
784
- 2025-12-01 17:13:51 INFO: LAS MLAS BLEX
785
- 2025-12-01 17:13:51 INFO: 62.01 53.21 56.43
786
- 2025-12-01 17:13:51 INFO: step 2600: train_loss = 4.141107, dev_score = 0.6201
787
- 2025-12-01 17:13:52 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
788
- rlm_parser_checkpoint.pt
789
- 2025-12-01 17:13:52 INFO: new model checkpoint saved.
790
- 2025-12-01 17:13:53 INFO: Finished STEP 2620/50000, loss = 2.570758 (0.069 sec/b
791
- atch), lr: 0.003000
792
- 2025-12-01 17:13:54 INFO: Finished STEP 2640/50000, loss = 6.978942 (0.073 sec/b
793
- atch), lr: 0.003000
794
- 2025-12-01 17:13:56 INFO: Finished STEP 2660/50000, loss = 4.613503 (0.070 sec/b
795
- atch), lr: 0.003000
796
- 2025-12-01 17:13:57 INFO: Finished STEP 2680/50000, loss = 3.846519 (0.069 sec/b
797
- atch), lr: 0.003000
798
- 2025-12-01 17:13:59 INFO: Finished STEP 2700/50000, loss = 3.728184 (0.073 sec/b
799
- atch), lr: 0.003000
800
- 2025-12-01 17:13:59 INFO: Evaluating on dev set...
801
- 2025-12-01 17:15:57 INFO: LAS MLAS BLEX
802
- 2025-12-01 17:15:57 INFO: 61.13 52.82 56.19
803
- 2025-12-01 17:15:57 INFO: step 2700: train_loss = 4.055157, dev_score = 0.6113
804
- 2025-12-01 17:15:58 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
805
- rlm_parser_checkpoint.pt
806
- 2025-12-01 17:15:58 INFO: new model checkpoint saved.
807
- 2025-12-01 17:15:59 INFO: Finished STEP 2720/50000, loss = 3.366929 (0.063 sec/b
808
- atch), lr: 0.003000
809
- 2025-12-01 17:16:01 INFO: Finished STEP 2740/50000, loss = 4.805958 (0.073 sec/b
810
- atch), lr: 0.003000
811
- 2025-12-01 17:16:02 INFO: Finished STEP 2760/50000, loss = 3.950007 (0.074 sec/b
812
- atch), lr: 0.003000
813
- 2025-12-01 17:16:03 INFO: Finished STEP 2780/50000, loss = 4.024664 (0.061 sec/b
814
- atch), lr: 0.003000
815
- 2025-12-01 17:16:05 INFO: Finished STEP 2800/50000, loss = 3.126366 (0.068 sec/b
816
- atch), lr: 0.003000
817
- 2025-12-01 17:16:05 INFO: Evaluating on dev set...
818
- 2025-12-01 17:17:59 INFO: LAS MLAS BLEX
819
- 2025-12-01 17:17:59 INFO: 62.28 54.02 56.91
820
- 2025-12-01 17:17:59 INFO: step 2800: train_loss = 4.068391, dev_score = 0.6228
821
- 2025-12-01 17:18:00 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
822
- rlm_parser_checkpoint.pt
823
- 2025-12-01 17:18:00 INFO: new model checkpoint saved.
824
- 2025-12-01 17:18:01 INFO: Finished STEP 2820/50000, loss = 2.825658 (0.066 sec/b
825
- atch), lr: 0.003000
826
- 2025-12-01 17:18:02 INFO: Finished STEP 2840/50000, loss = 3.183490 (0.068 sec/b
827
- atch), lr: 0.003000
828
- 2025-12-01 17:18:04 INFO: Finished STEP 2860/50000, loss = 2.668858 (0.069 sec/b
829
- atch), lr: 0.003000
830
- 2025-12-01 17:18:05 INFO: Finished STEP 2880/50000, loss = 3.450292 (0.076 sec/b
831
- atch), lr: 0.003000
832
- 2025-12-01 17:18:07 INFO: Finished STEP 2900/50000, loss = 4.068074 (0.062 sec/b
833
- atch), lr: 0.003000
834
- 2025-12-01 17:18:07 INFO: Evaluating on dev set...
835
- 2025-12-01 17:20:06 INFO: LAS MLAS BLEX
836
- 2025-12-01 17:20:06 INFO: 61.00 51.68 55.30
837
- 2025-12-01 17:20:06 INFO: step 2900: train_loss = 3.876318, dev_score = 0.6100
838
- 2025-12-01 17:20:07 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
839
- rlm_parser_checkpoint.pt
840
- 2025-12-01 17:20:07 INFO: new model checkpoint saved.
841
- 2025-12-01 17:20:08 INFO: Finished STEP 2920/50000, loss = 3.005747 (0.069 sec/b
842
- atch), lr: 0.003000
843
- 2025-12-01 17:20:09 INFO: Finished STEP 2940/50000, loss = 5.153252 (0.072 sec/b
844
- atch), lr: 0.003000
845
- 2025-12-01 17:20:11 INFO: Finished STEP 2960/50000, loss = 5.392646 (0.071 sec/b
846
- atch), lr: 0.003000
847
- 2025-12-01 17:20:12 INFO: Finished STEP 2980/50000, loss = 3.432074 (0.066 sec/b
848
- atch), lr: 0.003000
849
- 2025-12-01 17:20:13 INFO: Finished STEP 3000/50000, loss = 3.725832 (0.069 sec/b
850
- atch), lr: 0.003000
851
- 2025-12-01 17:20:13 INFO: Evaluating on dev set...
852
- 2025-12-01 17:22:08 INFO: LAS MLAS BLEX
853
- 2025-12-01 17:22:08 INFO: 62.78 54.12 56.96
854
- 2025-12-01 17:22:08 INFO: step 3000: train_loss = 3.861603, dev_score = 0.6278
855
- 2025-12-01 17:22:08 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
856
- rlm_parser.pt
857
- 2025-12-01 17:22:08 INFO: new best model saved.
858
- 2025-12-01 17:22:09 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
859
- rlm_parser_checkpoint.pt
860
- 2025-12-01 17:22:09 INFO: new model checkpoint saved.
861
- 2025-12-01 17:22:10 INFO: Finished STEP 3020/50000, loss = 2.951687 (0.067 sec/b
862
- atch), lr: 0.003000
863
- 2025-12-01 17:22:11 INFO: Finished STEP 3040/50000, loss = 4.273626 (0.071 sec/b
864
- atch), lr: 0.003000
865
- 2025-12-01 17:22:13 INFO: Finished STEP 3060/50000, loss = 5.322878 (0.062 sec/b
866
- atch), lr: 0.003000
867
- 2025-12-01 17:22:14 INFO: Finished STEP 3080/50000, loss = 2.738621 (0.064 sec/b
868
- atch), lr: 0.003000
869
- 2025-12-01 17:22:15 INFO: Finished STEP 3100/50000, loss = 4.397041 (0.068 sec/b
870
- atch), lr: 0.003000
871
- 2025-12-01 17:22:15 INFO: Evaluating on dev set...
872
- 2025-12-01 17:24:14 INFO: LAS MLAS BLEX
873
- 2025-12-01 17:24:14 INFO: 62.26 52.51 56.39
874
- 2025-12-01 17:24:14 INFO: step 3100: train_loss = 3.994215, dev_score = 0.6226
875
- 2025-12-01 17:24:15 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
876
- rlm_parser_checkpoint.pt
877
- 2025-12-01 17:24:15 INFO: new model checkpoint saved.
878
- 2025-12-01 17:24:16 INFO: Finished STEP 3120/50000, loss = 3.872339 (0.061 sec/b
879
- atch), lr: 0.003000
880
- 2025-12-01 17:24:17 INFO: Finished STEP 3140/50000, loss = 3.386866 (0.061 sec/b
881
- atch), lr: 0.003000
882
- 2025-12-01 17:24:19 INFO: Finished STEP 3160/50000, loss = 4.323652 (0.067 sec/b
883
- atch), lr: 0.003000
884
- 2025-12-01 17:24:20 INFO: Finished STEP 3180/50000, loss = 3.043522 (0.066 sec/b
885
- atch), lr: 0.003000
886
- 2025-12-01 17:24:21 INFO: Finished STEP 3200/50000, loss = 4.578977 (0.066 sec/b
887
- atch), lr: 0.003000
888
- 2025-12-01 17:24:21 INFO: Evaluating on dev set...
889
- 2025-12-01 17:26:15 INFO: LAS MLAS BLEX
890
- 2025-12-01 17:26:15 INFO: 62.79 53.58 56.52
891
- 2025-12-01 17:26:15 INFO: step 3200: train_loss = 3.890012, dev_score = 0.6279
892
- 2025-12-01 17:26:15 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
893
- rlm_parser.pt
894
- 2025-12-01 17:26:15 INFO: new best model saved.
895
- 2025-12-01 17:26:16 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
896
- rlm_parser_checkpoint.pt
897
- 2025-12-01 17:26:16 INFO: new model checkpoint saved.
898
- 2025-12-01 17:26:17 INFO: Finished STEP 3220/50000, loss = 5.345606 (0.069 sec/b
899
- atch), lr: 0.003000
900
- 2025-12-01 17:26:19 INFO: Finished STEP 3240/50000, loss = 4.361881 (0.060 sec/b
901
- atch), lr: 0.003000
902
- 2025-12-01 17:26:20 INFO: Finished STEP 3260/50000, loss = 3.357010 (0.066 sec/b
903
- atch), lr: 0.003000
904
- 2025-12-01 17:26:21 INFO: Finished STEP 3280/50000, loss = 3.802068 (0.069 sec/b
905
- atch), lr: 0.003000
906
- 2025-12-01 17:26:23 INFO: Finished STEP 3300/50000, loss = 5.973460 (0.067 sec/b
907
- atch), lr: 0.003000
908
- 2025-12-01 17:26:23 INFO: Evaluating on dev set...
909
- 2025-12-01 17:28:21 INFO: LAS MLAS BLEX
910
- 2025-12-01 17:28:21 INFO: 62.45 53.43 56.45
911
- 2025-12-01 17:28:21 INFO: step 3300: train_loss = 3.894302, dev_score = 0.6245
912
- 2025-12-01 17:28:22 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
913
- rlm_parser_checkpoint.pt
914
- 2025-12-01 17:28:22 INFO: new model checkpoint saved.
915
- 2025-12-01 17:28:23 INFO: Finished STEP 3320/50000, loss = 3.873763 (0.079 sec/b
916
- atch), lr: 0.003000
917
- 2025-12-01 17:28:25 INFO: Finished STEP 3340/50000, loss = 4.907520 (0.057 sec/b
918
- atch), lr: 0.003000
919
- 2025-12-01 17:28:26 INFO: Finished STEP 3360/50000, loss = 4.048786 (0.066 sec/b
920
- atch), lr: 0.003000
921
- 2025-12-01 17:28:27 INFO: Finished STEP 3380/50000, loss = 3.719656 (0.059 sec/b
922
- atch), lr: 0.003000
923
- 2025-12-01 17:28:28 INFO: Finished STEP 3400/50000, loss = 5.070821 (0.071 sec/b
924
- atch), lr: 0.003000
925
- 2025-12-01 17:28:28 INFO: Evaluating on dev set...
926
- 2025-12-01 17:30:25 INFO: LAS MLAS BLEX
927
- 2025-12-01 17:30:25 INFO: 61.85 52.74 55.92
928
- 2025-12-01 17:30:25 INFO: step 3400: train_loss = 4.035826, dev_score = 0.6185
929
- 2025-12-01 17:30:25 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
930
- rlm_parser_checkpoint.pt
931
- 2025-12-01 17:30:25 INFO: new model checkpoint saved.
932
- 2025-12-01 17:30:26 INFO: Finished STEP 3420/50000, loss = 3.293426 (0.061 sec/b
933
- atch), lr: 0.003000
934
- 2025-12-01 17:30:28 INFO: Finished STEP 3440/50000, loss = 2.864790 (0.066 sec/b
935
- atch), lr: 0.003000
936
- 2025-12-01 17:30:29 INFO: Finished STEP 3460/50000, loss = 3.674538 (0.063 sec/b
937
- atch), lr: 0.003000
938
- 2025-12-01 17:30:30 INFO: Finished STEP 3480/50000, loss = 1.785154 (0.065 sec/b
939
- atch), lr: 0.003000
940
- 2025-12-01 17:30:32 INFO: Finished STEP 3500/50000, loss = 4.523306 (0.068 sec/b
941
- atch), lr: 0.003000
942
- 2025-12-01 17:30:32 INFO: Evaluating on dev set...
943
- 2025-12-01 17:32:31 INFO: LAS MLAS BLEX
944
- 2025-12-01 17:32:31 INFO: 62.33 53.13 56.08
945
- 2025-12-01 17:32:31 INFO: step 3500: train_loss = 4.018484, dev_score = 0.6233
946
- 2025-12-01 17:32:31 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
947
- rlm_parser_checkpoint.pt
948
- 2025-12-01 17:32:31 INFO: new model checkpoint saved.
949
- 2025-12-01 17:32:32 INFO: Finished STEP 3520/50000, loss = 4.353161 (0.063 sec/b
950
- atch), lr: 0.003000
951
- 2025-12-01 17:32:34 INFO: Finished STEP 3540/50000, loss = 2.999902 (0.064 sec/b
952
- atch), lr: 0.003000
953
- 2025-12-01 17:32:35 INFO: Finished STEP 3560/50000, loss = 3.458462 (0.066 sec/b
954
- atch), lr: 0.003000
955
- 2025-12-01 17:32:36 INFO: Finished STEP 3580/50000, loss = 4.301951 (0.062 sec/b
956
- atch), lr: 0.003000
957
- 2025-12-01 17:32:38 INFO: Finished STEP 3600/50000, loss = 5.557861 (0.067 sec/b
958
- atch), lr: 0.003000
959
- 2025-12-01 17:32:38 INFO: Evaluating on dev set...
960
- 2025-12-01 17:34:32 INFO: LAS MLAS BLEX
961
- 2025-12-01 17:34:32 INFO: 61.84 52.45 55.64
962
- 2025-12-01 17:34:32 INFO: step 3600: train_loss = 4.084339, dev_score = 0.6184
963
- 2025-12-01 17:34:32 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
964
- rlm_parser_checkpoint.pt
965
- 2025-12-01 17:34:32 INFO: new model checkpoint saved.
966
- 2025-12-01 17:34:34 INFO: Finished STEP 3620/50000, loss = 5.347703 (0.059 sec/b
967
- atch), lr: 0.003000
968
- 2025-12-01 17:34:35 INFO: Finished STEP 3640/50000, loss = 5.321960 (0.067 sec/b
969
- atch), lr: 0.003000
970
- 2025-12-01 17:34:36 INFO: Finished STEP 3660/50000, loss = 4.301494 (0.066 sec/b
971
- atch), lr: 0.003000
972
- 2025-12-01 17:34:38 INFO: Finished STEP 3680/50000, loss = 4.353370 (0.070 sec/b
973
- atch), lr: 0.003000
974
- 2025-12-01 17:34:39 INFO: Finished STEP 3700/50000, loss = 4.955180 (0.058 sec/b
975
- atch), lr: 0.003000
976
- 2025-12-01 17:34:39 INFO: Evaluating on dev set...
977
- 2025-12-01 17:36:37 INFO: LAS MLAS BLEX
978
- 2025-12-01 17:36:37 INFO: 59.97 49.83 54.29
979
- 2025-12-01 17:36:37 INFO: step 3700: train_loss = 4.086822, dev_score = 0.5997
980
- 2025-12-01 17:36:37 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
981
- rlm_parser_checkpoint.pt
982
- 2025-12-01 17:36:37 INFO: new model checkpoint saved.
983
- 2025-12-01 17:36:39 INFO: Finished STEP 3720/50000, loss = 4.359533 (0.068 sec/b
984
- atch), lr: 0.003000
985
- 2025-12-01 17:36:40 INFO: Finished STEP 3740/50000, loss = 2.651897 (0.062 sec/b
986
- atch), lr: 0.003000
987
- 2025-12-01 17:36:41 INFO: Finished STEP 3760/50000, loss = 3.658015 (0.064 sec/b
988
- atch), lr: 0.003000
989
- 2025-12-01 17:36:43 INFO: Finished STEP 3780/50000, loss = 3.624045 (0.063 sec/b
990
- atch), lr: 0.003000
991
- 2025-12-01 17:36:44 INFO: Finished STEP 3800/50000, loss = 7.041559 (0.063 sec/b
992
- atch), lr: 0.003000
993
- 2025-12-01 17:36:44 INFO: Evaluating on dev set...
994
- 2025-12-01 17:38:39 INFO: LAS MLAS BLEX
995
- 2025-12-01 17:38:39 INFO: 61.31 52.57 56.01
996
- 2025-12-01 17:38:39 INFO: step 3800: train_loss = 4.042107, dev_score = 0.6131
997
- 2025-12-01 17:38:39 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
998
- rlm_parser_checkpoint.pt
999
- 2025-12-01 17:38:39 INFO: new model checkpoint saved.
1000
- 2025-12-01 17:38:41 INFO: Finished STEP 3820/50000, loss = 4.186870 (0.067 sec/b
1001
- atch), lr: 0.003000
1002
- 2025-12-01 17:38:42 INFO: Finished STEP 3840/50000, loss = 2.111587 (0.064 sec/b
1003
- atch), lr: 0.003000
1004
- 2025-12-01 17:38:43 INFO: Finished STEP 3860/50000, loss = 4.480252 (0.061 sec/b
1005
- atch), lr: 0.003000
1006
- 2025-12-01 17:38:45 INFO: Finished STEP 3880/50000, loss = 4.170528 (0.063 sec/b
1007
- atch), lr: 0.003000
1008
- 2025-12-01 17:38:46 INFO: Finished STEP 3900/50000, loss = 5.639600 (0.056 sec/b
1009
- atch), lr: 0.003000
1010
- 2025-12-01 17:38:46 INFO: Evaluating on dev set...
1011
- 2025-12-01 17:40:43 INFO: LAS MLAS BLEX
1013
- 2025-12-01 17:40:43 INFO: 60.23 50.12 53.96
1014
- 2025-12-01 17:40:43 INFO: step 3900: train_loss = 4.067155, dev_score = 0.6023
1015
- 2025-12-01 17:40:44 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
1016
- rlm_parser_checkpoint.pt
1017
- 2025-12-01 17:40:44 INFO: new model checkpoint saved.
1018
- 2025-12-01 17:40:45 INFO: Finished STEP 3920/50000, loss = 9.129135 (0.070 sec/b
1019
- atch), lr: 0.003000
1020
- 2025-12-01 17:40:46 INFO: Finished STEP 3940/50000, loss = 3.588398 (0.059 sec/b
1021
- atch), lr: 0.003000
1022
- 2025-12-01 17:40:48 INFO: Finished STEP 3960/50000, loss = 3.612643 (0.062 sec/b
1023
- atch), lr: 0.003000
1024
- 2025-12-01 17:40:49 INFO: Finished STEP 3980/50000, loss = 4.231903 (0.062 sec/b
1025
- atch), lr: 0.003000
1026
- 2025-12-01 17:40:50 INFO: Finished STEP 4000/50000, loss = 3.722571 (0.064 sec/b
1027
- atch), lr: 0.003000
1028
- 2025-12-01 17:40:50 INFO: Evaluating on dev set...
1029
- 2025-12-01 17:42:43 INFO: LAS MLAS BLEX
1030
- 2025-12-01 17:42:43 INFO: 60.84 50.76 53.91
1031
- 2025-12-01 17:42:43 INFO: step 4000: train_loss = 3.990770, dev_score = 0.6084
1032
- 2025-12-01 17:42:44 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
1033
- rlm_parser_checkpoint.pt
1034
- 2025-12-01 17:42:44 INFO: new model checkpoint saved.
1035
- 2025-12-01 17:42:45 INFO: Finished STEP 4020/50000, loss = 3.633982 (0.063 sec/b
1036
- atch), lr: 0.003000
1037
- 2025-12-01 17:42:46 INFO: Finished STEP 4040/50000, loss = 4.624004 (0.061 sec/b
1038
- atch), lr: 0.003000
1039
- 2025-12-01 17:42:48 INFO: Finished STEP 4060/50000, loss = 4.380543 (0.065 sec/b
1040
- atch), lr: 0.003000
1041
- 2025-12-01 17:42:49 INFO: Finished STEP 4080/50000, loss = 3.084680 (0.067 sec/b
1042
- atch), lr: 0.003000
1043
- 2025-12-01 17:42:50 INFO: Finished STEP 4100/50000, loss = 5.554842 (0.061 sec/b
1044
- atch), lr: 0.003000
1045
- 2025-12-01 17:42:50 INFO: Evaluating on dev set...
1046
- 2025-12-01 17:44:49 INFO: LAS MLAS BLEX
1047
- 2025-12-01 17:44:49 INFO: 61.42 52.33 55.75
1048
- 2025-12-01 17:44:49 INFO: step 4100: train_loss = 4.179982, dev_score = 0.6142
1049
- 2025-12-01 17:44:50 INFO: Model saved to saved_models/depparse/sv_diachronic_cha
1050
- rlm_parser_checkpoint.pt
1051
- 2025-12-01 17:44:50 INFO: new model checkpoint saved.
1052
- 2025-12-01 17:44:51 INFO: Finished STEP 4120/50000, loss = 3.535981 (0.061 sec/b
1053
- atch), lr: 0.003000
1054
- 2025-12-01 17:44:53 INFO: Finished STEP 4140/50000, loss = 4.389372 (0.066 sec/b
1055
- atch), lr: 0.003000
1056
- 2025-12-01 17:44:54 INFO: Finished STEP 4160/50000, loss = 4.418224 (0.059 sec/b
1057
- atch), lr: 0.003000
1058
- 2025-12-01 17:44:55 INFO: Finished STEP 4180/50000, loss = 4.264222 (0.064 sec/b
1059
- atch), lr: 0.003000
1060
- 2025-12-01 17:44:56 INFO: Finished STEP 4200/50000, loss = 4.714503 (0.057 sec/b
1061
- atch), lr: 0.003000
1062
- 2025-12-01 17:44:56 INFO: Evaluating on dev set...
1063
- 2025-12-01 17:46:54 INFO: LAS MLAS BLEX
1065
- 2025-12-01 17:46:54 INFO: 62.40 52.94 56.39
1066
- 2025-12-01 17:46:54 INFO: step 4200: train_loss = 3.973738, dev_score = 0.6240
1067
- 2025-12-01 17:46:54 INFO: Training ended with 4200 steps.
1068
- 2025-12-01 17:46:54 INFO: Best dev F1 = 62.79, at iteration = 3200
1069
- 2025-12-01 17:46:54 INFO: Running dev depparse for UD_Swedish-diachronic with ar
1070
- gs ['--wordvec_dir', '/cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain',
1071
- '--eval_file', '/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/data/dep
1072
- parse/sv_diachronic.dev.in.conllu', '--lang', 'sv', '--shorthand', 'sv_diachroni
1073
- c', '--mode', 'predict', '--charlm', '--charlm_shorthand', 'sv_conll17', '--char
1074
- lm_forward_file', '/cephyr/users/cleland/Alvis/stanza_resources/sv/forward_charl
1075
- m/conll17.pt', '--charlm_backward_file', '/cephyr/users/cleland/Alvis/stanza_res
1076
- ources/sv/backward_charlm/conll17.pt', '--wordvec_pretrain_file', '/cephyr/users
1077
- /cleland/Alvis/stanza_resources/sv/pretrain/diachronic.pt', '--batch_size', '32'
1078
- , '--dropout', '0.33']
1079
- 2025-12-01 17:46:54 INFO: Running parser in predict mode
1080
- 2025-12-01 17:46:54 INFO: Loading model from: saved_models/depparse/sv_diachroni
1081
- c_charlm_parser.pt
1082
- 2025-12-01 17:46:57 DEBUG: Loaded pretrain from /cephyr/users/cleland/Alvis/stan
1083
- za_resources/sv/pretrain/diachronic.pt
1084
- 2025-12-01 17:46:57 DEBUG: Depparse model loading charmodels: /cephyr/users/clel
1085
- and/Alvis/stanza_resources/sv/forward_charlm/conll17.pt and /cephyr/users/clelan
1086
- d/Alvis/stanza_resources/sv/backward_charlm/conll17.pt
1087
- 2025-12-01 17:46:57 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
1088
- a_resources/sv/forward_charlm/conll17.pt
1089
- 2025-12-01 17:46:57 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
1090
- a_resources/sv/backward_charlm/conll17.pt
1091
- 2025-12-01 17:46:58 DEBUG: Building Adam with lr=0.003000, betas=(0.9, 0.95), ep
1092
- s=0.000001
1093
- 2025-12-01 17:46:58 INFO: Loading data with batch size 32...
1094
- 2025-12-01 17:47:00 DEBUG: 3869 batches created.
1095
- 2025-12-01 17:48:51 INFO: F1 scores for each dependency:
1096
- Note that unlabeled attachment errors hurt the labeled attachment scores
1097
- acl: p 0.2824 r 0.0380 f1 0.0670 (631 actual)
1098
- acl:cleft: p 0.0000 r 0.0000 f1 0.0000 (64 actual)
1099
- acl:relcl: p 0.2273 r 0.1823 f1 0.2023 (1289 actual)
1100
- advcl: p 0.2257 r 0.1474 f1 0.1783 (2049 actual)
1101
- advcl:relcl: p 0.0000 r 0.0000 f1 0.0000 (4 actual)
1102
- advmod: p 0.5858 r 0.6231 f1 0.6038 (7866 actual)
1103
- amod: p 0.7948 r 0.8487 f1 0.8209 (5256 actual)
1104
- appos: p 0.0000 r 0.0000 f1 0.0000 (618 actual)
1105
- aux: p 0.9035 r 0.8874 f1 0.8954 (2406 actual)
1106
- aux:pass: p 0.0000 r 0.0000 f1 0.0000 (54 actual)
1107
- case: p 0.8684 r 0.8702 f1 0.8693 (9319 actual)
1108
- cc: p 0.7477 r 0.7448 f1 0.7463 (4413 actual)
1109
- ccomp: p 0.0000 r 0.0000 f1 0.0000 (621 actual)
1110
- compound: p 0.0000 r 0.0000 f1 0.0000 (362 actual)
1111
- compound:prt: p 0.7844 r 0.7231 f1 0.7525 (986 actual)
1112
- conj: p 0.2328 r 0.3743 f1 0.2870 (5293 actual)
1113
- cop: p 0.7981 r 0.7790 f1 0.7884 (1507 actual)
1114
- csubj: p 0.0000 r 0.0000 f1 0.0000 (219 actual)
1115
- csubj:outer: p 0.0000 r 0.0000 f1 0.0000 (1 actual)
1116
- csubj:pass: p 0.0000 r 0.0000 f1 0.0000 (15 actual)
1117
- dep: p 0.0000 r 0.0000 f1 0.0000 (1 actual)
1118
- det: p 0.7903 r 0.8794 f1 0.8325 (4826 actual)
1119
- discourse: p 0.0000 r 0.0000 f1 0.0000 (299 actual)
1120
- dislocated: p 0.0000 r 0.0000 f1 0.0000 (108 actual)
1121
- expl: p 0.2916 r 0.3375 f1 0.3129 (400 actual)
1122
- fixed: p 0.0000 r 0.0000 f1 0.0000 (307 actual)
1123
- flat: p 0.0000 r 0.0000 f1 0.0000 (55 actual)
1124
- flat:name: p 0.3180 r 0.1119 f1 0.1655 (742 actual)
1125
- goeswith: p 0.0000 r 0.0000 f1 0.0000 (3 actual)
1126
- iobj: p 0.0392 r 0.0052 f1 0.0092 (382 actual)
1127
- list: p 0.0000 r 0.0000 f1 0.0000 (17 actual)
1128
- mark: p 0.7468 r 0.7708 f1 0.7586 (3914 actual)
1129
- nmod: p 0.3660 r 0.2679 f1 0.3093 (3378 actual)
1130
- nmod:poss: p 0.9007 r 0.8752 f1 0.8877 (3357 actual)
1131
- nsubj: p 0.5942 r 0.7026 f1 0.6438 (9260 actual)
1132
- nsubj:outer: p 0.0000 r 0.0000 f1 0.0000 (3 actual)
1133
- nsubj:pass: p 0.0000 r 0.0000 f1 0.0000 (696 actual)
1134
- nummod: p 0.6013 r 0.3069 f1 0.4064 (619 actual)
1135
- obj: p 0.5790 r 0.7581 f1 0.6566 (5407 actual)
1136
- obl: p 0.5289 r 0.5730 f1 0.5501 (6717 actual)
1137
- obl:agent: p 0.0000 r 0.0000 f1 0.0000 (75 actual)
1138
- orphan: p 0.0000 r 0.0000 f1 0.0000 (36 actual)
1139
- parataxis: p 0.1071 r 0.0030 f1 0.0059 (984 actual)
1140
- punct: p 0.5487 r 0.5510 f1 0.5499 (13881 actual)
1141
- root: p 0.7315 r 0.7315 f1 0.7315 (6634 actual)
1142
- vocative: p 0.0000 r 0.0000 f1 0.0000 (8 actual)
1143
- xcomp: p 0.4831 r 0.3220 f1 0.3864 (1466 actual)
1144
- 2025-12-01 17:48:59 INFO: LAS MLAS BLEX
1145
- 2025-12-01 17:48:59 INFO: 62.79 53.58 56.52
1146
- 2025-12-01 17:48:59 INFO: Parser score:
1147
- 2025-12-01 17:48:59 INFO: sv_diachronic 62.79
1148
- 2025-12-01 17:49:06 INFO: Finished running dev set on
1149
- UD_Swedish-diachronic
1150
- UAS LAS CLAS MLAS BLEX
1151
- 71.59 62.79 56.52 53.58 56.52
1152
- 2025-12-01 17:49:06 INFO: Running test depparse for UD_Swedish-diachronic with a
1153
- rgs ['--wordvec_dir', '/cephyr/users/cleland/Alvis/stanza_resources/sv/pretrain'
1154
- , '--eval_file', '/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/data/de
1155
- pparse/sv_diachronic.test.in.conllu', '--lang', 'sv', '--shorthand', 'sv_diachro
1156
- nic', '--mode', 'predict', '--charlm', '--charlm_shorthand', 'sv_conll17', '--ch
1157
- arlm_forward_file', '/cephyr/users/cleland/Alvis/stanza_resources/sv/forward_cha
1158
- rlm/conll17.pt', '--charlm_backward_file', '/cephyr/users/cleland/Alvis/stanza_r
1159
- esources/sv/backward_charlm/conll17.pt', '--wordvec_pretrain_file', '/cephyr/use
1160
- rs/cleland/Alvis/stanza_resources/sv/pretrain/diachronic.pt', '--batch_size', '3
1161
- 2', '--dropout', '0.33']
1162
- 2025-12-01 17:49:06 INFO: Running parser in predict mode
1163
- 2025-12-01 17:49:06 INFO: Loading model from: saved_models/depparse/sv_diachroni
1164
- c_charlm_parser.pt
1165
- 2025-12-01 17:49:09 DEBUG: Loaded pretrain from /cephyr/users/cleland/Alvis/stan
1166
- za_resources/sv/pretrain/diachronic.pt
1167
- 2025-12-01 17:49:09 DEBUG: Depparse model loading charmodels: /cephyr/users/clel
1168
- and/Alvis/stanza_resources/sv/forward_charlm/conll17.pt and /cephyr/users/clelan
1169
- d/Alvis/stanza_resources/sv/backward_charlm/conll17.pt
1170
- 2025-12-01 17:49:09 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
1171
- a_resources/sv/forward_charlm/conll17.pt
1172
- 2025-12-01 17:49:09 DEBUG: Loading charlm from /cephyr/users/cleland/Alvis/stanz
1173
- a_resources/sv/backward_charlm/conll17.pt
1174
- 2025-12-01 17:49:09 DEBUG: Building Adam with lr=0.003000, betas=(0.9, 0.95), ep
1175
- s=0.000001
1176
- 2025-12-01 17:49:09 INFO: Loading data with batch size 32...
1177
- 2025-12-01 17:49:09 DEBUG: 102 batches created.
1178
- Traceback (most recent call last):
1179
- File "<frozen runpy>", line 198, in _run_module_as_main
1180
- File "<frozen runpy>", line 88, in _run_code
1181
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/trai
1182
- ning/run_depparse.py", line 145, in <module>
1183
- main()
1184
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/trai
1185
- ning/run_depparse.py", line 142, in main
1186
- common.main(run_treebank, "depparse", "parser", add_depparse_args, sub_argpa
1187
- rse=parser.build_argparse(), build_model_filename=build_model_filename, choose_c
1188
- harlm_method=choose_depparse_charlm)
1189
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/trai
1190
- ning/common.py", line 198, in main
1191
- run_treebank(mode, paths, treebank, short_name, command_args, extra_args + s
1192
- ave_name_args)
1193
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/utils/trai
1194
- ning/run_depparse.py", line 129, in run_treebank
1195
- _, test_doc = parser.main(test_args)
1196
- ^^^^^^^^^^^^^^^^^^^^^^
1197
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/models/par
1198
- ser.py", line 157, in main
1199
- return evaluate(args)
1200
- ^^^^^^^^^^^^^^
1201
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/models/par
1202
- ser.py", line 396, in evaluate
1203
- return trainer, evaluate_trainer(args, trainer, pretrain)
1204
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1205
- File "/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil/stanza/models/par
1206
- ser.py", line 426, in evaluate_trainer
1207
- raise ValueError("Gold document {} has a None at sentence {} word {}\n{:C}".
1208
- format(args['eval_file'], sent_idx, word_idx, sentence))
1209
- ValueError: Gold document /mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil
1210
- /data/depparse/sv_diachronic.test.in.conllu has a None at sentence 33 word 14
1211
- # sent_id = 33
1212
- # text = Nu har jag fört hjelten i denna historia (sådan han ärligt är beskrifwe
1213
- n) till den dag då han blifwit millionär.
1214
- 1 Nu nu ADV AB _ 4 advmod _ _
1215
- 2 har ha AUX VB|PRS|AKT Mood=Ind|Tense=Pres|VerbForm=Fin
1216
- |Voice=Act 4 aux _ _
1217
- 3 jag jag PRON PN|UTR|SIN|DEF|SUB Case=Nom|Definite=Def|Ge
1218
- nder=Com|Number=Sing|PronType=Prs 4 nsubj _ _
1219
- 4 fört föra VERB VB|SUP|AKT VerbForm=Sup|Voice=Act 0 r
1220
- oot _ _
1221
- 5 hjelten hjelt NOUN NN|UTR|SIN|DEF|NOM Case=Nom|Definite=Def|Ge
1222
- nder=Com|Number=Sing 4 obj _ _
1223
- 6 i i ADP PP _ 8 case _ _
1224
- 7 denna denna DET DT|UTR|SIN|DEF Definite=Def|Gender=Com|Number=S
1225
- ing|PronType=Dem 8 det _ _
1226
- 8 historia historia NOUN NN|UTR|SIN|IND|NOM Case=Nom
1227
- |Definite=Ind|Gender=Com|Number=Sing 4 obl _ _
1228
- 9 ( ( PUNCT PAD _ 4 punct _ _
1229
- 10 sådan sådan ADJ JJ|POS|UTR|SIN|IND|NOM Case=Nom|Definite=Ind|De
1230
- gree=Pos|Gender=Com|Number=Sing 14 mark _ _
1231
- 11 han han PRON PN|UTR|SIN|DEF|SUB Case=Nom|Definite=Def|Ge
1232
- nder=Com|Number=Sing|PronType=Prs 14 nsubj _ _
1233
- 12 ärligt ärlig ADV AB|POS Degree=Pos 14 advmod _ _
1234
- 13 är vara AUX VB|PRS|AKT Mood=Ind|Tense=Pres|VerbForm=Fin
1235
- |Voice=Act 14 cop _ _
1236
- 14 beskrifwen beskrifw NOUN NN|UTR|SIN|IND|NOM Case=Nom
1237
- |Definite=Ind|Gender=Com|Number=Sing 4 advcl _ _
1238
- 15 ) ) PUNCT PAD _ 4 _ _ _
1239
- 16 till till ADP PP _ 18 case _ _
1240
- 17 den den DET DT|UTR|SIN|DEF Definite=Def|Gender=Com|Number=S
1241
- ing|PronType=Art 18 det _ _
1242
- 18 dag dag NOUN NN|UTR|SIN|IND|NOM Case=Nom|Definite=Ind|Ge
1243
- nder=Com|Number=Sing 4 obl _ _
1244
- 19 då då ADV HA _ 22 mark _ _
1245
- 20 han han PRON PN|UTR|SIN|DEF|SUB Case=Nom|Definite=Def|Ge
1246
- nder=Com|Number=Sing|PronType=Prs 22 nsubj _ _
1247
- 21 blifwit bli VERB VB|SUP|AKT VerbForm=Sup|Voice=Act 22 c
1248
- op _ _
1249
- 22 millionär millionär ADJ JJ|POS|UTR|SIN|IND|NOM Case=Nom
1250
- |Definite=Ind|Degree=Pos|Gender=Com|Number=Sing 18 acl _ _
1251
- 23 . . PUNCT MAD _ 4 punct _ _
1252
- (SyllaMBERT) tmux capture-pane -pS - -E - > pane_output.txt
1253
-
1254
-
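
Note on the traceback above: the test run aborts because the gold file contains a token without a dependency relation. In the dumped sentence 33, token 15 (`)`) has HEAD `4` but `_` in the DEPREL column, which appears to be what the error reports as "word 14" if that index is zero-based. A minimal sketch for finding such tokens before training follows; the function name `find_missing_deprels` and the inline sample are illustrative assumptions and are not part of the repository or of Stanza.

```python
import io
from typing import Iterable, List, Tuple


def find_missing_deprels(lines: Iterable[str]) -> List[Tuple[str, str, str]]:
    """Return (sent_id, token_id, form) for every token whose DEPREL is '_'."""
    problems = []
    sent_id = "?"
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("# sent_id ="):
            sent_id = line.split("=", 1)[1].strip()
            continue
        if not line or line.startswith("#"):
            continue  # blank separator or other comment line
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token line
        token_id, form, deprel = cols[0], cols[1], cols[7]
        # Multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        # legitimately carry '_' in DEPREL, so skip them.
        if "-" in token_id or "." in token_id:
            continue
        if deprel == "_":
            problems.append((sent_id, token_id, form))
    return problems


# Two tab-separated token lines modelled on sentence 33 in the log above.
sample = (
    "# sent_id = 33\n"
    "14\tbeskrifwen\tbeskrifw\tNOUN\tNN|UTR|SIN|IND|NOM\t_\t4\tadvcl\t_\t_\n"
    "15\t)\t)\tPUNCT\tPAD\t_\t4\t_\t_\t_\n"
)
print(find_missing_deprels(io.StringIO(sample)))
```

To check a real file, pass an open file handle instead of the sample, for example `open("data/depparse/sv_diachronic.test.in.conllu", encoding="utf-8")`.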
.ipynb_checkpoints/prepare-train-val-test-checkpoint.py CHANGED
@@ -8,8 +8,9 @@ from collections import defaultdict
8
  # ============================================================
9
  BASE = Path("/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil").resolve()
10
 
11
- SVENSKA_PROJEKT = BASE / "svenska-projekt-ud"
12
- NORSKA_PROJEKT = BASE / "norska-projekt-ud"
 
13
 
14
  DIGPHIL_MACHINE = BASE / "alanev_raw_files/diachron"
15
  DIGPHIL_GOLD = BASE / "alanev_raw_files/diachron-validated"
@@ -172,6 +173,7 @@ def clean_sentences(sentence_blocks):
172
  train_sentences = []
173
  train_sentences.extend(load_from_treebank_dir(SVENSKA_PROJEKT))
174
  train_sentences.extend(load_from_treebank_dir(NORSKA_PROJEKT))
 
175
 
176
 
177
  # ============================================================
 
8
  # ============================================================
9
  BASE = Path("/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil").resolve()
10
 
11
+ SVENSKA_PROJEKT = BASE / "ud-treebanks-sv"
12
+ NORSKA_PROJEKT = BASE / "ud-treebanks-bm"
13
+ DANSKA_PROJEKT = BASE / "ud-treebanks-dk"
14
 
15
  DIGPHIL_MACHINE = BASE / "alanev_raw_files/diachron"
16
  DIGPHIL_GOLD = BASE / "alanev_raw_files/diachron-validated"
 
173
  train_sentences = []
174
  train_sentences.extend(load_from_treebank_dir(SVENSKA_PROJEKT))
175
  train_sentences.extend(load_from_treebank_dir(NORSKA_PROJEKT))
176
+ train_sentences.extend(load_from_treebank_dir(DANSKA_PROJEKT))
177
 
178
 
179
  # ============================================================
README.md CHANGED
@@ -20,6 +20,8 @@ python -m stanza.utils.training.run_depparse UD_Swedish-diachronic --wordvec_pre
20
 
21
  ## Pretrained vectors
22
 
 
 
23
  I first converted the kubhist2 vectors from gensim fastText .ft format to a plain text file using gensim's Python package, then used Stanza's converter to produce a .pt file:
24
 
25
  ```
@@ -29,3 +31,10 @@ pt.load()
29
  ```
30
 
31
  The result is available in compressed form as `diachronic.pt.xz`.
 
 
 
 
 
 
 
 
20
 
21
  ## Pretrained vectors
22
 
23
+ We use the incremental vectors up to 1880 from Hengchen & Tahmasebi (2021).
24
+
25
  I first converted the kubhist2 vectors from gensim fastText .ft format to a plain text file using gensim's Python package, then used Stanza's converter to produce a .pt file:
26
 
27
  ```
 
31
  ```
32
 
33
  The result is available in compressed form as `diachronic.pt.xz`.
34
+
35
+ ## References
36
+
37
+ **Hengchen, Simon & Tahmasebi, Nina. (2021).**
38
+ *A collection of Swedish diachronic word embedding models trained on historical newspaper data.*
39
+ **Journal of Open Humanities Data**, 7(2), 1–7.
40
+ https://doi.org/10.5334/johd.22
data/depparse/sv_diachronic.dev.in.conllu CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b8e30e3fe48bd68b68fc7ddda2fcb41806052ced2ed94914213bed9cfbd2d45a
3
- size 19045
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3e187a488028eda24d6acbc32d15d75eb91d816ac0ab1e33479499ebe00c0948
3
+ size 30651
data/depparse/sv_diachronic.test.in.conllu CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:37efeffde7e61daecc96d3f1c15a4f1401c733553c0aadc9fac8c61e24e939de
3
- size 284086
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:562acbbc94d10e1dbbfb64a0678aee4c4fe9fa2d25d83ceb577fb1661ada19f3
3
+ size 272480
data/depparse/sv_diachronic.train.in.conllu CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f9f307243bcc164566957f06d840a2a0795f2318ece2bfb31cca58ad97d2159f
3
- size 108684317
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10cc5ba42ab7435c03b73588eb514007d54367d696078684de08a308fb5cf989
3
+ size 116586108
prepare-train-val-test.py CHANGED
@@ -8,8 +8,9 @@ from collections import defaultdict
8
  # ============================================================
9
  BASE = Path("/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil").resolve()
10
 
11
- SVENSKA_PROJEKT = BASE / "svenska-projekt-ud"
12
- NORSKA_PROJEKT = BASE / "norska-projekt-ud"
 
13
 
14
  DIGPHIL_MACHINE = BASE / "alanev_raw_files/diachron"
15
  DIGPHIL_GOLD = BASE / "alanev_raw_files/diachron-validated"
@@ -172,6 +173,7 @@ def clean_sentences(sentence_blocks):
172
  train_sentences = []
173
  train_sentences.extend(load_from_treebank_dir(SVENSKA_PROJEKT))
174
  train_sentences.extend(load_from_treebank_dir(NORSKA_PROJEKT))
 
175
 
176
 
177
  # ============================================================
 
8
  # ============================================================
9
  BASE = Path("/mimer/NOBACKUP/groups/dionysus/cleland/stanza-digphil").resolve()
10
 
11
+ SVENSKA_PROJEKT = BASE / "ud-treebanks-sv"
12
+ NORSKA_PROJEKT = BASE / "ud-treebanks-bm"
13
+ DANSKA_PROJEKT = BASE / "ud-treebanks-dk"
14
 
15
  DIGPHIL_MACHINE = BASE / "alanev_raw_files/diachron"
16
  DIGPHIL_GOLD = BASE / "alanev_raw_files/diachron-validated"
 
173
  train_sentences = []
174
  train_sentences.extend(load_from_treebank_dir(SVENSKA_PROJEKT))
175
  train_sentences.extend(load_from_treebank_dir(NORSKA_PROJEKT))
176
+ train_sentences.extend(load_from_treebank_dir(DANSKA_PROJEKT))
177
 
178
 
179
  # ============================================================
{saved_models_old/depparse → saved_models/depparse/conll17_bm}/sv_diachronic_charlm_parser.pt RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6acffa25e114e5e7981c58e60d0c6486ad90f9a3d962e563d353d55ccc50f367
3
- size 144779560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b0848c6832ca155fcfbf78040530aeb27598562cda430821afddbead91ea0b1
3
+ size 148183635
{saved_models_old/depparse → saved_models/depparse/conll17_bm}/sv_diachronic_charlm_parser_checkpoint.pt RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f488cfe7a6641469714df5bb50e80fa89f95c1b725ddb13ba9696d89d6483119
3
- size 433299171
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:567600f6876c4ceee884f04d0333538eb3cb24f6ec667112ff7e8088eab8cbd2
3
+ size 443081852
saved_models/depparse/sv_diachronic_charlm_parser.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b0848c6832ca155fcfbf78040530aeb27598562cda430821afddbead91ea0b1
3
- size 148183635
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87b032707188b5d53ed43f02af00b3d22f321b55e426400b0e8a35e2cf021bb9
3
+ size 150761755
saved_models/depparse/sv_diachronic_charlm_parser_checkpoint.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:567600f6876c4ceee884f04d0333538eb3cb24f6ec667112ff7e8088eab8cbd2
3
- size 443081852
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec3577e850a19917e89cd37d0ca98f929c7b5093c05901938fe424cc53966cab
3
+ size 450717113
stanza/__pycache__/__init__.cpython-312.pyc CHANGED
Binary files a/stanza/__pycache__/__init__.cpython-312.pyc and b/stanza/__pycache__/__init__.cpython-312.pyc differ
 
stanza/__pycache__/_version.cpython-312.pyc CHANGED
Binary files a/stanza/__pycache__/_version.cpython-312.pyc and b/stanza/__pycache__/_version.cpython-312.pyc differ
 
stanza/models/__pycache__/__init__.cpython-312.pyc CHANGED
Binary files a/stanza/models/__pycache__/__init__.cpython-312.pyc and b/stanza/models/__pycache__/__init__.cpython-312.pyc differ
 
stanza/models/__pycache__/_training_logging.cpython-312.pyc CHANGED
Binary files a/stanza/models/__pycache__/_training_logging.cpython-312.pyc and b/stanza/models/__pycache__/_training_logging.cpython-312.pyc differ
 
stanza/models/__pycache__/parser.cpython-312.pyc CHANGED
Binary files a/stanza/models/__pycache__/parser.cpython-312.pyc and b/stanza/models/__pycache__/parser.cpython-312.pyc differ
 
stanza/models/__pycache__/tagger.cpython-312.pyc CHANGED
Binary files a/stanza/models/__pycache__/tagger.cpython-312.pyc and b/stanza/models/__pycache__/tagger.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/__init__.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/__init__.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/__init__.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/base_classifier.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/base_classifier.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/base_classifier.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/cnn_classifier.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/cnn_classifier.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/cnn_classifier.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/config.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/config.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/config.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/constituency_classifier.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/constituency_classifier.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/constituency_classifier.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/data.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/data.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/data.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/trainer.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/trainer.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/trainer.cpython-312.pyc differ
 
stanza/models/classifiers/__pycache__/utils.cpython-312.pyc CHANGED
Binary files a/stanza/models/classifiers/__pycache__/utils.cpython-312.pyc and b/stanza/models/classifiers/__pycache__/utils.cpython-312.pyc differ
 
stanza/models/common/__pycache__/__init__.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/__init__.cpython-312.pyc and b/stanza/models/common/__pycache__/__init__.cpython-312.pyc differ
 
stanza/models/common/__pycache__/beam.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/beam.cpython-312.pyc and b/stanza/models/common/__pycache__/beam.cpython-312.pyc differ
 
stanza/models/common/__pycache__/bert_embedding.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/bert_embedding.cpython-312.pyc and b/stanza/models/common/__pycache__/bert_embedding.cpython-312.pyc differ
 
stanza/models/common/__pycache__/biaffine.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/biaffine.cpython-312.pyc and b/stanza/models/common/__pycache__/biaffine.cpython-312.pyc differ
 
stanza/models/common/__pycache__/char_model.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/char_model.cpython-312.pyc and b/stanza/models/common/__pycache__/char_model.cpython-312.pyc differ
 
stanza/models/common/__pycache__/chuliu_edmonds.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/chuliu_edmonds.cpython-312.pyc and b/stanza/models/common/__pycache__/chuliu_edmonds.cpython-312.pyc differ
 
stanza/models/common/__pycache__/constant.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/constant.cpython-312.pyc and b/stanza/models/common/__pycache__/constant.cpython-312.pyc differ
 
stanza/models/common/__pycache__/crf.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/crf.cpython-312.pyc and b/stanza/models/common/__pycache__/crf.cpython-312.pyc differ
 
stanza/models/common/__pycache__/data.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/data.cpython-312.pyc and b/stanza/models/common/__pycache__/data.cpython-312.pyc differ
 
stanza/models/common/__pycache__/doc.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/doc.cpython-312.pyc and b/stanza/models/common/__pycache__/doc.cpython-312.pyc differ
 
stanza/models/common/__pycache__/dropout.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/dropout.cpython-312.pyc and b/stanza/models/common/__pycache__/dropout.cpython-312.pyc differ
 
stanza/models/common/__pycache__/exceptions.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/exceptions.cpython-312.pyc and b/stanza/models/common/__pycache__/exceptions.cpython-312.pyc differ
 
stanza/models/common/__pycache__/foundation_cache.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/foundation_cache.cpython-312.pyc and b/stanza/models/common/__pycache__/foundation_cache.cpython-312.pyc differ
 
stanza/models/common/__pycache__/hlstm.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/hlstm.cpython-312.pyc and b/stanza/models/common/__pycache__/hlstm.cpython-312.pyc differ
 
stanza/models/common/__pycache__/loss.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/loss.cpython-312.pyc and b/stanza/models/common/__pycache__/loss.cpython-312.pyc differ
 
stanza/models/common/__pycache__/maxout_linear.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/maxout_linear.cpython-312.pyc and b/stanza/models/common/__pycache__/maxout_linear.cpython-312.pyc differ
 
stanza/models/common/__pycache__/packed_lstm.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/packed_lstm.cpython-312.pyc and b/stanza/models/common/__pycache__/packed_lstm.cpython-312.pyc differ
 
stanza/models/common/__pycache__/peft_config.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/peft_config.cpython-312.pyc and b/stanza/models/common/__pycache__/peft_config.cpython-312.pyc differ
 
stanza/models/common/__pycache__/pretrain.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/pretrain.cpython-312.pyc and b/stanza/models/common/__pycache__/pretrain.cpython-312.pyc differ
 
stanza/models/common/__pycache__/relative_attn.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/relative_attn.cpython-312.pyc and b/stanza/models/common/__pycache__/relative_attn.cpython-312.pyc differ
 
stanza/models/common/__pycache__/seq2seq_constant.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/seq2seq_constant.cpython-312.pyc and b/stanza/models/common/__pycache__/seq2seq_constant.cpython-312.pyc differ
 
stanza/models/common/__pycache__/seq2seq_model.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/seq2seq_model.cpython-312.pyc and b/stanza/models/common/__pycache__/seq2seq_model.cpython-312.pyc differ
 
stanza/models/common/__pycache__/seq2seq_modules.cpython-312.pyc CHANGED
Binary files a/stanza/models/common/__pycache__/seq2seq_modules.cpython-312.pyc and b/stanza/models/common/__pycache__/seq2seq_modules.cpython-312.pyc differ