mmaguero committed
Commit d08c49e · verified · 1 Parent(s): e5ac4bf

Upload folder using huggingface_hub

models/mpqa/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/model_vi_mpqa_roberta_6.train.log ADDED
@@ -0,0 +1,69 @@
+ 2025-10-28 03:25:38 INFO
+ ---------------------+-------------------------------
+ Param                | Value
+ ---------------------+-------------------------------
+ lr                   | 5e-05
+ mu                   | 0.9
+ nu                   | 0.999
+ eps                  | 1e-06
+ weight_decay         | 0.0
+ lr_rate              | 1
+ patience             | 30
+ update-steps         | 1
+ warmup               | 0.0
+ update_steps         | 1
+ mode                 | train
+ path                 | /home/marvin/structured-sentiment-analysis-bis/models/mpqa/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/model_vi_mpqa_roberta_6
+ device               | 1
+ seed                 | 1
+ threads              | 16
+ local_rank           | -1
+ feat                 | None
+ build                | True
+ checkpoint           | False
+ encoder              | bert
+ max_len              | None
+ buckets              | 32
+ train                | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/train.conllu
+ dev                  | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/dev.conllu
+ test                 | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp/test.conllu
+ embed                | data/glove.6B.100d.txt
+ unk                  | unk
+ n_embed              | 100
+ n_embed_proj         | 125
+ bert                 | roberta-base
+ inference            | mfvi
+ ---------------------+-------------------------------
+
+ 2025-10-28 03:25:38 INFO Building the fields
+ 2025-10-28 03:25:39 INFO CoNLL(
+   (words): SubwordField(pad=<pad>, unk=<unk>, bos=<s>)
+   (labels): ChartField()
+ )
+ 2025-10-28 03:25:39 INFO Building the model
+ 2025-10-28 03:25:40 INFO VISemanticDependencyModel(
+   (encoder): TransformerEmbedding(roberta-base, n_layers=4, n_out=768, stride=256, pooling=mean, pad_index=1, requires_grad=True)
+   (encoder_dropout): Dropout(p=0.33, inplace=False)
+   (edge_mlp_d): MLP(n_in=768, n_out=600, dropout=0.25)
+   (edge_mlp_h): MLP(n_in=768, n_out=600, dropout=0.25)
+   (label_mlp_d): MLP(n_in=768, n_out=600, dropout=0.33)
+   (label_mlp_h): MLP(n_in=768, n_out=600, dropout=0.33)
+   (edge_attn): Biaffine(n_in=600, bias_x=True, bias_y=True)
+   (label_attn): Biaffine(n_in=600, bias_x=True, bias_y=True)
+   (criterion): CrossEntropyLoss()
+   (pair_mlp_d): MLP(n_in=768, n_out=150, dropout=0.25)
+   (pair_mlp_h): MLP(n_in=768, n_out=150, dropout=0.25)
+   (pair_mlp_g): MLP(n_in=768, n_out=150, dropout=0.25)
+   (sib_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (cop_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (grd_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (inference): SemanticDependencyMFVI(max_iter=3)
+ )
+
+ 2025-10-28 03:25:40 INFO Loading the data
+ 2025-10-28 03:25:42 INFO
+ train: Dataset(n_sentences=1, n_batches=1, n_buckets=1)
+ dev:   Dataset(n_sentences=1, n_batches=1, n_buckets=1)
+ test:  Dataset(n_sentences=2112, n_batches=32, n_buckets=32)
+
+ 2025-10-28 03:25:43 INFO Epoch 1 / 5000:
models/mpqa/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/model_vi_mpqa_xlm_6.train.log ADDED
@@ -0,0 +1,69 @@
+ 2025-10-28 03:25:46 INFO
+ ---------------------+-------------------------------
+ Param                | Value
+ ---------------------+-------------------------------
+ lr                   | 5e-05
+ mu                   | 0.9
+ nu                   | 0.999
+ eps                  | 1e-06
+ weight_decay         | 0.0
+ lr_rate              | 1
+ patience             | 30
+ update-steps         | 1
+ warmup               | 0.0
+ update_steps         | 1
+ mode                 | train
+ path                 | /home/marvin/structured-sentiment-analysis-bis/models/mpqa/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/model_vi_mpqa_xlm_6
+ device               | 1
+ seed                 | 1
+ threads              | 16
+ local_rank           | -1
+ feat                 | None
+ build                | True
+ checkpoint           | False
+ encoder              | bert
+ max_len              | None
+ buckets              | 32
+ train                | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/train.conllu
+ dev                  | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp_byGenAI_Llama3.1_8b_Inst_desc/80/dev.conllu
+ test                 | /home/marvin/structured-sentiment-analysis-bis/sentiment_graphs/mpqa/head_final/sdp/test.conllu
+ embed                | data/glove.6B.100d.txt
+ unk                  | unk
+ n_embed              | 100
+ n_embed_proj         | 125
+ bert                 | xlm-roberta-base
+ inference            | mfvi
+ ---------------------+-------------------------------
+
+ 2025-10-28 03:25:46 INFO Building the fields
+ 2025-10-28 03:25:47 INFO CoNLL(
+   (words): SubwordField(pad=<pad>, unk=<unk>, bos=<s>)
+   (labels): ChartField()
+ )
+ 2025-10-28 03:25:47 INFO Building the model
+ 2025-10-28 03:25:49 INFO VISemanticDependencyModel(
+   (encoder): TransformerEmbedding(xlm-roberta-base, n_layers=4, n_out=768, stride=256, pooling=mean, pad_index=1, requires_grad=True)
+   (encoder_dropout): Dropout(p=0.33, inplace=False)
+   (edge_mlp_d): MLP(n_in=768, n_out=600, dropout=0.25)
+   (edge_mlp_h): MLP(n_in=768, n_out=600, dropout=0.25)
+   (label_mlp_d): MLP(n_in=768, n_out=600, dropout=0.33)
+   (label_mlp_h): MLP(n_in=768, n_out=600, dropout=0.33)
+   (edge_attn): Biaffine(n_in=600, bias_x=True, bias_y=True)
+   (label_attn): Biaffine(n_in=600, bias_x=True, bias_y=True)
+   (criterion): CrossEntropyLoss()
+   (pair_mlp_d): MLP(n_in=768, n_out=150, dropout=0.25)
+   (pair_mlp_h): MLP(n_in=768, n_out=150, dropout=0.25)
+   (pair_mlp_g): MLP(n_in=768, n_out=150, dropout=0.25)
+   (sib_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (cop_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (grd_attn): Triaffine(n_in=150, bias_x=True, bias_y=True)
+   (inference): SemanticDependencyMFVI(max_iter=3)
+ )
+
+ 2025-10-28 03:25:49 INFO Loading the data
+ 2025-10-28 03:25:52 INFO
+ train: Dataset(n_sentences=1, n_batches=1, n_buckets=1)
+ dev:   Dataset(n_sentences=1, n_batches=1, n_buckets=1)
+ test:  Dataset(n_sentences=2112, n_batches=32, n_buckets=32)
+
+ 2025-10-28 03:25:52 INFO Epoch 1 / 5000: