lgsilvaesilva commited on May 27

Commit

4c67f07

verified ·

1 Parent(s): 61c6633

Upload folder using huggingface_hub

Browse files

Files changed (36) hide show

README.md +70 -51
REPORT.md +39 -39
baselines/embedding-lightgbm/embedding-lightgbm.joblib +2 -2
baselines/embedding-lightgbm/test_predictions.csv +0 -0
baselines/embedding-lightgbm/validation_predictions.csv +0 -0
baselines/embedding-logistic/embedding-logistic.joblib +2 -2
baselines/embedding-logistic/test_predictions.csv +0 -0
baselines/embedding-logistic/validation_predictions.csv +0 -0
baselines/embedding-svm/embedding-svm.joblib +2 -2
baselines/embedding-svm/test_predictions.csv +0 -0
baselines/embedding-svm/validation_predictions.csv +0 -0
report.json +385 -385
transformer/checkpoint-1220/model.safetensors +1 -1
transformer/checkpoint-1220/optimizer.pt +1 -1
transformer/checkpoint-1220/scaler.pt +1 -1
transformer/checkpoint-1220/trainer_state.json +136 -136
transformer/checkpoint-1525/model.safetensors +1 -1
transformer/checkpoint-1525/optimizer.pt +1 -1
transformer/checkpoint-1525/scaler.pt +1 -1
transformer/checkpoint-1525/trainer_state.json +171 -171
transformer/checkpoint-305/model.safetensors +1 -1
transformer/checkpoint-305/optimizer.pt +1 -1
transformer/checkpoint-305/scaler.pt +1 -1
transformer/checkpoint-305/trainer_state.json +34 -34
transformer/checkpoint-610/model.safetensors +1 -1
transformer/checkpoint-610/optimizer.pt +1 -1
transformer/checkpoint-610/scaler.pt +1 -1
transformer/checkpoint-610/trainer_state.json +67 -67
transformer/checkpoint-915/model.safetensors +1 -1
transformer/checkpoint-915/optimizer.pt +1 -1
transformer/checkpoint-915/scaler.pt +1 -1
transformer/checkpoint-915/trainer_state.json +100 -100
transformer/config.json +5 -5
transformer/model.safetensors +1 -1
transformer/test_predictions.csv +0 -0
transformer/validation_predictions.csv +0 -0

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ It includes the Transformer model, any configured TF-IDF or sentence-embedding b
 - Text column: `chunk_text`
 - Label column: `label`
 - Transformer: `FacebookAI/xlm-roberta-base`
-- Generated at: `2026-05-26T17:46:00.691870+00:00`
 ## Dataset Summary
@@ -41,14 +41,14 @@ Validation metrics document threshold selection and tuning behavior; test metric
 | logistic_tfidf | 0.608 | 0.942 | 0.696 | 0.494 | 0.578 | 0.872 | 0.594 |
 | xgboost_tfidf | 0.500 | 0.945 | 0.931 | 0.342 | 0.500 | 0.823 | 0.588 |
 | xgboost_tfidf | 0.177 | 0.934 | 0.592 | 0.570 | 0.581 | 0.823 | 0.588 |
-| embedding-logistic_sentence_embeddings | 0.500 | 0.916 | 0.490 | 0.911 | 0.637 | 0.956 | 0.749 |
-| embedding-logistic_sentence_embeddings | 0.616 | 0.946 | 0.612 | 0.899 | 0.728 | 0.956 | 0.749 |
-| embedding-svm_sentence_embeddings | 0.500 | 0.957 | 0.803 | 0.620 | 0.700 | 0.958 | 0.743 |
-| embedding-svm_sentence_embeddings | 0.276 | 0.952 | 0.667 | 0.810 | 0.731 | 0.958 | 0.743 |
-| embedding-lightgbm_sentence_embeddings | 0.500 | 0.948 | 0.700 | 0.620 | 0.658 | 0.952 | 0.778 |
-| embedding-lightgbm_sentence_embeddings | 0.052 | 0.953 | 0.670 | 0.823 | 0.739 | 0.952 | 0.778 |
-| transformer | 0.500 | 0.973 | 0.812 | 0.873 | 0.841 | 0.971 | 0.836 |
-| transformer | 0.500 | 0.974 | 0.814 | 0.886 | 0.848 | 0.971 | 0.836 |
 ## Threshold Comparison on Test Split
@@ -58,14 +58,14 @@ Validation metrics document threshold selection and tuning behavior; test metric
 | logistic_tfidf | 0.608 | 0.930 | 0.902 | 0.411 | 0.564 | 0.899 | 0.726 |
 | xgboost_tfidf | 0.500 | 0.924 | 1.000 | 0.312 | 0.476 | 0.892 | 0.692 |
 | xgboost_tfidf | 0.177 | 0.918 | 0.663 | 0.527 | 0.587 | 0.892 | 0.692 |
-| embedding-logistic_sentence_embeddings | 0.500 | 0.899 | 0.524 | 0.866 | 0.653 | 0.952 | 0.759 |
-| embedding-logistic_sentence_embeddings | 0.616 | 0.929 | 0.632 | 0.857 | 0.727 | 0.952 | 0.759 |
-| embedding-svm_sentence_embeddings | 0.500 | 0.941 | 0.771 | 0.661 | 0.712 | 0.952 | 0.743 |
-| embedding-svm_sentence_embeddings | 0.276 | 0.935 | 0.667 | 0.821 | 0.736 | 0.952 | 0.743 |
-| embedding-lightgbm_sentence_embeddings | 0.500 | 0.946 | 0.788 | 0.696 | 0.739 | 0.959 | 0.801 |
-| embedding-lightgbm_sentence_embeddings | 0.052 | 0.933 | 0.657 | 0.821 | 0.730 | 0.959 | 0.801 |
-| transformer | 0.500 | 0.945 | 0.750 | 0.750 | 0.750 | 0.954 | 0.773 |
-| transformer | 0.500 | 0.945 | 0.750 | 0.750 | 0.750 | 0.954 | 0.773 |
 ## Confusion Matrices on Test Split
@@ -103,67 +103,67 @@ Rows are true labels and columns are predicted labels.
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 816 | 88 |
-| RELEVANT | 15 | 97 |
-### embedding-logistic_sentence_embeddings at threshold 0.616
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 848 | 56 |
-| RELEVANT | 16 | 96 |
 ### embedding-svm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
 | NOT_RELEVANT | 882 | 22 |
-| RELEVANT | 38 | 74 |
-### embedding-svm_sentence_embeddings at threshold 0.276
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 858 | 46 |
-| RELEVANT | 20 | 92 |
 ### embedding-lightgbm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 883 | 21 |
-| RELEVANT | 34 | 78 |
-### embedding-lightgbm_sentence_embeddings at threshold 0.052
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 856 | 48 |
 | RELEVANT | 20 | 92 |
 ### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 876 | 28 |
-| RELEVANT | 28 | 84 |
-### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 876 | 28 |
-| RELEVANT | 28 | 84 |
 ## Validation-Tuned Thresholds
 - `logistic_tfidf`: threshold `0.608` (validation F1 `0.578`); test F1 change vs 0.5: `-0.077`.
 - `xgboost_tfidf`: threshold `0.177` (validation F1 `0.581`); test F1 change vs 0.5: `+0.111`.
-- `embedding-logistic_sentence_embeddings`: threshold `0.616` (validation F1 `0.728`); test F1 change vs 0.5: `+0.074`.
-- `embedding-svm_sentence_embeddings`: threshold `0.276` (validation F1 `0.731`); test F1 change vs 0.5: `+0.024`.
-- `embedding-lightgbm_sentence_embeddings`: threshold `0.052` (validation F1 `0.739`); test F1 change vs 0.5: `-0.009`.
-- `transformer`: threshold `0.500` (validation F1 `0.848`); test F1 change vs 0.5: `+0.000`.
 ## Artifacts
@@ -179,7 +179,7 @@ Rows are true labels and columns are predicted labels.
 Install the runtime dependencies:
 ```bash
-pip install transformers torch huggingface_hub pandas joblib scikit-learn xgboost sentence-transformers lightgbm
 ```
 ### Transformer
@@ -188,7 +188,7 @@ pip install transformers torch huggingface_hub pandas joblib scikit-learn xgboos
 import torch
 from transformers import AutoModelForSequenceClassification, AutoTokenizer
-MODEL_ID = "YOUR_USERNAME/YOUR_MODEL_REPO"
 texts = [
     "Rice export prices increased after new procurement rules were announced.",
@@ -225,7 +225,7 @@ import json
 import joblib
 from huggingface_hub import hf_hub_download
-MODEL_ID = "YOUR_USERNAME/YOUR_MODEL_REPO"
 BASELINE = "logistic"
 texts = [
@@ -266,10 +266,11 @@ Available embedding baseline names in this run: "embedding-logistic", "embedding
 ```python
 import joblib
 from huggingface_hub import hf_hub_download
-from sentence_transformers import SentenceTransformer
-MODEL_ID = "YOUR_USERNAME/YOUR_MODEL_REPO"
 BASELINE = "embedding-logistic"
 texts = [
@@ -283,13 +284,31 @@ model_path = hf_hub_download(
     filename=f"baselines/{BASELINE}/{BASELINE}.joblib",
 )
 artifact = joblib.load(model_path)
-embedding_model = SentenceTransformer(artifact["embedding_model_name"])
-embeddings = embedding_model.encode(
-    texts,
-    batch_size=artifact.get("embedding_batch_size", 64),
-    convert_to_numpy=True,
-    normalize_embeddings=artifact.get("normalize_embeddings", True),
-)
 probabilities = artifact["classifier"].predict_proba(embeddings)[:, 1]
 threshold = artifact["validation_best_threshold"]["threshold"]

 - Text column: `chunk_text`
 - Label column: `label`
 - Transformer: `FacebookAI/xlm-roberta-base`
+- Generated at: `2026-05-27T10:50:45.867038+00:00`
 ## Dataset Summary
 | logistic_tfidf | 0.608 | 0.942 | 0.696 | 0.494 | 0.578 | 0.872 | 0.594 |
 | xgboost_tfidf | 0.500 | 0.945 | 0.931 | 0.342 | 0.500 | 0.823 | 0.588 |
 | xgboost_tfidf | 0.177 | 0.934 | 0.592 | 0.570 | 0.581 | 0.823 | 0.588 |
+| embedding-logistic_sentence_embeddings | 0.500 | 0.912 | 0.476 | 0.861 | 0.613 | 0.953 | 0.762 |
+| embedding-logistic_sentence_embeddings | 0.722 | 0.957 | 0.703 | 0.810 | 0.753 | 0.953 | 0.762 |
+| embedding-svm_sentence_embeddings | 0.500 | 0.955 | 0.807 | 0.582 | 0.676 | 0.952 | 0.754 |
+| embedding-svm_sentence_embeddings | 0.310 | 0.957 | 0.713 | 0.785 | 0.747 | 0.952 | 0.754 |
+| embedding-lightgbm_sentence_embeddings | 0.500 | 0.954 | 0.750 | 0.646 | 0.694 | 0.948 | 0.782 |
+| embedding-lightgbm_sentence_embeddings | 0.042 | 0.952 | 0.670 | 0.797 | 0.728 | 0.948 | 0.782 |
+| transformer | 0.500 | 0.970 | 0.798 | 0.848 | 0.822 | 0.966 | 0.854 |
+| transformer | 0.471 | 0.971 | 0.800 | 0.861 | 0.829 | 0.966 | 0.854 |
 ## Threshold Comparison on Test Split
 | logistic_tfidf | 0.608 | 0.930 | 0.902 | 0.411 | 0.564 | 0.899 | 0.726 |
 | xgboost_tfidf | 0.500 | 0.924 | 1.000 | 0.312 | 0.476 | 0.892 | 0.692 |
 | xgboost_tfidf | 0.177 | 0.918 | 0.663 | 0.527 | 0.587 | 0.892 | 0.692 |
+| embedding-logistic_sentence_embeddings | 0.500 | 0.891 | 0.503 | 0.884 | 0.641 | 0.955 | 0.710 |
+| embedding-logistic_sentence_embeddings | 0.722 | 0.935 | 0.689 | 0.750 | 0.718 | 0.955 | 0.710 |
+| embedding-svm_sentence_embeddings | 0.500 | 0.930 | 0.741 | 0.562 | 0.640 | 0.956 | 0.704 |
+| embedding-svm_sentence_embeddings | 0.310 | 0.934 | 0.686 | 0.741 | 0.712 | 0.956 | 0.704 |
+| embedding-lightgbm_sentence_embeddings | 0.500 | 0.937 | 0.740 | 0.661 | 0.698 | 0.960 | 0.791 |
+| embedding-lightgbm_sentence_embeddings | 0.042 | 0.929 | 0.639 | 0.821 | 0.719 | 0.960 | 0.791 |
+| transformer | 0.500 | 0.951 | 0.777 | 0.777 | 0.777 | 0.968 | 0.817 |
+| transformer | 0.471 | 0.950 | 0.770 | 0.777 | 0.773 | 0.968 | 0.817 |
 ## Confusion Matrices on Test Split
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 806 | 98 |
+| RELEVANT | 13 | 99 |
+### embedding-logistic_sentence_embeddings at threshold 0.722
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 866 | 38 |
+| RELEVANT | 28 | 84 |
 ### embedding-svm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
 | NOT_RELEVANT | 882 | 22 |
+| RELEVANT | 49 | 63 |
+### embedding-svm_sentence_embeddings at threshold 0.310
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 866 | 38 |
+| RELEVANT | 29 | 83 |
 ### embedding-lightgbm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 878 | 26 |
+| RELEVANT | 38 | 74 |
+### embedding-lightgbm_sentence_embeddings at threshold 0.042
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 852 | 52 |
 | RELEVANT | 20 | 92 |
 ### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 879 | 25 |
+| RELEVANT | 25 | 87 |
+### transformer at threshold 0.471
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 878 | 26 |
+| RELEVANT | 25 | 87 |
 ## Validation-Tuned Thresholds
 - `logistic_tfidf`: threshold `0.608` (validation F1 `0.578`); test F1 change vs 0.5: `-0.077`.
 - `xgboost_tfidf`: threshold `0.177` (validation F1 `0.581`); test F1 change vs 0.5: `+0.111`.
+- `embedding-logistic_sentence_embeddings`: threshold `0.722` (validation F1 `0.753`); test F1 change vs 0.5: `+0.077`.
+- `embedding-svm_sentence_embeddings`: threshold `0.310` (validation F1 `0.747`); test F1 change vs 0.5: `+0.073`.
+- `embedding-lightgbm_sentence_embeddings`: threshold `0.042` (validation F1 `0.728`); test F1 change vs 0.5: `+0.021`.
+- `transformer`: threshold `0.471` (validation F1 `0.829`); test F1 change vs 0.5: `-0.003`.
 ## Artifacts
 Install the runtime dependencies:
 ```bash
+pip install transformers torch huggingface_hub pandas joblib scikit-learn xgboost lightgbm
 ```
 ### Transformer
 import torch
 from transformers import AutoModelForSequenceClassification, AutoTokenizer
+MODEL_ID = "faodl/agri-utilization-classifier"
 texts = [
     "Rice export prices increased after new procurement rules were announced.",
 import joblib
 from huggingface_hub import hf_hub_download
+MODEL_ID = "faodl/agri-utilization-classifier"
 BASELINE = "logistic"
 texts = [
 ```python
 import joblib
+import torch
 from huggingface_hub import hf_hub_download
+from transformers import AutoModel, AutoTokenizer
+MODEL_ID = "faodl/agri-utilization-classifier"
 BASELINE = "embedding-logistic"
 texts = [
     filename=f"baselines/{BASELINE}/{BASELINE}.joblib",
 )
 artifact = joblib.load(model_path)
+tokenizer = AutoTokenizer.from_pretrained(artifact["embedding_model_name"])
+encoder = AutoModel.from_pretrained(artifact["embedding_model_name"])
+encoder.eval()
+encoded_batches = []
+batch_size = artifact.get("embedding_batch_size", 64)
+for start in range(0, len(texts), batch_size):
+    batch_texts = texts[start : start + batch_size]
+    inputs = tokenizer(
+        batch_texts,
+        padding=True,
+        truncation=True,
+        max_length=artifact.get("embedding_max_length", 256),
+        return_tensors="pt",
+    )
+    with torch.no_grad():
+        outputs = encoder(**inputs)
+    token_embeddings = outputs.last_hidden_state
+    attention_mask = inputs["attention_mask"].unsqueeze(-1).to(token_embeddings.dtype)
+    embeddings = (token_embeddings * attention_mask).sum(dim=1)
+    embeddings = embeddings / attention_mask.sum(dim=1).clamp(min=1e-9)
+    if artifact.get("normalize_embeddings", True):
+        embeddings = torch.nn.functional.normalize(embeddings, p=2, dim=1)
+    encoded_batches.append(embeddings)
+embeddings = torch.cat(encoded_batches).numpy()
 probabilities = artifact["classifier"].predict_proba(embeddings)[:, 1]
 threshold = artifact["validation_best_threshold"]["threshold"]

REPORT.md CHANGED Viewed

@@ -6,7 +6,7 @@
 - Text column: `chunk_text`
 - Label column: `label`
 - Transformer: `FacebookAI/xlm-roberta-base`
-- Generated at: `2026-05-26T17:46:00.691870+00:00`
 ## Dataset Summary
@@ -26,14 +26,14 @@ Validation metrics document threshold selection and tuning behavior; test metric
 | logistic_tfidf | 0.608 | 0.942 | 0.696 | 0.494 | 0.578 | 0.872 | 0.594 |
 | xgboost_tfidf | 0.500 | 0.945 | 0.931 | 0.342 | 0.500 | 0.823 | 0.588 |
 | xgboost_tfidf | 0.177 | 0.934 | 0.592 | 0.570 | 0.581 | 0.823 | 0.588 |
-| embedding-logistic_sentence_embeddings | 0.500 | 0.916 | 0.490 | 0.911 | 0.637 | 0.956 | 0.749 |
-| embedding-logistic_sentence_embeddings | 0.616 | 0.946 | 0.612 | 0.899 | 0.728 | 0.956 | 0.749 |
-| embedding-svm_sentence_embeddings | 0.500 | 0.957 | 0.803 | 0.620 | 0.700 | 0.958 | 0.743 |
-| embedding-svm_sentence_embeddings | 0.276 | 0.952 | 0.667 | 0.810 | 0.731 | 0.958 | 0.743 |
-| embedding-lightgbm_sentence_embeddings | 0.500 | 0.948 | 0.700 | 0.620 | 0.658 | 0.952 | 0.778 |
-| embedding-lightgbm_sentence_embeddings | 0.052 | 0.953 | 0.670 | 0.823 | 0.739 | 0.952 | 0.778 |
-| transformer | 0.500 | 0.973 | 0.812 | 0.873 | 0.841 | 0.971 | 0.836 |
-| transformer | 0.500 | 0.974 | 0.814 | 0.886 | 0.848 | 0.971 | 0.836 |
 ## Threshold Comparison on Test Split
@@ -43,14 +43,14 @@ Validation metrics document threshold selection and tuning behavior; test metric
 | logistic_tfidf | 0.608 | 0.930 | 0.902 | 0.411 | 0.564 | 0.899 | 0.726 |
 | xgboost_tfidf | 0.500 | 0.924 | 1.000 | 0.312 | 0.476 | 0.892 | 0.692 |
 | xgboost_tfidf | 0.177 | 0.918 | 0.663 | 0.527 | 0.587 | 0.892 | 0.692 |
-| embedding-logistic_sentence_embeddings | 0.500 | 0.899 | 0.524 | 0.866 | 0.653 | 0.952 | 0.759 |
-| embedding-logistic_sentence_embeddings | 0.616 | 0.929 | 0.632 | 0.857 | 0.727 | 0.952 | 0.759 |
-| embedding-svm_sentence_embeddings | 0.500 | 0.941 | 0.771 | 0.661 | 0.712 | 0.952 | 0.743 |
-| embedding-svm_sentence_embeddings | 0.276 | 0.935 | 0.667 | 0.821 | 0.736 | 0.952 | 0.743 |
-| embedding-lightgbm_sentence_embeddings | 0.500 | 0.946 | 0.788 | 0.696 | 0.739 | 0.959 | 0.801 |
-| embedding-lightgbm_sentence_embeddings | 0.052 | 0.933 | 0.657 | 0.821 | 0.730 | 0.959 | 0.801 |
-| transformer | 0.500 | 0.945 | 0.750 | 0.750 | 0.750 | 0.954 | 0.773 |
-| transformer | 0.500 | 0.945 | 0.750 | 0.750 | 0.750 | 0.954 | 0.773 |
 ## Confusion Matrices on Test Split
@@ -88,67 +88,67 @@ Rows are true labels and columns are predicted labels.
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 816 | 88 |
-| RELEVANT | 15 | 97 |
-### embedding-logistic_sentence_embeddings at threshold 0.616
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 848 | 56 |
-| RELEVANT | 16 | 96 |
 ### embedding-svm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
 | NOT_RELEVANT | 882 | 22 |
-| RELEVANT | 38 | 74 |
-### embedding-svm_sentence_embeddings at threshold 0.276
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 858 | 46 |
-| RELEVANT | 20 | 92 |
 ### embedding-lightgbm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 883 | 21 |
-| RELEVANT | 34 | 78 |
-### embedding-lightgbm_sentence_embeddings at threshold 0.052
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 856 | 48 |
 | RELEVANT | 20 | 92 |
 ### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 876 | 28 |
-| RELEVANT | 28 | 84 |
-### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
-| NOT_RELEVANT | 876 | 28 |
-| RELEVANT | 28 | 84 |
 ## Validation-Tuned Thresholds
 - `logistic_tfidf`: threshold `0.608` (validation F1 `0.578`); test F1 change vs 0.5: `-0.077`.
 - `xgboost_tfidf`: threshold `0.177` (validation F1 `0.581`); test F1 change vs 0.5: `+0.111`.
-- `embedding-logistic_sentence_embeddings`: threshold `0.616` (validation F1 `0.728`); test F1 change vs 0.5: `+0.074`.
-- `embedding-svm_sentence_embeddings`: threshold `0.276` (validation F1 `0.731`); test F1 change vs 0.5: `+0.024`.
-- `embedding-lightgbm_sentence_embeddings`: threshold `0.052` (validation F1 `0.739`); test F1 change vs 0.5: `-0.009`.
-- `transformer`: threshold `0.500` (validation F1 `0.848`); test F1 change vs 0.5: `+0.000`.
 ## Artifacts

 - Text column: `chunk_text`
 - Label column: `label`
 - Transformer: `FacebookAI/xlm-roberta-base`
+- Generated at: `2026-05-27T10:50:45.867038+00:00`
 ## Dataset Summary
 | logistic_tfidf | 0.608 | 0.942 | 0.696 | 0.494 | 0.578 | 0.872 | 0.594 |
 | xgboost_tfidf | 0.500 | 0.945 | 0.931 | 0.342 | 0.500 | 0.823 | 0.588 |
 | xgboost_tfidf | 0.177 | 0.934 | 0.592 | 0.570 | 0.581 | 0.823 | 0.588 |
+| embedding-logistic_sentence_embeddings | 0.500 | 0.912 | 0.476 | 0.861 | 0.613 | 0.953 | 0.762 |
+| embedding-logistic_sentence_embeddings | 0.722 | 0.957 | 0.703 | 0.810 | 0.753 | 0.953 | 0.762 |
+| embedding-svm_sentence_embeddings | 0.500 | 0.955 | 0.807 | 0.582 | 0.676 | 0.952 | 0.754 |
+| embedding-svm_sentence_embeddings | 0.310 | 0.957 | 0.713 | 0.785 | 0.747 | 0.952 | 0.754 |
+| embedding-lightgbm_sentence_embeddings | 0.500 | 0.954 | 0.750 | 0.646 | 0.694 | 0.948 | 0.782 |
+| embedding-lightgbm_sentence_embeddings | 0.042 | 0.952 | 0.670 | 0.797 | 0.728 | 0.948 | 0.782 |
+| transformer | 0.500 | 0.970 | 0.798 | 0.848 | 0.822 | 0.966 | 0.854 |
+| transformer | 0.471 | 0.971 | 0.800 | 0.861 | 0.829 | 0.966 | 0.854 |
 ## Threshold Comparison on Test Split
 | logistic_tfidf | 0.608 | 0.930 | 0.902 | 0.411 | 0.564 | 0.899 | 0.726 |
 | xgboost_tfidf | 0.500 | 0.924 | 1.000 | 0.312 | 0.476 | 0.892 | 0.692 |
 | xgboost_tfidf | 0.177 | 0.918 | 0.663 | 0.527 | 0.587 | 0.892 | 0.692 |
+| embedding-logistic_sentence_embeddings | 0.500 | 0.891 | 0.503 | 0.884 | 0.641 | 0.955 | 0.710 |
+| embedding-logistic_sentence_embeddings | 0.722 | 0.935 | 0.689 | 0.750 | 0.718 | 0.955 | 0.710 |
+| embedding-svm_sentence_embeddings | 0.500 | 0.930 | 0.741 | 0.562 | 0.640 | 0.956 | 0.704 |
+| embedding-svm_sentence_embeddings | 0.310 | 0.934 | 0.686 | 0.741 | 0.712 | 0.956 | 0.704 |
+| embedding-lightgbm_sentence_embeddings | 0.500 | 0.937 | 0.740 | 0.661 | 0.698 | 0.960 | 0.791 |
+| embedding-lightgbm_sentence_embeddings | 0.042 | 0.929 | 0.639 | 0.821 | 0.719 | 0.960 | 0.791 |
+| transformer | 0.500 | 0.951 | 0.777 | 0.777 | 0.777 | 0.968 | 0.817 |
+| transformer | 0.471 | 0.950 | 0.770 | 0.777 | 0.773 | 0.968 | 0.817 |
 ## Confusion Matrices on Test Split
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 806 | 98 |
+| RELEVANT | 13 | 99 |
+### embedding-logistic_sentence_embeddings at threshold 0.722
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 866 | 38 |
+| RELEVANT | 28 | 84 |
 ### embedding-svm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
 | NOT_RELEVANT | 882 | 22 |
+| RELEVANT | 49 | 63 |
+### embedding-svm_sentence_embeddings at threshold 0.310
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 866 | 38 |
+| RELEVANT | 29 | 83 |
 ### embedding-lightgbm_sentence_embeddings at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 878 | 26 |
+| RELEVANT | 38 | 74 |
+### embedding-lightgbm_sentence_embeddings at threshold 0.042
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 852 | 52 |
 | RELEVANT | 20 | 92 |
 ### transformer at threshold 0.500
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 879 | 25 |
+| RELEVANT | 25 | 87 |
+### transformer at threshold 0.471
 | True / Predicted | NOT_RELEVANT | RELEVANT |
 | --- | ---: | ---: |
+| NOT_RELEVANT | 878 | 26 |
+| RELEVANT | 25 | 87 |
 ## Validation-Tuned Thresholds
 - `logistic_tfidf`: threshold `0.608` (validation F1 `0.578`); test F1 change vs 0.5: `-0.077`.
 - `xgboost_tfidf`: threshold `0.177` (validation F1 `0.581`); test F1 change vs 0.5: `+0.111`.
+- `embedding-logistic_sentence_embeddings`: threshold `0.722` (validation F1 `0.753`); test F1 change vs 0.5: `+0.077`.
+- `embedding-svm_sentence_embeddings`: threshold `0.310` (validation F1 `0.747`); test F1 change vs 0.5: `+0.073`.
+- `embedding-lightgbm_sentence_embeddings`: threshold `0.042` (validation F1 `0.728`); test F1 change vs 0.5: `+0.021`.
+- `transformer`: threshold `0.471` (validation F1 `0.829`); test F1 change vs 0.5: `-0.003`.
 ## Artifacts

baselines/embedding-lightgbm/embedding-lightgbm.joblib CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a14be333902e726d49155cf98ec689843edfa4320b39724da54a187bea078e8
-size 1467460

 version https://git-lfs.github.com/spec/v1
+oid sha256:02039c6ee8487042ae61343afc227ab7375bbfdb042e073232a995d2e4d57dd6
+size 1467646

baselines/embedding-lightgbm/test_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

baselines/embedding-lightgbm/validation_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

baselines/embedding-logistic/embedding-logistic.joblib CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:596282c69402bd7479f4057afeaeeec5cc81d9c13bede61569f3be96207798f0
-size 4287

 version https://git-lfs.github.com/spec/v1
+oid sha256:433846875da231d3a97fc0f6bfa5adc3a1c4edb548d9655dc98a07523b436207
+size 4361

baselines/embedding-logistic/test_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

baselines/embedding-logistic/validation_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

baselines/embedding-svm/embedding-svm.joblib CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4dcb68c9d78767b36ec44c943e7085a53ccbf4fc61e5568acaf2d3cf442f72e
-size 11696

 version https://git-lfs.github.com/spec/v1
+oid sha256:df3e6eaec015a205089efe2457d89d2ecacdf1661b8607ad60905ef318adc5f4
+size 11770

baselines/embedding-svm/test_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

baselines/embedding-svm/validation_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

report.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "created_at": "2026-05-26T17:46:00.691870+00:00",
   "config": {
     "hf_dataset": "faodl/amis-agri-utilization",
     "hf_subset": null,
@@ -38,8 +38,8 @@
     "embedding_batch_size": 64,
     "positive_label_name": "RELEVANT",
     "negative_label_name": "NOT_RELEVANT",
-    "push_to_hub": false,
-    "hub_model_id": null,
     "hub_private_repo": false
   },
   "dataset_summary": {
@@ -474,194 +474,194 @@
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-logistic",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-logistic/embedding-logistic.joblib",
       "validation_best_threshold": {
-        "threshold": 0.616087721531811,
-        "f1": 0.7282051282051282,
-        "precision": 0.6120689655172413,
-        "recall": 0.8987341772151899
       },
       "validation_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9161554192229039,
-        "precision": 0.4897959183673469,
-        "recall": 0.9113924050632911,
-        "f1": 0.6371681415929203,
         "confusion_matrix": [
           [
             824,
             75
           ],
           [
-            7,
-            72
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9915764139590855,
             "recall": 0.9165739710789766,
-            "f1-score": 0.9526011560693641,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.4897959183673469,
-            "recall": 0.9113924050632911,
-            "f1-score": 0.6371681415929203,
             "support": 79.0
           },
-          "accuracy": 0.9161554192229039,
           "macro avg": {
-            "precision": 0.7406861661632163,
-            "recall": 0.9139831880711339,
-            "f1-score": 0.7948846488311423,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9510440426382804,
-            "recall": 0.9161554192229039,
-            "f1-score": 0.9271213931413078,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9563227777699554,
-        "average_precision": 0.7488532716951917
       },
       "validation_optimal_threshold": {
-        "threshold": 0.616087721531811,
-        "accuracy": 0.9458077709611452,
-        "precision": 0.6120689655172413,
-        "recall": 0.8987341772151899,
-        "f1": 0.7282051282051282,
         "confusion_matrix": [
           [
-            854,
-            45
           ],
           [
-            8,
-            71
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9907192575406032,
-            "recall": 0.949944382647386,
-            "f1-score": 0.9699034639409426,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.6120689655172413,
-            "recall": 0.8987341772151899,
-            "f1-score": 0.7282051282051282,
             "support": 79.0
           },
-          "accuracy": 0.9458077709611452,
           "macro avg": {
-            "precision": 0.8013941115289223,
-            "recall": 0.924339279931288,
-            "f1-score": 0.8490542960730354,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9601329865080412,
-            "recall": 0.9458077709611452,
-            "f1-score": 0.9503797742444914,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9563227777699554,
-        "average_precision": 0.7488532716951917
       },
       "test_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.8986220472440944,
-        "precision": 0.5243243243243243,
-        "recall": 0.8660714285714286,
-        "f1": 0.6531986531986532,
         "confusion_matrix": [
           [
-            816,
-            88
           ],
           [
-            15,
-            97
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9819494584837545,
-            "recall": 0.9026548672566371,
-            "f1-score": 0.9406340057636887,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.5243243243243243,
-            "recall": 0.8660714285714286,
-            "f1-score": 0.6531986531986532,
             "support": 112.0
           },
-          "accuracy": 0.8986220472440944,
           "macro avg": {
-            "precision": 0.7531368914040394,
-            "recall": 0.8843631479140328,
-            "f1-score": 0.796916329481171,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9315025933008252,
-            "recall": 0.8986220472440944,
-            "f1-score": 0.9089482188667557,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9523842446270544,
-        "average_precision": 0.7588349048416645
       },
       "test_optimal_threshold": {
-        "threshold": 0.616087721531811,
-        "accuracy": 0.9291338582677166,
-        "precision": 0.631578947368421,
-        "recall": 0.8571428571428571,
-        "f1": 0.7272727272727273,
         "confusion_matrix": [
           [
-            848,
-            56
           ],
           [
-            16,
-            96
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9814814814814815,
-            "recall": 0.9380530973451328,
-            "f1-score": 0.9592760180995475,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.631578947368421,
-            "recall": 0.8571428571428571,
-            "f1-score": 0.7272727272727273,
             "support": 112.0
           },
-          "accuracy": 0.9291338582677166,
           "macro avg": {
-            "precision": 0.8065302144249513,
-            "recall": 0.8975979772439949,
-            "f1-score": 0.8432743726861374,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9429095485871283,
-            "recall": 0.9291338582677166,
-            "f1-score": 0.9337008521816303,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9523842446270544,
-        "average_precision": 0.7588349048416645
       }
     },
     {
@@ -671,194 +671,194 @@
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-svm",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-svm/embedding-svm.joblib",
       "validation_best_threshold": {
-        "threshold": 0.27629376276966117,
-        "f1": 0.7314285714285714,
-        "precision": 0.6666666666666666,
-        "recall": 0.810126582278481
       },
       "validation_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9570552147239264,
-        "precision": 0.8032786885245902,
-        "recall": 0.620253164556962,
-        "f1": 0.7,
         "confusion_matrix": [
           [
-            887,
-            12
           ],
           [
-            30,
-            49
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9672846237731734,
-            "recall": 0.9866518353726362,
-            "f1-score": 0.9768722466960352,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.8032786885245902,
-            "recall": 0.620253164556962,
-            "f1-score": 0.7,
             "support": 79.0
           },
-          "accuracy": 0.9570552147239264,
           "macro avg": {
-            "precision": 0.8852816561488818,
-            "recall": 0.8034524999647992,
-            "f1-score": 0.8384361233480175,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9540367005782469,
-            "recall": 0.9570552147239264,
-            "f1-score": 0.9545073106132266,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9584911505047804,
-        "average_precision": 0.7425325495012566
       },
       "validation_optimal_threshold": {
-        "threshold": 0.27629376276966117,
-        "accuracy": 0.9519427402862985,
-        "precision": 0.6666666666666666,
-        "recall": 0.810126582278481,
-        "f1": 0.7314285714285714,
         "confusion_matrix": [
           [
-            867,
-            32
           ],
           [
-            15,
-            64
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9829931972789115,
-            "recall": 0.96440489432703,
-            "f1-score": 0.9736103312745649,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.6666666666666666,
-            "recall": 0.810126582278481,
-            "f1-score": 0.7314285714285714,
             "support": 79.0
           },
-          "accuracy": 0.9519427402862985,
           "macro avg": {
-            "precision": 0.8248299319727891,
-            "recall": 0.8872657383027556,
-            "f1-score": 0.8525194513515681,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9574412587120738,
-            "recall": 0.9519427402862985,
-            "f1-score": 0.9540475919823016,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9584911505047804,
-        "average_precision": 0.7425325495012566
       },
       "test_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9409448818897638,
-        "precision": 0.7708333333333334,
-        "recall": 0.6607142857142857,
-        "f1": 0.7115384615384616,
         "confusion_matrix": [
           [
             882,
             22
           ],
           [
-            38,
-            74
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9586956521739131,
             "recall": 0.9756637168141593,
-            "f1-score": 0.9671052631578947,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.7708333333333334,
-            "recall": 0.6607142857142857,
-            "f1-score": 0.7115384615384616,
             "support": 112.0
           },
-          "accuracy": 0.9409448818897638,
           "macro avg": {
-            "precision": 0.8647644927536232,
-            "recall": 0.8181890012642226,
-            "f1-score": 0.8393218623481782,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9379864201757389,
-            "recall": 0.9409448818897638,
-            "f1-score": 0.9389325448691382,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9517817635903919,
-        "average_precision": 0.743247391124005
       },
       "test_optimal_threshold": {
-        "threshold": 0.27629376276966117,
-        "accuracy": 0.9350393700787402,
-        "precision": 0.6666666666666666,
-        "recall": 0.8214285714285714,
-        "f1": 0.736,
         "confusion_matrix": [
           [
-            858,
-            46
           ],
           [
-            20,
-            92
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9772209567198178,
-            "recall": 0.9491150442477876,
-            "f1-score": 0.9629629629629629,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.6666666666666666,
-            "recall": 0.8214285714285714,
-            "f1-score": 0.736,
             "support": 112.0
           },
-          "accuracy": 0.9350393700787402,
           "macro avg": {
-            "precision": 0.8219438116932423,
-            "recall": 0.8852718078381795,
-            "f1-score": 0.8494814814814815,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9429866255328562,
-            "recall": 0.9350393700787402,
-            "f1-score": 0.9379434237386993,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9517817635903919,
-        "average_precision": 0.743247391124005
       }
     },
     {
@@ -868,159 +868,159 @@
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-lightgbm",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-lightgbm/embedding-lightgbm.joblib",
       "validation_best_threshold": {
-        "threshold": 0.05244099185733503,
-        "f1": 0.7386363636363636,
-        "precision": 0.6701030927835051,
-        "recall": 0.8227848101265823
       },
       "validation_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9478527607361963,
-        "precision": 0.7,
-        "recall": 0.620253164556962,
-        "f1": 0.6577181208053692,
         "confusion_matrix": [
           [
-            878,
-            21
           ],
           [
-            30,
-            49
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9669603524229075,
-            "recall": 0.9766407119021134,
-            "f1-score": 0.9717764250138351,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.7,
-            "recall": 0.620253164556962,
-            "f1-score": 0.6577181208053692,
             "support": 79.0
           },
-          "accuracy": 0.9478527607361963,
           "macro avg": {
-            "precision": 0.8334801762114536,
-            "recall": 0.7984469382295377,
-            "f1-score": 0.8147472729096021,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9453960703764762,
-            "recall": 0.9478527607361963,
-            "f1-score": 0.9464077071892248,
             "support": 978.0
           }
         },
-        "roc_auc": 0.952112755382211,
-        "average_precision": 0.777786126005225
       },
       "validation_optimal_threshold": {
-        "threshold": 0.05244099185733503,
-        "accuracy": 0.9529652351738241,
-        "precision": 0.6701030927835051,
-        "recall": 0.8227848101265823,
-        "f1": 0.7386363636363636,
         "confusion_matrix": [
           [
-            867,
-            32
           ],
           [
-            14,
-            65
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9841089670828603,
-            "recall": 0.96440489432703,
-            "f1-score": 0.9741573033707865,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.6701030927835051,
-            "recall": 0.8227848101265823,
-            "f1-score": 0.7386363636363636,
             "support": 79.0
           },
-          "accuracy": 0.9529652351738241,
           "macro avg": {
-            "precision": 0.8271060299331827,
-            "recall": 0.8935948522268062,
-            "f1-score": 0.8563968335035751,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9587444843940576,
-            "recall": 0.9529652351738241,
-            "f1-score": 0.9551326057848771,
             "support": 978.0
           }
         },
-        "roc_auc": 0.952112755382211,
-        "average_precision": 0.777786126005225
       },
       "test_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9458661417322834,
-        "precision": 0.7878787878787878,
-        "recall": 0.6964285714285714,
-        "f1": 0.7393364928909952,
         "confusion_matrix": [
           [
-            883,
-            21
           ],
           [
-            34,
-            78
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9629225736095965,
-            "recall": 0.9767699115044248,
-            "f1-score": 0.9697968149368479,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.7878787878787878,
-            "recall": 0.6964285714285714,
-            "f1-score": 0.7393364928909952,
             "support": 112.0
           },
-          "accuracy": 0.9458661417322834,
           "macro avg": {
-            "precision": 0.8754006807441922,
-            "recall": 0.8365992414664981,
-            "f1-score": 0.8545666539139216,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9436264082534445,
-            "recall": 0.9458661417322834,
-            "f1-score": 0.9443917400656515,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9585078223767383,
-        "average_precision": 0.8011064601086128
       },
       "test_optimal_threshold": {
-        "threshold": 0.05244099185733503,
-        "accuracy": 0.9330708661417323,
-        "precision": 0.6571428571428571,
         "recall": 0.8214285714285714,
-        "f1": 0.7301587301587301,
         "confusion_matrix": [
           [
-            856,
-            48
           ],
           [
             20,
@@ -1029,33 +1029,33 @@
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9771689497716894,
-            "recall": 0.9469026548672567,
-            "f1-score": 0.9617977528089887,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.6571428571428571,
             "recall": 0.8214285714285714,
-            "f1-score": 0.7301587301587301,
             "support": 112.0
           },
-          "accuracy": 0.9330708661417323,
           "macro avg": {
-            "precision": 0.8171559034572733,
-            "recall": 0.8841656131479141,
-            "f1-score": 0.8459782414838595,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9418904828677237,
-            "recall": 0.9330708661417323,
-            "f1-score": 0.936262742438094,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9585078223767383,
-        "average_precision": 0.8011064601086128
       }
     },
     {
@@ -1063,194 +1063,194 @@
       "model_name": "FacebookAI/xlm-roberta-base",
       "artifact_dir": "/content/agri-utilization-classifier/transformer",
       "validation_best_threshold": {
-        "threshold": 0.4999122619628906,
-        "f1": 0.8484848484848485,
-        "precision": 0.813953488372093,
-        "recall": 0.8860759493670886
       },
       "validation_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9734151329243353,
-        "precision": 0.8117647058823529,
-        "recall": 0.8734177215189873,
-        "f1": 0.8414634146341463,
         "confusion_matrix": [
           [
-            883,
-            16
           ],
           [
-            10,
-            69
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9888017917133258,
-            "recall": 0.982202447163515,
-            "f1-score": 0.9854910714285714,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.8117647058823529,
-            "recall": 0.8734177215189873,
-            "f1-score": 0.8414634146341463,
             "support": 79.0
           },
-          "accuracy": 0.9734151329243353,
           "macro avg": {
-            "precision": 0.9002832487978394,
-            "recall": 0.9278100843412511,
-            "f1-score": 0.9134772430313589,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.974501250015323,
-            "recall": 0.9734151329243353,
-            "f1-score": 0.9738569355525392,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9707692091071654,
-        "average_precision": 0.836048392061997
       },
       "validation_optimal_threshold": {
-        "threshold": 0.4999122619628906,
-        "accuracy": 0.9744376278118609,
-        "precision": 0.813953488372093,
-        "recall": 0.8860759493670886,
-        "f1": 0.8484848484848485,
         "confusion_matrix": [
           [
-            883,
-            16
           ],
           [
-            9,
-            70
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9899103139013453,
-            "recall": 0.982202447163515,
-            "f1-score": 0.9860413176996091,
             "support": 899.0
           },
           "RELEVANT": {
-            "precision": 0.813953488372093,
-            "recall": 0.8860759493670886,
-            "f1-score": 0.8484848484848485,
             "support": 79.0
           },
-          "accuracy": 0.9744376278118609,
           "macro avg": {
-            "precision": 0.9019319011367192,
-            "recall": 0.9341391982653018,
-            "f1-score": 0.9172630830922288,
             "support": 978.0
           },
           "weighted avg": {
-            "precision": 0.9756970324935632,
-            "recall": 0.9744376278118609,
-            "f1-score": 0.9749299055646745,
             "support": 978.0
           }
         },
-        "roc_auc": 0.9707692091071654,
-        "average_precision": 0.836048392061997
       },
       "test_default_0_5": {
         "threshold": 0.5,
-        "accuracy": 0.9448818897637795,
-        "precision": 0.75,
-        "recall": 0.75,
-        "f1": 0.75,
         "confusion_matrix": [
           [
-            876,
-            28
           ],
           [
-            28,
-            84
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9690265486725663,
-            "recall": 0.9690265486725663,
-            "f1-score": 0.9690265486725663,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.75,
-            "recall": 0.75,
-            "f1-score": 0.75,
             "support": 112.0
           },
-          "accuracy": 0.9448818897637795,
           "macro avg": {
-            "precision": 0.8595132743362832,
-            "recall": 0.8595132743362832,
-            "f1-score": 0.8595132743362832,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9448818897637795,
-            "recall": 0.9448818897637795,
-            "f1-score": 0.9448818897637795,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9541373656763591,
-        "average_precision": 0.7726393350690168
       },
       "test_optimal_threshold": {
-        "threshold": 0.4999122619628906,
-        "accuracy": 0.9448818897637795,
-        "precision": 0.75,
-        "recall": 0.75,
-        "f1": 0.75,
         "confusion_matrix": [
           [
-            876,
-            28
           ],
           [
-            28,
-            84
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
-            "precision": 0.9690265486725663,
-            "recall": 0.9690265486725663,
-            "f1-score": 0.9690265486725663,
             "support": 904.0
           },
           "RELEVANT": {
-            "precision": 0.75,
-            "recall": 0.75,
-            "f1-score": 0.75,
             "support": 112.0
           },
-          "accuracy": 0.9448818897637795,
           "macro avg": {
-            "precision": 0.8595132743362832,
-            "recall": 0.8595132743362832,
-            "f1-score": 0.8595132743362832,
             "support": 1016.0
           },
           "weighted avg": {
-            "precision": 0.9448818897637795,
-            "recall": 0.9448818897637795,
-            "f1-score": 0.9448818897637795,
             "support": 1016.0
           }
         },
-        "roc_auc": 0.9541373656763591,
-        "average_precision": 0.7726393350690168
       }
     }
   ]

 {
+  "created_at": "2026-05-27T10:50:45.867038+00:00",
   "config": {
     "hf_dataset": "faodl/amis-agri-utilization",
     "hf_subset": null,
     "embedding_batch_size": 64,
     "positive_label_name": "RELEVANT",
     "negative_label_name": "NOT_RELEVANT",
+    "push_to_hub": true,
+    "hub_model_id": "faodl/agri-utilization-classifier",
     "hub_private_repo": false
   },
   "dataset_summary": {
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-logistic",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-logistic/embedding-logistic.joblib",
       "validation_best_threshold": {
+        "threshold": 0.7220406191151401,
+        "f1": 0.7529411764705883,
+        "precision": 0.7032967032967034,
+        "recall": 0.810126582278481
       },
       "validation_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9120654396728016,
+        "precision": 0.4755244755244755,
+        "recall": 0.8607594936708861,
+        "f1": 0.6126126126126126,
         "confusion_matrix": [
           [
             824,
             75
           ],
           [
+            11,
+            68
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9868263473053892,
             "recall": 0.9165739710789766,
+            "f1-score": 0.9504036908881199,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.4755244755244755,
+            "recall": 0.8607594936708861,
+            "f1-score": 0.6126126126126126,
             "support": 79.0
           },
+          "accuracy": 0.9120654396728016,
           "macro avg": {
+            "precision": 0.7311754114149324,
+            "recall": 0.8886667323749313,
+            "f1-score": 0.7815081517503663,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9455248668650087,
+            "recall": 0.9120654396728016,
+            "f1-score": 0.9231179084916321,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9525633263400967,
+        "average_precision": 0.7622834015915168
       },
       "validation_optimal_threshold": {
+        "threshold": 0.7220406191151401,
+        "accuracy": 0.9570552147239264,
+        "precision": 0.7032967032967034,
+        "recall": 0.810126582278481,
+        "f1": 0.7529411764705882,
         "confusion_matrix": [
           [
+            872,
+            27
           ],
           [
+            15,
+            64
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9830890642615558,
+            "recall": 0.9699666295884316,
+            "f1-score": 0.9764837625979843,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.7032967032967034,
+            "recall": 0.810126582278481,
+            "f1-score": 0.7529411764705882,
             "support": 79.0
           },
+          "accuracy": 0.9570552147239264,
           "macro avg": {
+            "precision": 0.8431928837791296,
+            "recall": 0.8900466059334563,
+            "f1-score": 0.8647124695342863,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9604882498277897,
+            "recall": 0.9570552147239264,
+            "f1-score": 0.9584266416326834,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9525633263400967,
+        "average_precision": 0.7622834015915168
       },
       "test_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.890748031496063,
+        "precision": 0.5025380710659898,
+        "recall": 0.8839285714285714,
+        "f1": 0.6407766990291263,
         "confusion_matrix": [
           [
+            806,
+            98
           ],
           [
+            13,
+            99
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9841269841269841,
+            "recall": 0.8915929203539823,
+            "f1-score": 0.9355774811375508,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.5025380710659898,
+            "recall": 0.8839285714285714,
+            "f1-score": 0.6407766990291263,
             "support": 112.0
           },
+          "accuracy": 0.890748031496063,
           "macro avg": {
+            "precision": 0.7433325275964869,
+            "recall": 0.8877607458912768,
+            "f1-score": 0.7881770900833385,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9310384425297091,
+            "recall": 0.890748031496063,
+            "f1-score": 0.9030797571255984,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.955317635903919,
+        "average_precision": 0.7096184898069098
       },
       "test_optimal_threshold": {
+        "threshold": 0.7220406191151401,
+        "accuracy": 0.9350393700787402,
+        "precision": 0.6885245901639344,
+        "recall": 0.75,
+        "f1": 0.717948717948718,
         "confusion_matrix": [
           [
+            866,
+            38
           ],
           [
+            28,
+            84
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9686800894854586,
+            "recall": 0.9579646017699115,
+            "f1-score": 0.9632925472747497,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.6885245901639344,
+            "recall": 0.75,
+            "f1-score": 0.717948717948718,
             "support": 112.0
           },
+          "accuracy": 0.9350393700787402,
           "macro avg": {
+            "precision": 0.8286023398246964,
+            "recall": 0.8539823008849557,
+            "f1-score": 0.8406206326117338,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9377968060956843,
+            "recall": 0.9350393700787402,
+            "f1-score": 0.9362467708136123,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.955317635903919,
+        "average_precision": 0.7096184898069098
       }
     },
     {
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-svm",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-svm/embedding-svm.joblib",
       "validation_best_threshold": {
+        "threshold": 0.30975184413575924,
+        "f1": 0.746987951807229,
+        "precision": 0.7126436781609196,
+        "recall": 0.7848101265822784
       },
       "validation_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9550102249488752,
+        "precision": 0.8070175438596491,
+        "recall": 0.5822784810126582,
+        "f1": 0.6764705882352942,
         "confusion_matrix": [
           [
+            888,
+            11
           ],
           [
+            33,
+            46
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9641693811074918,
+            "recall": 0.9877641824249166,
+            "f1-score": 0.9758241758241758,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.8070175438596491,
+            "recall": 0.5822784810126582,
+            "f1-score": 0.6764705882352942,
             "support": 79.0
           },
+          "accuracy": 0.9550102249488752,
           "macro avg": {
+            "precision": 0.8855934624835704,
+            "recall": 0.7850213317187874,
+            "f1-score": 0.8261473820297349,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9514751120455496,
+            "recall": 0.9550102249488752,
+            "f1-score": 0.9516432623072826,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9524506836006251,
+        "average_precision": 0.7542419360138435
       },
       "validation_optimal_threshold": {
+        "threshold": 0.30975184413575924,
+        "accuracy": 0.9570552147239264,
+        "precision": 0.7126436781609196,
+        "recall": 0.7848101265822784,
+        "f1": 0.7469879518072289,
         "confusion_matrix": [
           [
+            874,
+            25
           ],
           [
+            17,
+            62
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9809203142536476,
+            "recall": 0.9721913236929922,
+            "f1-score": 0.976536312849162,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.7126436781609196,
+            "recall": 0.7848101265822784,
+            "f1-score": 0.7469879518072289,
             "support": 79.0
           },
+          "accuracy": 0.9570552147239264,
           "macro avg": {
+            "precision": 0.8467819962072836,
+            "recall": 0.8785007251376353,
+            "f1-score": 0.8617621323281954,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9592497066347054,
+            "recall": 0.9570552147239264,
+            "f1-score": 0.9579940628263474,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9524506836006251,
+        "average_precision": 0.7542419360138435
       },
       "test_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9301181102362205,
+        "precision": 0.7411764705882353,
+        "recall": 0.5625,
+        "f1": 0.6395939086294417,
         "confusion_matrix": [
           [
             882,
             22
           ],
           [
+            49,
+            63
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9473684210526315,
             "recall": 0.9756637168141593,
+            "f1-score": 0.9613079019073569,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.7411764705882353,
+            "recall": 0.5625,
+            "f1-score": 0.6395939086294417,
             "support": 112.0
           },
+          "accuracy": 0.9301181102362205,
           "macro avg": {
+            "precision": 0.8442724458204334,
+            "recall": 0.7690818584070797,
+            "f1-score": 0.8004509052683992,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9246385997415957,
+            "recall": 0.9301181102362205,
+            "f1-score": 0.9258433672153032,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9563744469026548,
+        "average_precision": 0.7035914186137721
       },
       "test_optimal_threshold": {
+        "threshold": 0.30975184413575924,
+        "accuracy": 0.9340551181102362,
+        "precision": 0.6859504132231405,
+        "recall": 0.7410714285714286,
+        "f1": 0.7124463519313304,
         "confusion_matrix": [
           [
+            866,
+            38
           ],
           [
+            29,
+            83
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9675977653631285,
+            "recall": 0.9579646017699115,
+            "f1-score": 0.962757087270706,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.6859504132231405,
+            "recall": 0.7410714285714286,
+            "f1-score": 0.7124463519313304,
             "support": 112.0
           },
+          "accuracy": 0.9340551181102362,
           "macro avg": {
+            "precision": 0.8267740892931346,
+            "recall": 0.84951801517067,
+            "f1-score": 0.8376017196010181,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9365500257571455,
+            "recall": 0.9340551181102362,
+            "f1-score": 0.9351637778632157,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9563744469026548,
+        "average_precision": 0.7035914186137721
       }
     },
     {
       "artifact_dir": "/content/agri-utilization-classifier/baselines/embedding-lightgbm",
       "artifact_file": "/content/agri-utilization-classifier/baselines/embedding-lightgbm/embedding-lightgbm.joblib",
       "validation_best_threshold": {
+        "threshold": 0.042041465431985434,
+        "f1": 0.7283236994219654,
+        "precision": 0.6702127659574468,
+        "recall": 0.7974683544303798
       },
       "validation_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9539877300613497,
+        "precision": 0.75,
+        "recall": 0.6455696202531646,
+        "f1": 0.6938775510204082,
         "confusion_matrix": [
           [
+            882,
+            17
           ],
           [
+            28,
+            51
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9692307692307692,
+            "recall": 0.9810901001112347,
+            "f1-score": 0.9751243781094527,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.75,
+            "recall": 0.6455696202531646,
+            "f1-score": 0.6938775510204082,
             "support": 79.0
           },
+          "accuracy": 0.9539877300613497,
           "macro avg": {
+            "precision": 0.8596153846153847,
+            "recall": 0.8133298601821997,
+            "f1-score": 0.8345009645649304,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9515219443133554,
+            "recall": 0.9539877300613497,
+            "f1-score": 0.9524060761257774,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9480716971036736,
+        "average_precision": 0.7818499996214695
       },
       "validation_optimal_threshold": {
+        "threshold": 0.042041465431985434,
+        "accuracy": 0.9519427402862985,
+        "precision": 0.6702127659574468,
+        "recall": 0.7974683544303798,
+        "f1": 0.7283236994219653,
         "confusion_matrix": [
           [
+            868,
+            31
           ],
           [
+            16,
+            63
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9819004524886877,
+            "recall": 0.9655172413793104,
+            "f1-score": 0.9736399326977005,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.6702127659574468,
+            "recall": 0.7974683544303798,
+            "f1-score": 0.7283236994219653,
             "support": 79.0
           },
+          "accuracy": 0.9519427402862985,
           "macro avg": {
+            "precision": 0.8260566092230672,
+            "recall": 0.881492797904845,
+            "f1-score": 0.8509818160598329,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9567232262760416,
+            "recall": 0.9519427402862985,
+            "f1-score": 0.9538239997439346,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9480716971036736,
+        "average_precision": 0.7818499996214695
       },
       "test_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.937007874015748,
+        "precision": 0.74,
+        "recall": 0.6607142857142857,
+        "f1": 0.6981132075471698,
         "confusion_matrix": [
           [
+            878,
+            26
           ],
           [
+            38,
+            74
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9585152838427947,
+            "recall": 0.9712389380530974,
+            "f1-score": 0.9648351648351648,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.74,
+            "recall": 0.6607142857142857,
+            "f1-score": 0.6981132075471698,
             "support": 112.0
           },
+          "accuracy": 0.937007874015748,
           "macro avg": {
+            "precision": 0.8492576419213973,
+            "recall": 0.8159766118836915,
+            "f1-score": 0.8314741861911673,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9344269848365024,
+            "recall": 0.937007874015748,
+            "f1-score": 0.9354327443467244,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9597819216182049,
+        "average_precision": 0.7911233572387708
       },
       "test_optimal_threshold": {
+        "threshold": 0.042041465431985434,
+        "accuracy": 0.9291338582677166,
+        "precision": 0.6388888888888888,
         "recall": 0.8214285714285714,
+        "f1": 0.71875,
         "confusion_matrix": [
           [
+            852,
+            52
           ],
           [
             20,
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9770642201834863,
+            "recall": 0.9424778761061947,
+            "f1-score": 0.9594594594594594,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.6388888888888888,
             "recall": 0.8214285714285714,
+            "f1-score": 0.71875,
             "support": 112.0
           },
+          "accuracy": 0.9291338582677166,
           "macro avg": {
+            "precision": 0.8079765545361876,
+            "recall": 0.881953223767383,
+            "f1-score": 0.8391047297297297,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9397850498045542,
+            "recall": 0.9291338582677166,
+            "f1-score": 0.9329245584166844,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9597819216182049,
+        "average_precision": 0.7911233572387708
       }
     },
     {
       "model_name": "FacebookAI/xlm-roberta-base",
       "artifact_dir": "/content/agri-utilization-classifier/transformer",
       "validation_best_threshold": {
+        "threshold": 0.4710787534713745,
+        "f1": 0.829268292682927,
+        "precision": 0.8,
+        "recall": 0.8607594936708861
       },
       "validation_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9703476482617587,
+        "precision": 0.7976190476190477,
+        "recall": 0.8481012658227848,
+        "f1": 0.8220858895705522,
         "confusion_matrix": [
           [
+            882,
+            17
           ],
           [
+            12,
+            67
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9865771812080537,
+            "recall": 0.9810901001112347,
+            "f1-score": 0.9838259899609593,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.7976190476190477,
+            "recall": 0.8481012658227848,
+            "f1-score": 0.8220858895705522,
             "support": 79.0
           },
+          "accuracy": 0.9703476482617587,
           "macro avg": {
+            "precision": 0.8920981144135507,
+            "recall": 0.9145956829670097,
+            "f1-score": 0.9029559397657557,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.9713136918895144,
+            "recall": 0.9703476482617587,
+            "f1-score": 0.9707610943261513,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9661086157615353,
+        "average_precision": 0.8539255147550682
       },
       "validation_optimal_threshold": {
+        "threshold": 0.4710787534713745,
+        "accuracy": 0.9713701431492843,
+        "precision": 0.8,
+        "recall": 0.8607594936708861,
+        "f1": 0.8292682926829268,
         "confusion_matrix": [
           [
+            882,
+            17
           ],
           [
+            11,
+            68
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9876819708846585,
+            "recall": 0.9810901001112347,
+            "f1-score": 0.984375,
             "support": 899.0
           },
           "RELEVANT": {
+            "precision": 0.8,
+            "recall": 0.8607594936708861,
+            "f1-score": 0.8292682926829268,
             "support": 79.0
           },
+          "accuracy": 0.9713701431492843,
           "macro avg": {
+            "precision": 0.8938409854423293,
+            "recall": 0.9209247968910603,
+            "f1-score": 0.9068216463414633,
             "support": 978.0
           },
           "weighted avg": {
+            "precision": 0.972521566283546,
+            "recall": 0.9713701431492843,
+            "f1-score": 0.9718459305950421,
             "support": 978.0
           }
         },
+        "roc_auc": 0.9661086157615353,
+        "average_precision": 0.8539255147550682
       },
       "test_default_0_5": {
         "threshold": 0.5,
+        "accuracy": 0.9507874015748031,
+        "precision": 0.7767857142857143,
+        "recall": 0.7767857142857143,
+        "f1": 0.7767857142857143,
         "confusion_matrix": [
           [
+            879,
+            25
           ],
           [
+            25,
+            87
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9723451327433629,
+            "recall": 0.9723451327433629,
+            "f1-score": 0.9723451327433629,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.7767857142857143,
+            "recall": 0.7767857142857143,
+            "f1-score": 0.7767857142857143,
             "support": 112.0
           },
+          "accuracy": 0.9507874015748031,
           "macro avg": {
+            "precision": 0.8745654235145386,
+            "recall": 0.8745654235145386,
+            "f1-score": 0.8745654235145386,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9507874015748031,
+            "recall": 0.9507874015748031,
+            "f1-score": 0.9507874015748031,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9682512247155499,
+        "average_precision": 0.8171206633671375
       },
       "test_optimal_threshold": {
+        "threshold": 0.4710787534713745,
+        "accuracy": 0.9498031496062992,
+        "precision": 0.7699115044247787,
+        "recall": 0.7767857142857143,
+        "f1": 0.7733333333333333,
         "confusion_matrix": [
           [
+            878,
+            26
           ],
           [
+            25,
+            87
           ]
         ],
         "classification_report": {
           "NOT_RELEVANT": {
+            "precision": 0.9723145071982281,
+            "recall": 0.9712389380530974,
+            "f1-score": 0.9717764250138351,
             "support": 904.0
           },
           "RELEVANT": {
+            "precision": 0.7699115044247787,
+            "recall": 0.7767857142857143,
+            "f1-score": 0.7733333333333333,
             "support": 112.0
           },
+          "accuracy": 0.9498031496062992,
           "macro avg": {
+            "precision": 0.8711130058115034,
+            "recall": 0.8740123261694058,
+            "f1-score": 0.8725548791735842,
             "support": 1016.0
           },
           "weighted avg": {
+            "precision": 0.9500023651602102,
+            "recall": 0.9498031496062992,
+            "f1-score": 0.9499008086081104,
             "support": 1016.0
           }
         },
+        "roc_auc": 0.9682512247155499,
+        "average_precision": 0.8171206633671375
       }
     }
   ]

transformer/checkpoint-1220/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:572ab7a5b2bc6140bc72ff08cc111f90496ceab40c64330fc6373973a0b6830c
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:b248f60ff3e153b28949243967a2debde809912442c1ef5fe19d89dad891f1f9
 size 1112205008

transformer/checkpoint-1220/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ce55d9becbab31f17b7cfca4a308defc4a0d3875c852bc0df4b57343af2e439a
 size 2224532875

 version https://git-lfs.github.com/spec/v1
+oid sha256:f731780f8bff3652e23ff4cf1692c96c1068919f515c85113ffd987765be34ce
 size 2224532875

transformer/checkpoint-1220/scaler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d3fd1def138c5a78584037782d4486580ca45db784f5e2a18955179628a8a257
 size 1383

 version https://git-lfs.github.com/spec/v1
+oid sha256:f2e11cad5f2deee13f6148971cf1c6ded27d5cbdc725a37902243981a6125a17
 size 1383

transformer/checkpoint-1220/trainer_state.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
-  "best_global_step": 1220,
-  "best_metric": 0.8414634146341463,
-  "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-1220",
   "epoch": 4.0,
   "eval_steps": 500,
   "global_step": 1220,
@@ -11,390 +11,390 @@
   "log_history": [
     {
       "epoch": 0.08196721311475409,
-      "grad_norm": Infinity,
       "learning_rate": 3.157894736842105e-06,
-      "loss": 0.7012384033203125,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
-      "grad_norm": 11.968428611755371,
       "learning_rate": 6.447368421052632e-06,
-      "loss": 0.4254766845703125,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
-      "grad_norm": 19.943565368652344,
       "learning_rate": 9.736842105263159e-06,
-      "loss": 0.3554811859130859,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
-      "grad_norm": 5.117671489715576,
       "learning_rate": 1.3026315789473684e-05,
-      "loss": 0.3145046615600586,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
-      "grad_norm": 11.402164459228516,
       "learning_rate": 1.6315789473684213e-05,
-      "loss": 0.2773847770690918,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
-      "grad_norm": 8.077093124389648,
       "learning_rate": 1.960526315789474e-05,
-      "loss": 0.2556156539916992,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
-      "grad_norm": 2.0055508613586426,
       "learning_rate": 1.9679533867443555e-05,
-      "loss": 0.25026893615722656,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
-      "grad_norm": 0.9293265342712402,
       "learning_rate": 1.9315367807720323e-05,
-      "loss": 0.24691852569580078,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
-      "grad_norm": 1.4303064346313477,
       "learning_rate": 1.8951201747997088e-05,
-      "loss": 0.21208112716674804,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
-      "grad_norm": 4.035928249359131,
       "learning_rate": 1.8587035688273852e-05,
-      "loss": 0.17017290115356445,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
-      "grad_norm": 4.241303443908691,
       "learning_rate": 1.822286962855062e-05,
-      "loss": 0.16115386962890624,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
-      "grad_norm": 0.3990240693092346,
       "learning_rate": 1.7858703568827385e-05,
-      "loss": 0.18343988418579102,
       "step": 300
     },
     {
       "epoch": 1.0,
-      "eval_accuracy": 0.9642126789366053,
-      "eval_f1": 0.7741935483870968,
-      "eval_loss": 0.14546315371990204,
-      "eval_precision": 0.7894736842105263,
-      "eval_recall": 0.759493670886076,
-      "eval_roc_auc": 0.9131946888948339,
-      "eval_runtime": 3.5921,
-      "eval_samples_per_second": 272.266,
-      "eval_steps_per_second": 8.63,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
-      "grad_norm": 4.865096569061279,
       "learning_rate": 1.7494537509104153e-05,
-      "loss": 0.19335979461669922,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
-      "grad_norm": 1.9780689477920532,
       "learning_rate": 1.7130371449380918e-05,
-      "loss": 0.2344082260131836,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
-      "grad_norm": 1.3759413957595825,
       "learning_rate": 1.6766205389657686e-05,
-      "loss": 0.20309404373168946,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
-      "grad_norm": 0.30811628699302673,
       "learning_rate": 1.640203932993445e-05,
-      "loss": 0.224365234375,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
-      "grad_norm": 5.530014514923096,
       "learning_rate": 1.603787327021122e-05,
-      "loss": 0.14759160995483397,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
-      "grad_norm": 3.7750189304351807,
       "learning_rate": 1.5673707210487983e-05,
-      "loss": 0.14137668609619142,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
-      "grad_norm": 1.8209186792373657,
       "learning_rate": 1.530954115076475e-05,
-      "loss": 0.19394855499267577,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
-      "grad_norm": 0.4824683368206024,
       "learning_rate": 1.4945375091041516e-05,
-      "loss": 0.1700056266784668,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
-      "grad_norm": 0.5682937502861023,
       "learning_rate": 1.4581209031318282e-05,
-      "loss": 0.17243267059326173,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
-      "grad_norm": 2.2086634635925293,
       "learning_rate": 1.4217042971595047e-05,
-      "loss": 0.15430424690246583,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
-      "grad_norm": 6.93908166885376,
       "learning_rate": 1.3852876911871815e-05,
-      "loss": 0.10752416610717773,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
-      "grad_norm": 5.092395782470703,
       "learning_rate": 1.3488710852148582e-05,
-      "loss": 0.21721889495849608,
       "step": 600
     },
     {
       "epoch": 2.0,
-      "eval_accuracy": 0.9570552147239264,
-      "eval_f1": 0.7692307692307693,
-      "eval_loss": 0.12331932783126831,
-      "eval_precision": 0.6796116504854369,
-      "eval_recall": 0.8860759493670886,
-      "eval_roc_auc": 0.9679672209628138,
-      "eval_runtime": 3.5595,
-      "eval_samples_per_second": 274.754,
-      "eval_steps_per_second": 8.709,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
-      "grad_norm": 12.040640830993652,
       "learning_rate": 1.3124544792425346e-05,
-      "loss": 0.11267939567565918,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
-      "grad_norm": 0.33291733264923096,
       "learning_rate": 1.2760378732702113e-05,
-      "loss": 0.16029356002807618,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
-      "grad_norm": 0.1562187671661377,
       "learning_rate": 1.239621267297888e-05,
-      "loss": 0.13354766845703125,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
-      "grad_norm": 0.39492854475975037,
       "learning_rate": 1.2032046613255645e-05,
-      "loss": 0.08748809814453125,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
-      "grad_norm": 0.22857463359832764,
       "learning_rate": 1.1667880553532412e-05,
-      "loss": 0.1255797290802002,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
-      "grad_norm": 42.23853302001953,
       "learning_rate": 1.1303714493809176e-05,
-      "loss": 0.09398910522460938,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
-      "grad_norm": 9.628519058227539,
       "learning_rate": 1.0939548434085944e-05,
-      "loss": 0.12067486763000489,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
-      "grad_norm": 8.281865119934082,
       "learning_rate": 1.057538237436271e-05,
-      "loss": 0.0960771656036377,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
-      "grad_norm": 0.2366073578596115,
       "learning_rate": 1.0211216314639475e-05,
-      "loss": 0.1477354335784912,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
-      "grad_norm": 2.127614974975586,
       "learning_rate": 9.847050254916243e-06,
-      "loss": 0.12143749237060547,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
-      "grad_norm": 0.1283058375120163,
       "learning_rate": 9.482884195193008e-06,
-      "loss": 0.0978905963897705,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
-      "grad_norm": 0.16377978026866913,
       "learning_rate": 9.118718135469774e-06,
-      "loss": 0.13501665115356445,
       "step": 900
     },
     {
       "epoch": 3.0,
-      "eval_accuracy": 0.9683026584867076,
-      "eval_f1": 0.8143712574850299,
-      "eval_loss": 0.11399859189987183,
-      "eval_precision": 0.7727272727272727,
-      "eval_recall": 0.8607594936708861,
-      "eval_roc_auc": 0.9685867560299067,
-      "eval_runtime": 3.583,
-      "eval_samples_per_second": 272.954,
-      "eval_steps_per_second": 8.652,
       "step": 915
     },
     {
       "epoch": 3.0327868852459017,
-      "grad_norm": 0.36435577273368835,
       "learning_rate": 8.754552075746541e-06,
-      "loss": 0.12846416473388672,
       "step": 925
     },
     {
       "epoch": 3.1147540983606556,
-      "grad_norm": 0.12181571871042252,
       "learning_rate": 8.390386016023307e-06,
-      "loss": 0.07323605537414551,
       "step": 950
     },
     {
       "epoch": 3.19672131147541,
-      "grad_norm": 0.08613187074661255,
       "learning_rate": 8.026219956300074e-06,
-      "loss": 0.11347267150878906,
       "step": 975
     },
     {
       "epoch": 3.278688524590164,
-      "grad_norm": 2.452489137649536,
       "learning_rate": 7.66205389657684e-06,
-      "loss": 0.07726279258728028,
       "step": 1000
     },
     {
       "epoch": 3.360655737704918,
-      "grad_norm": 0.05209459364414215,
       "learning_rate": 7.2978878368536055e-06,
-      "loss": 0.07418290138244629,
       "step": 1025
     },
     {
       "epoch": 3.442622950819672,
-      "grad_norm": 20.379858016967773,
       "learning_rate": 6.933721777130372e-06,
-      "loss": 0.12044317245483399,
       "step": 1050
     },
     {
       "epoch": 3.5245901639344264,
-      "grad_norm": 0.8267766237258911,
       "learning_rate": 6.569555717407138e-06,
-      "loss": 0.07422394752502441,
       "step": 1075
     },
     {
       "epoch": 3.6065573770491803,
-      "grad_norm": 0.18650104105472565,
       "learning_rate": 6.2053896576839045e-06,
-      "loss": 0.07682370185852051,
       "step": 1100
     },
     {
       "epoch": 3.6885245901639343,
-      "grad_norm": 0.07509063929319382,
       "learning_rate": 5.84122359796067e-06,
-      "loss": 0.06737090110778808,
       "step": 1125
     },
     {
       "epoch": 3.7704918032786887,
-      "grad_norm": 16.239089965820312,
       "learning_rate": 5.477057538237437e-06,
-      "loss": 0.0853554630279541,
       "step": 1150
     },
     {
       "epoch": 3.8524590163934427,
-      "grad_norm": 0.07212834060192108,
       "learning_rate": 5.112891478514203e-06,
-      "loss": 0.07101381301879883,
       "step": 1175
     },
     {
       "epoch": 3.9344262295081966,
-      "grad_norm": 0.05511339381337166,
       "learning_rate": 4.748725418790969e-06,
-      "loss": 0.09712701797485351,
       "step": 1200
     },
     {
       "epoch": 4.0,
-      "eval_accuracy": 0.9734151329243353,
-      "eval_f1": 0.8414634146341463,
-      "eval_loss": 0.11315910518169403,
-      "eval_precision": 0.8117647058823529,
-      "eval_recall": 0.8734177215189873,
-      "eval_roc_auc": 0.9707692091071654,
-      "eval_runtime": 3.7125,
-      "eval_samples_per_second": 263.434,
-      "eval_steps_per_second": 8.35,
       "step": 1220
     }
   ],
@@ -410,7 +410,7 @@
         "early_stopping_threshold": 0.0
       },
       "attributes": {
-        "early_stopping_patience_counter": 0
       }
     },
     "TrainerControl": {

 {
+  "best_global_step": 915,
+  "best_metric": 0.8220858895705522,
+  "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-915",
   "epoch": 4.0,
   "eval_steps": 500,
   "global_step": 1220,
   "log_history": [
     {
       "epoch": 0.08196721311475409,
+      "grad_norm": 6.055062770843506,
       "learning_rate": 3.157894736842105e-06,
+      "loss": 0.62972900390625,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
+      "grad_norm": 10.6914701461792,
       "learning_rate": 6.447368421052632e-06,
+      "loss": 0.44850738525390627,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
+      "grad_norm": 6.670228481292725,
       "learning_rate": 9.736842105263159e-06,
+      "loss": 0.3566379165649414,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
+      "grad_norm": 2.589911937713623,
       "learning_rate": 1.3026315789473684e-05,
+      "loss": 0.2718839645385742,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
+      "grad_norm": 22.02676773071289,
       "learning_rate": 1.6315789473684213e-05,
+      "loss": 0.1922766876220703,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
+      "grad_norm": 2.6362855434417725,
       "learning_rate": 1.960526315789474e-05,
+      "loss": 0.1837622833251953,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
+      "grad_norm": 3.478484630584717,
       "learning_rate": 1.9679533867443555e-05,
+      "loss": 0.18766048431396484,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
+      "grad_norm": 8.077605247497559,
       "learning_rate": 1.9315367807720323e-05,
+      "loss": 0.23830581665039063,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
+      "grad_norm": 0.7427046298980713,
       "learning_rate": 1.8951201747997088e-05,
+      "loss": 0.30742517471313474,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
+      "grad_norm": 36.34975051879883,
       "learning_rate": 1.8587035688273852e-05,
+      "loss": 0.22336017608642578,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
+      "grad_norm": 5.215510845184326,
       "learning_rate": 1.822286962855062e-05,
+      "loss": 0.13779294967651368,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
+      "grad_norm": 3.551121950149536,
       "learning_rate": 1.7858703568827385e-05,
+      "loss": 0.19200111389160157,
       "step": 300
     },
     {
       "epoch": 1.0,
+      "eval_accuracy": 0.9631901840490797,
+      "eval_f1": 0.7721518987341772,
+      "eval_loss": 0.1292734444141388,
+      "eval_precision": 0.7721518987341772,
+      "eval_recall": 0.7721518987341772,
+      "eval_roc_auc": 0.9563720589684741,
+      "eval_runtime": 3.3396,
+      "eval_samples_per_second": 292.853,
+      "eval_steps_per_second": 9.283,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
+      "grad_norm": 0.5402449369430542,
       "learning_rate": 1.7494537509104153e-05,
+      "loss": 0.1241053295135498,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
+      "grad_norm": 4.476892948150635,
       "learning_rate": 1.7130371449380918e-05,
+      "loss": 0.20724605560302733,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
+      "grad_norm": 0.46729782223701477,
       "learning_rate": 1.6766205389657686e-05,
+      "loss": 0.13567353248596192,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
+      "grad_norm": 0.1852118819952011,
       "learning_rate": 1.640203932993445e-05,
+      "loss": 0.13295170783996582,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
+      "grad_norm": 1.2681413888931274,
       "learning_rate": 1.603787327021122e-05,
+      "loss": 0.2027936363220215,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
+      "grad_norm": 7.484091281890869,
       "learning_rate": 1.5673707210487983e-05,
+      "loss": 0.12364128112792969,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
+      "grad_norm": 0.46489500999450684,
       "learning_rate": 1.530954115076475e-05,
+      "loss": 0.14407362937927246,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
+      "grad_norm": 0.20967872440814972,
       "learning_rate": 1.4945375091041516e-05,
+      "loss": 0.12458925247192383,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
+      "grad_norm": 0.1643747240304947,
       "learning_rate": 1.4581209031318282e-05,
+      "loss": 0.21631996154785157,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
+      "grad_norm": 7.073329448699951,
       "learning_rate": 1.4217042971595047e-05,
+      "loss": 0.16043865203857421,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
+      "grad_norm": 1.744958758354187,
       "learning_rate": 1.3852876911871815e-05,
+      "loss": 0.0966644287109375,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
+      "grad_norm": 12.79035472869873,
       "learning_rate": 1.3488710852148582e-05,
+      "loss": 0.15884541511535644,
       "step": 600
     },
     {
       "epoch": 2.0,
+      "eval_accuracy": 0.9611451942740287,
+      "eval_f1": 0.7432432432432432,
+      "eval_loss": 0.13287827372550964,
+      "eval_precision": 0.7971014492753623,
+      "eval_recall": 0.6962025316455697,
+      "eval_roc_auc": 0.9594697343039381,
+      "eval_runtime": 3.2739,
+      "eval_samples_per_second": 298.727,
+      "eval_steps_per_second": 9.469,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
+      "grad_norm": 17.520444869995117,
       "learning_rate": 1.3124544792425346e-05,
+      "loss": 0.08896012306213379,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
+      "grad_norm": 0.16623224318027496,
       "learning_rate": 1.2760378732702113e-05,
+      "loss": 0.11752216339111328,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
+      "grad_norm": 0.20762814581394196,
       "learning_rate": 1.239621267297888e-05,
+      "loss": 0.1193038272857666,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
+      "grad_norm": 0.1500111073255539,
       "learning_rate": 1.2032046613255645e-05,
+      "loss": 0.0630855655670166,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
+      "grad_norm": 0.17727839946746826,
       "learning_rate": 1.1667880553532412e-05,
+      "loss": 0.08730959892272949,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
+      "grad_norm": 4.3997321128845215,
       "learning_rate": 1.1303714493809176e-05,
+      "loss": 0.12114215850830078,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
+      "grad_norm": 34.47224044799805,
       "learning_rate": 1.0939548434085944e-05,
+      "loss": 0.11070786476135254,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
+      "grad_norm": 25.977081298828125,
       "learning_rate": 1.057538237436271e-05,
+      "loss": 0.10845686912536621,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
+      "grad_norm": 0.1657736450433731,
       "learning_rate": 1.0211216314639475e-05,
+      "loss": 0.1025285530090332,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
+      "grad_norm": 34.05498504638672,
       "learning_rate": 9.847050254916243e-06,
+      "loss": 0.07825160026550293,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
+      "grad_norm": 0.2868161201477051,
       "learning_rate": 9.482884195193008e-06,
+      "loss": 0.12041816711425782,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
+      "grad_norm": 0.19192977249622345,
       "learning_rate": 9.118718135469774e-06,
+      "loss": 0.08709416389465333,
       "step": 900
     },
     {
       "epoch": 3.0,
+      "eval_accuracy": 0.9703476482617587,
+      "eval_f1": 0.8220858895705522,
+      "eval_loss": 0.11163181066513062,
+      "eval_precision": 0.7976190476190477,
+      "eval_recall": 0.8481012658227848,
+      "eval_roc_auc": 0.9661086157615353,
+      "eval_runtime": 3.1733,
+      "eval_samples_per_second": 308.193,
+      "eval_steps_per_second": 9.769,
       "step": 915
     },
     {
       "epoch": 3.0327868852459017,
+      "grad_norm": 1.0706992149353027,
       "learning_rate": 8.754552075746541e-06,
+      "loss": 0.10751664161682128,
       "step": 925
     },
     {
       "epoch": 3.1147540983606556,
+      "grad_norm": 0.12844231724739075,
       "learning_rate": 8.390386016023307e-06,
+      "loss": 0.06818144798278808,
       "step": 950
     },
     {
       "epoch": 3.19672131147541,
+      "grad_norm": 0.07692205160856247,
       "learning_rate": 8.026219956300074e-06,
+      "loss": 0.12229555130004882,
       "step": 975
     },
     {
       "epoch": 3.278688524590164,
+      "grad_norm": 1.773990511894226,
       "learning_rate": 7.66205389657684e-06,
+      "loss": 0.06936595916748046,
       "step": 1000
     },
     {
       "epoch": 3.360655737704918,
+      "grad_norm": 0.07844381034374237,
       "learning_rate": 7.2978878368536055e-06,
+      "loss": 0.05219663143157959,
       "step": 1025
     },
     {
       "epoch": 3.442622950819672,
+      "grad_norm": 12.502548217773438,
       "learning_rate": 6.933721777130372e-06,
+      "loss": 0.06849228858947753,
       "step": 1050
     },
     {
       "epoch": 3.5245901639344264,
+      "grad_norm": 1.6993861198425293,
       "learning_rate": 6.569555717407138e-06,
+      "loss": 0.08783550262451172,
       "step": 1075
     },
     {
       "epoch": 3.6065573770491803,
+      "grad_norm": 0.06551510095596313,
       "learning_rate": 6.2053896576839045e-06,
+      "loss": 0.049420347213745115,
       "step": 1100
     },
     {
       "epoch": 3.6885245901639343,
+      "grad_norm": 0.034276798367500305,
       "learning_rate": 5.84122359796067e-06,
+      "loss": 0.05244039058685303,
       "step": 1125
     },
     {
       "epoch": 3.7704918032786887,
+      "grad_norm": 10.901683807373047,
       "learning_rate": 5.477057538237437e-06,
+      "loss": 0.06656317710876465,
       "step": 1150
     },
     {
       "epoch": 3.8524590163934427,
+      "grad_norm": 2.3856894969940186,
       "learning_rate": 5.112891478514203e-06,
+      "loss": 0.06277508735656738,
       "step": 1175
     },
     {
       "epoch": 3.9344262295081966,
+      "grad_norm": 0.018699949607253075,
       "learning_rate": 4.748725418790969e-06,
+      "loss": 0.046858911514282224,
       "step": 1200
     },
     {
       "epoch": 4.0,
+      "eval_accuracy": 0.9662576687116564,
+      "eval_f1": 0.8047337278106509,
+      "eval_loss": 0.14547723531723022,
+      "eval_precision": 0.7555555555555555,
+      "eval_recall": 0.8607594936708861,
+      "eval_roc_auc": 0.9600822291998141,
+      "eval_runtime": 3.1883,
+      "eval_samples_per_second": 306.745,
+      "eval_steps_per_second": 9.723,
       "step": 1220
     }
   ],
         "early_stopping_threshold": 0.0
       },
       "attributes": {
+        "early_stopping_patience_counter": 1
       }
     },
     "TrainerControl": {

transformer/checkpoint-1525/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c80cdf8b98658539a77a69a282499e626f22060577eb079e6a055f09aba66066
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:ca68ad0b5b62ce1f08ec00239ceda526ea353b1ddc553e305f4ceea6acda5317
 size 1112205008

transformer/checkpoint-1525/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:014bf9ccc323714518bfa10873ef36916a7df86a1608813c3700872b55906b0c
 size 2224532875

 version https://git-lfs.github.com/spec/v1
+oid sha256:50962bf386796cce93bfd0be8d501de258304f3ad55fd24f22491068591bd9e2
 size 2224532875

transformer/checkpoint-1525/scaler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2670ef1c6526aa1bf88e4427fd5e916ff5c25d9c98fc894c142c473952a06106
 size 1383

 version https://git-lfs.github.com/spec/v1
+oid sha256:6d3ceaf712c034a0e9722143d5622d35a083b5d5c1fc678fc7c4e4e70e581221
 size 1383

transformer/checkpoint-1525/trainer_state.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
-  "best_global_step": 1220,
-  "best_metric": 0.8414634146341463,
-  "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-1220",
   "epoch": 5.0,
   "eval_steps": 500,
   "global_step": 1525,
@@ -11,494 +11,494 @@
   "log_history": [
     {
       "epoch": 0.08196721311475409,
-      "grad_norm": Infinity,
       "learning_rate": 3.157894736842105e-06,
-      "loss": 0.7012384033203125,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
-      "grad_norm": 11.968428611755371,
       "learning_rate": 6.447368421052632e-06,
-      "loss": 0.4254766845703125,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
-      "grad_norm": 19.943565368652344,
       "learning_rate": 9.736842105263159e-06,
-      "loss": 0.3554811859130859,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
-      "grad_norm": 5.117671489715576,
       "learning_rate": 1.3026315789473684e-05,
-      "loss": 0.3145046615600586,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
-      "grad_norm": 11.402164459228516,
       "learning_rate": 1.6315789473684213e-05,
-      "loss": 0.2773847770690918,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
-      "grad_norm": 8.077093124389648,
       "learning_rate": 1.960526315789474e-05,
-      "loss": 0.2556156539916992,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
-      "grad_norm": 2.0055508613586426,
       "learning_rate": 1.9679533867443555e-05,
-      "loss": 0.25026893615722656,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
-      "grad_norm": 0.9293265342712402,
       "learning_rate": 1.9315367807720323e-05,
-      "loss": 0.24691852569580078,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
-      "grad_norm": 1.4303064346313477,
       "learning_rate": 1.8951201747997088e-05,
-      "loss": 0.21208112716674804,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
-      "grad_norm": 4.035928249359131,
       "learning_rate": 1.8587035688273852e-05,
-      "loss": 0.17017290115356445,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
-      "grad_norm": 4.241303443908691,
       "learning_rate": 1.822286962855062e-05,
-      "loss": 0.16115386962890624,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
-      "grad_norm": 0.3990240693092346,
       "learning_rate": 1.7858703568827385e-05,
-      "loss": 0.18343988418579102,
       "step": 300
     },
     {
       "epoch": 1.0,
-      "eval_accuracy": 0.9642126789366053,
-      "eval_f1": 0.7741935483870968,
-      "eval_loss": 0.14546315371990204,
-      "eval_precision": 0.7894736842105263,
-      "eval_recall": 0.759493670886076,
-      "eval_roc_auc": 0.9131946888948339,
-      "eval_runtime": 3.5921,
-      "eval_samples_per_second": 272.266,
-      "eval_steps_per_second": 8.63,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
-      "grad_norm": 4.865096569061279,
       "learning_rate": 1.7494537509104153e-05,
-      "loss": 0.19335979461669922,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
-      "grad_norm": 1.9780689477920532,
       "learning_rate": 1.7130371449380918e-05,
-      "loss": 0.2344082260131836,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
-      "grad_norm": 1.3759413957595825,
       "learning_rate": 1.6766205389657686e-05,
-      "loss": 0.20309404373168946,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
-      "grad_norm": 0.30811628699302673,
       "learning_rate": 1.640203932993445e-05,
-      "loss": 0.224365234375,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
-      "grad_norm": 5.530014514923096,
       "learning_rate": 1.603787327021122e-05,
-      "loss": 0.14759160995483397,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
-      "grad_norm": 3.7750189304351807,
       "learning_rate": 1.5673707210487983e-05,
-      "loss": 0.14137668609619142,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
-      "grad_norm": 1.8209186792373657,
       "learning_rate": 1.530954115076475e-05,
-      "loss": 0.19394855499267577,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
-      "grad_norm": 0.4824683368206024,
       "learning_rate": 1.4945375091041516e-05,
-      "loss": 0.1700056266784668,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
-      "grad_norm": 0.5682937502861023,
       "learning_rate": 1.4581209031318282e-05,
-      "loss": 0.17243267059326173,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
-      "grad_norm": 2.2086634635925293,
       "learning_rate": 1.4217042971595047e-05,
-      "loss": 0.15430424690246583,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
-      "grad_norm": 6.93908166885376,
       "learning_rate": 1.3852876911871815e-05,
-      "loss": 0.10752416610717773,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
-      "grad_norm": 5.092395782470703,
       "learning_rate": 1.3488710852148582e-05,
-      "loss": 0.21721889495849608,
       "step": 600
     },
     {
       "epoch": 2.0,
-      "eval_accuracy": 0.9570552147239264,
-      "eval_f1": 0.7692307692307693,
-      "eval_loss": 0.12331932783126831,
-      "eval_precision": 0.6796116504854369,
-      "eval_recall": 0.8860759493670886,
-      "eval_roc_auc": 0.9679672209628138,
-      "eval_runtime": 3.5595,
-      "eval_samples_per_second": 274.754,
-      "eval_steps_per_second": 8.709,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
-      "grad_norm": 12.040640830993652,
       "learning_rate": 1.3124544792425346e-05,
-      "loss": 0.11267939567565918,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
-      "grad_norm": 0.33291733264923096,
       "learning_rate": 1.2760378732702113e-05,
-      "loss": 0.16029356002807618,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
-      "grad_norm": 0.1562187671661377,
       "learning_rate": 1.239621267297888e-05,
-      "loss": 0.13354766845703125,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
-      "grad_norm": 0.39492854475975037,
       "learning_rate": 1.2032046613255645e-05,
-      "loss": 0.08748809814453125,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
-      "grad_norm": 0.22857463359832764,
       "learning_rate": 1.1667880553532412e-05,
-      "loss": 0.1255797290802002,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
-      "grad_norm": 42.23853302001953,
       "learning_rate": 1.1303714493809176e-05,
-      "loss": 0.09398910522460938,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
-      "grad_norm": 9.628519058227539,
       "learning_rate": 1.0939548434085944e-05,
-      "loss": 0.12067486763000489,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
-      "grad_norm": 8.281865119934082,
       "learning_rate": 1.057538237436271e-05,
-      "loss": 0.0960771656036377,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
-      "grad_norm": 0.2366073578596115,
       "learning_rate": 1.0211216314639475e-05,
-      "loss": 0.1477354335784912,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
-      "grad_norm": 2.127614974975586,
       "learning_rate": 9.847050254916243e-06,
-      "loss": 0.12143749237060547,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
-      "grad_norm": 0.1283058375120163,
       "learning_rate": 9.482884195193008e-06,
-      "loss": 0.0978905963897705,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
-      "grad_norm": 0.16377978026866913,
       "learning_rate": 9.118718135469774e-06,
-      "loss": 0.13501665115356445,
       "step": 900
     },
     {
       "epoch": 3.0,
-      "eval_accuracy": 0.9683026584867076,
-      "eval_f1": 0.8143712574850299,
-      "eval_loss": 0.11399859189987183,
-      "eval_precision": 0.7727272727272727,
-      "eval_recall": 0.8607594936708861,
-      "eval_roc_auc": 0.9685867560299067,
-      "eval_runtime": 3.583,
-      "eval_samples_per_second": 272.954,
-      "eval_steps_per_second": 8.652,
       "step": 915
     },
     {
       "epoch": 3.0327868852459017,
-      "grad_norm": 0.36435577273368835,
       "learning_rate": 8.754552075746541e-06,
-      "loss": 0.12846416473388672,
       "step": 925
     },
     {
       "epoch": 3.1147540983606556,
-      "grad_norm": 0.12181571871042252,
       "learning_rate": 8.390386016023307e-06,
-      "loss": 0.07323605537414551,
       "step": 950
     },
     {
       "epoch": 3.19672131147541,
-      "grad_norm": 0.08613187074661255,
       "learning_rate": 8.026219956300074e-06,
-      "loss": 0.11347267150878906,
       "step": 975
     },
     {
       "epoch": 3.278688524590164,
-      "grad_norm": 2.452489137649536,
       "learning_rate": 7.66205389657684e-06,
-      "loss": 0.07726279258728028,
       "step": 1000
     },
     {
       "epoch": 3.360655737704918,
-      "grad_norm": 0.05209459364414215,
       "learning_rate": 7.2978878368536055e-06,
-      "loss": 0.07418290138244629,
       "step": 1025
     },
     {
       "epoch": 3.442622950819672,
-      "grad_norm": 20.379858016967773,
       "learning_rate": 6.933721777130372e-06,
-      "loss": 0.12044317245483399,
       "step": 1050
     },
     {
       "epoch": 3.5245901639344264,
-      "grad_norm": 0.8267766237258911,
       "learning_rate": 6.569555717407138e-06,
-      "loss": 0.07422394752502441,
       "step": 1075
     },
     {
       "epoch": 3.6065573770491803,
-      "grad_norm": 0.18650104105472565,
       "learning_rate": 6.2053896576839045e-06,
-      "loss": 0.07682370185852051,
       "step": 1100
     },
     {
       "epoch": 3.6885245901639343,
-      "grad_norm": 0.07509063929319382,
       "learning_rate": 5.84122359796067e-06,
-      "loss": 0.06737090110778808,
       "step": 1125
     },
     {
       "epoch": 3.7704918032786887,
-      "grad_norm": 16.239089965820312,
       "learning_rate": 5.477057538237437e-06,
-      "loss": 0.0853554630279541,
       "step": 1150
     },
     {
       "epoch": 3.8524590163934427,
-      "grad_norm": 0.07212834060192108,
       "learning_rate": 5.112891478514203e-06,
-      "loss": 0.07101381301879883,
       "step": 1175
     },
     {
       "epoch": 3.9344262295081966,
-      "grad_norm": 0.05511339381337166,
       "learning_rate": 4.748725418790969e-06,
-      "loss": 0.09712701797485351,
       "step": 1200
     },
     {
       "epoch": 4.0,
-      "eval_accuracy": 0.9734151329243353,
-      "eval_f1": 0.8414634146341463,
-      "eval_loss": 0.11315910518169403,
-      "eval_precision": 0.8117647058823529,
-      "eval_recall": 0.8734177215189873,
-      "eval_roc_auc": 0.9707692091071654,
-      "eval_runtime": 3.7125,
-      "eval_samples_per_second": 263.434,
-      "eval_steps_per_second": 8.35,
       "step": 1220
     },
     {
       "epoch": 4.016393442622951,
-      "grad_norm": 0.16843096911907196,
       "learning_rate": 4.3845593590677355e-06,
-      "loss": 0.08288318634033204,
       "step": 1225
     },
     {
       "epoch": 4.098360655737705,
-      "grad_norm": 0.12639367580413818,
       "learning_rate": 4.020393299344502e-06,
-      "loss": 0.0468332052230835,
       "step": 1250
     },
     {
       "epoch": 4.180327868852459,
-      "grad_norm": 0.11602156609296799,
       "learning_rate": 3.656227239621268e-06,
-      "loss": 0.041537661552429196,
       "step": 1275
     },
     {
       "epoch": 4.262295081967213,
-      "grad_norm": 0.046698153018951416,
       "learning_rate": 3.292061179898034e-06,
-      "loss": 0.04678268432617187,
       "step": 1300
     },
     {
       "epoch": 4.344262295081967,
-      "grad_norm": 0.05532635748386383,
       "learning_rate": 2.9278951201748e-06,
-      "loss": 0.04036891937255859,
       "step": 1325
     },
     {
       "epoch": 4.426229508196721,
-      "grad_norm": 0.1428351104259491,
       "learning_rate": 2.5637290604515665e-06,
-      "loss": 0.037136049270629884,
       "step": 1350
     },
     {
       "epoch": 4.508196721311475,
-      "grad_norm": 0.11422587931156158,
       "learning_rate": 2.1995630007283324e-06,
-      "loss": 0.09556760787963867,
       "step": 1375
     },
     {
       "epoch": 4.590163934426229,
-      "grad_norm": 0.10767149180173874,
       "learning_rate": 1.8353969410050983e-06,
-      "loss": 0.07056922912597656,
       "step": 1400
     },
     {
       "epoch": 4.672131147540983,
-      "grad_norm": 7.874922752380371,
       "learning_rate": 1.4712308812818645e-06,
-      "loss": 0.0239799165725708,
       "step": 1425
     },
     {
       "epoch": 4.754098360655737,
-      "grad_norm": 0.9701394438743591,
       "learning_rate": 1.1070648215586309e-06,
-      "loss": 0.06149462699890137,
       "step": 1450
     },
     {
       "epoch": 4.836065573770492,
-      "grad_norm": 0.05171401798725128,
       "learning_rate": 7.428987618353969e-07,
-      "loss": 0.038357572555541994,
       "step": 1475
     },
     {
       "epoch": 4.918032786885246,
-      "grad_norm": 0.05953866243362427,
       "learning_rate": 3.787327021121632e-07,
-      "loss": 0.12664172172546387,
       "step": 1500
     },
     {
       "epoch": 5.0,
-      "grad_norm": 0.10173187404870987,
       "learning_rate": 1.4566642388929353e-08,
-      "loss": 0.1284790515899658,
       "step": 1525
     },
     {
       "epoch": 5.0,
-      "eval_accuracy": 0.9713701431492843,
-      "eval_f1": 0.8313253012048193,
-      "eval_loss": 0.1342686116695404,
-      "eval_precision": 0.7931034482758621,
-      "eval_recall": 0.8734177215189873,
-      "eval_roc_auc": 0.9718111544472763,
-      "eval_runtime": 3.5346,
-      "eval_samples_per_second": 276.69,
-      "eval_steps_per_second": 8.77,
       "step": 1525
     }
   ],
@@ -514,7 +514,7 @@
         "early_stopping_threshold": 0.0
       },
       "attributes": {
-        "early_stopping_patience_counter": 1
       }
     },
     "TrainerControl": {

 {
+  "best_global_step": 915,
+  "best_metric": 0.8220858895705522,
+  "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-915",
   "epoch": 5.0,
   "eval_steps": 500,
   "global_step": 1525,
   "log_history": [
     {
       "epoch": 0.08196721311475409,
+      "grad_norm": 6.055062770843506,
       "learning_rate": 3.157894736842105e-06,
+      "loss": 0.62972900390625,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
+      "grad_norm": 10.6914701461792,
       "learning_rate": 6.447368421052632e-06,
+      "loss": 0.44850738525390627,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
+      "grad_norm": 6.670228481292725,
       "learning_rate": 9.736842105263159e-06,
+      "loss": 0.3566379165649414,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
+      "grad_norm": 2.589911937713623,
       "learning_rate": 1.3026315789473684e-05,
+      "loss": 0.2718839645385742,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
+      "grad_norm": 22.02676773071289,
       "learning_rate": 1.6315789473684213e-05,
+      "loss": 0.1922766876220703,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
+      "grad_norm": 2.6362855434417725,
       "learning_rate": 1.960526315789474e-05,
+      "loss": 0.1837622833251953,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
+      "grad_norm": 3.478484630584717,
       "learning_rate": 1.9679533867443555e-05,
+      "loss": 0.18766048431396484,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
+      "grad_norm": 8.077605247497559,
       "learning_rate": 1.9315367807720323e-05,
+      "loss": 0.23830581665039063,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
+      "grad_norm": 0.7427046298980713,
       "learning_rate": 1.8951201747997088e-05,
+      "loss": 0.30742517471313474,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
+      "grad_norm": 36.34975051879883,
       "learning_rate": 1.8587035688273852e-05,
+      "loss": 0.22336017608642578,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
+      "grad_norm": 5.215510845184326,
       "learning_rate": 1.822286962855062e-05,
+      "loss": 0.13779294967651368,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
+      "grad_norm": 3.551121950149536,
       "learning_rate": 1.7858703568827385e-05,
+      "loss": 0.19200111389160157,
       "step": 300
     },
     {
       "epoch": 1.0,
+      "eval_accuracy": 0.9631901840490797,
+      "eval_f1": 0.7721518987341772,
+      "eval_loss": 0.1292734444141388,
+      "eval_precision": 0.7721518987341772,
+      "eval_recall": 0.7721518987341772,
+      "eval_roc_auc": 0.9563720589684741,
+      "eval_runtime": 3.3396,
+      "eval_samples_per_second": 292.853,
+      "eval_steps_per_second": 9.283,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
+      "grad_norm": 0.5402449369430542,
       "learning_rate": 1.7494537509104153e-05,
+      "loss": 0.1241053295135498,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
+      "grad_norm": 4.476892948150635,
       "learning_rate": 1.7130371449380918e-05,
+      "loss": 0.20724605560302733,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
+      "grad_norm": 0.46729782223701477,
       "learning_rate": 1.6766205389657686e-05,
+      "loss": 0.13567353248596192,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
+      "grad_norm": 0.1852118819952011,
       "learning_rate": 1.640203932993445e-05,
+      "loss": 0.13295170783996582,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
+      "grad_norm": 1.2681413888931274,
       "learning_rate": 1.603787327021122e-05,
+      "loss": 0.2027936363220215,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
+      "grad_norm": 7.484091281890869,
       "learning_rate": 1.5673707210487983e-05,
+      "loss": 0.12364128112792969,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
+      "grad_norm": 0.46489500999450684,
       "learning_rate": 1.530954115076475e-05,
+      "loss": 0.14407362937927246,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
+      "grad_norm": 0.20967872440814972,
       "learning_rate": 1.4945375091041516e-05,
+      "loss": 0.12458925247192383,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
+      "grad_norm": 0.1643747240304947,
       "learning_rate": 1.4581209031318282e-05,
+      "loss": 0.21631996154785157,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
+      "grad_norm": 7.073329448699951,
       "learning_rate": 1.4217042971595047e-05,
+      "loss": 0.16043865203857421,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
+      "grad_norm": 1.744958758354187,
       "learning_rate": 1.3852876911871815e-05,
+      "loss": 0.0966644287109375,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
+      "grad_norm": 12.79035472869873,
       "learning_rate": 1.3488710852148582e-05,
+      "loss": 0.15884541511535644,
       "step": 600
     },
     {
       "epoch": 2.0,
+      "eval_accuracy": 0.9611451942740287,
+      "eval_f1": 0.7432432432432432,
+      "eval_loss": 0.13287827372550964,
+      "eval_precision": 0.7971014492753623,
+      "eval_recall": 0.6962025316455697,
+      "eval_roc_auc": 0.9594697343039381,
+      "eval_runtime": 3.2739,
+      "eval_samples_per_second": 298.727,
+      "eval_steps_per_second": 9.469,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
+      "grad_norm": 17.520444869995117,
       "learning_rate": 1.3124544792425346e-05,
+      "loss": 0.08896012306213379,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
+      "grad_norm": 0.16623224318027496,
       "learning_rate": 1.2760378732702113e-05,
+      "loss": 0.11752216339111328,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
+      "grad_norm": 0.20762814581394196,
       "learning_rate": 1.239621267297888e-05,
+      "loss": 0.1193038272857666,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
+      "grad_norm": 0.1500111073255539,
       "learning_rate": 1.2032046613255645e-05,
+      "loss": 0.0630855655670166,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
+      "grad_norm": 0.17727839946746826,
       "learning_rate": 1.1667880553532412e-05,
+      "loss": 0.08730959892272949,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
+      "grad_norm": 4.3997321128845215,
       "learning_rate": 1.1303714493809176e-05,
+      "loss": 0.12114215850830078,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
+      "grad_norm": 34.47224044799805,
       "learning_rate": 1.0939548434085944e-05,
+      "loss": 0.11070786476135254,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
+      "grad_norm": 25.977081298828125,
       "learning_rate": 1.057538237436271e-05,
+      "loss": 0.10845686912536621,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
+      "grad_norm": 0.1657736450433731,
       "learning_rate": 1.0211216314639475e-05,
+      "loss": 0.1025285530090332,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
+      "grad_norm": 34.05498504638672,
       "learning_rate": 9.847050254916243e-06,
+      "loss": 0.07825160026550293,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
+      "grad_norm": 0.2868161201477051,
       "learning_rate": 9.482884195193008e-06,
+      "loss": 0.12041816711425782,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
+      "grad_norm": 0.19192977249622345,
       "learning_rate": 9.118718135469774e-06,
+      "loss": 0.08709416389465333,
       "step": 900
     },
     {
       "epoch": 3.0,
+      "eval_accuracy": 0.9703476482617587,
+      "eval_f1": 0.8220858895705522,
+      "eval_loss": 0.11163181066513062,
+      "eval_precision": 0.7976190476190477,
+      "eval_recall": 0.8481012658227848,
+      "eval_roc_auc": 0.9661086157615353,
+      "eval_runtime": 3.1733,
+      "eval_samples_per_second": 308.193,
+      "eval_steps_per_second": 9.769,
       "step": 915
     },
     {
       "epoch": 3.0327868852459017,
+      "grad_norm": 1.0706992149353027,
       "learning_rate": 8.754552075746541e-06,
+      "loss": 0.10751664161682128,
       "step": 925
     },
     {
       "epoch": 3.1147540983606556,
+      "grad_norm": 0.12844231724739075,
       "learning_rate": 8.390386016023307e-06,
+      "loss": 0.06818144798278808,
       "step": 950
     },
     {
       "epoch": 3.19672131147541,
+      "grad_norm": 0.07692205160856247,
       "learning_rate": 8.026219956300074e-06,
+      "loss": 0.12229555130004882,
       "step": 975
     },
     {
       "epoch": 3.278688524590164,
+      "grad_norm": 1.773990511894226,
       "learning_rate": 7.66205389657684e-06,
+      "loss": 0.06936595916748046,
       "step": 1000
     },
     {
       "epoch": 3.360655737704918,
+      "grad_norm": 0.07844381034374237,
       "learning_rate": 7.2978878368536055e-06,
+      "loss": 0.05219663143157959,
       "step": 1025
     },
     {
       "epoch": 3.442622950819672,
+      "grad_norm": 12.502548217773438,
       "learning_rate": 6.933721777130372e-06,
+      "loss": 0.06849228858947753,
       "step": 1050
     },
     {
       "epoch": 3.5245901639344264,
+      "grad_norm": 1.6993861198425293,
       "learning_rate": 6.569555717407138e-06,
+      "loss": 0.08783550262451172,
       "step": 1075
     },
     {
       "epoch": 3.6065573770491803,
+      "grad_norm": 0.06551510095596313,
       "learning_rate": 6.2053896576839045e-06,
+      "loss": 0.049420347213745115,
       "step": 1100
     },
     {
       "epoch": 3.6885245901639343,
+      "grad_norm": 0.034276798367500305,
       "learning_rate": 5.84122359796067e-06,
+      "loss": 0.05244039058685303,
       "step": 1125
     },
     {
       "epoch": 3.7704918032786887,
+      "grad_norm": 10.901683807373047,
       "learning_rate": 5.477057538237437e-06,
+      "loss": 0.06656317710876465,
       "step": 1150
     },
     {
       "epoch": 3.8524590163934427,
+      "grad_norm": 2.3856894969940186,
       "learning_rate": 5.112891478514203e-06,
+      "loss": 0.06277508735656738,
       "step": 1175
     },
     {
       "epoch": 3.9344262295081966,
+      "grad_norm": 0.018699949607253075,
       "learning_rate": 4.748725418790969e-06,
+      "loss": 0.046858911514282224,
       "step": 1200
     },
     {
       "epoch": 4.0,
+      "eval_accuracy": 0.9662576687116564,
+      "eval_f1": 0.8047337278106509,
+      "eval_loss": 0.14547723531723022,
+      "eval_precision": 0.7555555555555555,
+      "eval_recall": 0.8607594936708861,
+      "eval_roc_auc": 0.9600822291998141,
+      "eval_runtime": 3.1883,
+      "eval_samples_per_second": 306.745,
+      "eval_steps_per_second": 9.723,
       "step": 1220
     },
     {
       "epoch": 4.016393442622951,
+      "grad_norm": 2.7085537910461426,
       "learning_rate": 4.3845593590677355e-06,
+      "loss": 0.0630385398864746,
       "step": 1225
     },
     {
       "epoch": 4.098360655737705,
+      "grad_norm": 45.71488952636719,
       "learning_rate": 4.020393299344502e-06,
+      "loss": 0.03476689338684082,
       "step": 1250
     },
     {
       "epoch": 4.180327868852459,
+      "grad_norm": 0.14275555312633514,
       "learning_rate": 3.656227239621268e-06,
+      "loss": 0.04420119285583496,
       "step": 1275
     },
     {
       "epoch": 4.262295081967213,
+      "grad_norm": 0.03228295221924782,
       "learning_rate": 3.292061179898034e-06,
+      "loss": 0.0440864372253418,
       "step": 1300
     },
     {
       "epoch": 4.344262295081967,
+      "grad_norm": 0.019369477406144142,
       "learning_rate": 2.9278951201748e-06,
+      "loss": 0.035089619159698486,
       "step": 1325
     },
     {
       "epoch": 4.426229508196721,
+      "grad_norm": 0.08530243486166,
       "learning_rate": 2.5637290604515665e-06,
+      "loss": 0.04550007343292237,
       "step": 1350
     },
     {
       "epoch": 4.508196721311475,
+      "grad_norm": 0.037266574800014496,
       "learning_rate": 2.1995630007283324e-06,
+      "loss": 0.059287338256835936,
       "step": 1375
     },
     {
       "epoch": 4.590163934426229,
+      "grad_norm": 0.07883958518505096,
       "learning_rate": 1.8353969410050983e-06,
+      "loss": 0.041749396324157716,
       "step": 1400
     },
     {
       "epoch": 4.672131147540983,
+      "grad_norm": 11.786356925964355,
       "learning_rate": 1.4712308812818645e-06,
+      "loss": 0.046635646820068356,
       "step": 1425
     },
     {
       "epoch": 4.754098360655737,
+      "grad_norm": 3.003070831298828,
       "learning_rate": 1.1070648215586309e-06,
+      "loss": 0.07572799682617187,
       "step": 1450
     },
     {
       "epoch": 4.836065573770492,
+      "grad_norm": 0.07770609855651855,
       "learning_rate": 7.428987618353969e-07,
+      "loss": 0.01340787172317505,
       "step": 1475
     },
     {
       "epoch": 4.918032786885246,
+      "grad_norm": 0.022642159834504128,
       "learning_rate": 3.787327021121632e-07,
+      "loss": 0.05259611129760742,
       "step": 1500
     },
     {
       "epoch": 5.0,
+      "grad_norm": 0.03238137811422348,
       "learning_rate": 1.4566642388929353e-08,
+      "loss": 0.058229475021362304,
       "step": 1525
     },
     {
       "epoch": 5.0,
+      "eval_accuracy": 0.9703476482617587,
+      "eval_f1": 0.8220858895705522,
+      "eval_loss": 0.1447911262512207,
+      "eval_precision": 0.7976190476190477,
+      "eval_recall": 0.8481012658227848,
+      "eval_roc_auc": 0.9660241337069317,
+      "eval_runtime": 3.4435,
+      "eval_samples_per_second": 284.012,
+      "eval_steps_per_second": 9.002,
       "step": 1525
     }
   ],
         "early_stopping_threshold": 0.0
       },
       "attributes": {
+        "early_stopping_patience_counter": 2
       }
     },
     "TrainerControl": {

transformer/checkpoint-305/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dc535e56030653cadcb705a1e64ce0106ee66296624c8cf834067d6d2304bad5
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:1873e7a4c6babfc7c2968fb8d3cedcd6f4aef898980615480787cebcf2a5dfd8
 size 1112205008

transformer/checkpoint-305/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:69cab210a194b5994a26d901bb7bc3744ea997681fc2128ced977095df0f3d95
 size 2224532875

 version https://git-lfs.github.com/spec/v1
+oid sha256:949dd09537449c52151556c073e03cd92355984ac38ba4355f4be6a7c633b13e
 size 2224532875

transformer/checkpoint-305/scaler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4804ba23baab7f91aff02de6948cc425203dfa59580fa70cc5f769dc34b74cb
 size 1383

 version https://git-lfs.github.com/spec/v1
+oid sha256:5b7065a647ca661fe79bdd011de3c790f0dbd92072446af4dc14ee2adc84bb56
 size 1383

transformer/checkpoint-305/trainer_state.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "best_global_step": 305,
-  "best_metric": 0.7741935483870968,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-305",
   "epoch": 1.0,
   "eval_steps": 500,
@@ -11,99 +11,99 @@
   "log_history": [
     {
       "epoch": 0.08196721311475409,
-      "grad_norm": Infinity,
       "learning_rate": 3.157894736842105e-06,
-      "loss": 0.7012384033203125,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
-      "grad_norm": 11.968428611755371,
       "learning_rate": 6.447368421052632e-06,
-      "loss": 0.4254766845703125,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
-      "grad_norm": 19.943565368652344,
       "learning_rate": 9.736842105263159e-06,
-      "loss": 0.3554811859130859,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
-      "grad_norm": 5.117671489715576,
       "learning_rate": 1.3026315789473684e-05,
-      "loss": 0.3145046615600586,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
-      "grad_norm": 11.402164459228516,
       "learning_rate": 1.6315789473684213e-05,
-      "loss": 0.2773847770690918,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
-      "grad_norm": 8.077093124389648,
       "learning_rate": 1.960526315789474e-05,
-      "loss": 0.2556156539916992,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
-      "grad_norm": 2.0055508613586426,
       "learning_rate": 1.9679533867443555e-05,
-      "loss": 0.25026893615722656,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
-      "grad_norm": 0.9293265342712402,
       "learning_rate": 1.9315367807720323e-05,
-      "loss": 0.24691852569580078,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
-      "grad_norm": 1.4303064346313477,
       "learning_rate": 1.8951201747997088e-05,
-      "loss": 0.21208112716674804,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
-      "grad_norm": 4.035928249359131,
       "learning_rate": 1.8587035688273852e-05,
-      "loss": 0.17017290115356445,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
-      "grad_norm": 4.241303443908691,
       "learning_rate": 1.822286962855062e-05,
-      "loss": 0.16115386962890624,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
-      "grad_norm": 0.3990240693092346,
       "learning_rate": 1.7858703568827385e-05,
-      "loss": 0.18343988418579102,
       "step": 300
     },
     {
       "epoch": 1.0,
-      "eval_accuracy": 0.9642126789366053,
-      "eval_f1": 0.7741935483870968,
-      "eval_loss": 0.14546315371990204,
-      "eval_precision": 0.7894736842105263,
-      "eval_recall": 0.759493670886076,
-      "eval_roc_auc": 0.9131946888948339,
-      "eval_runtime": 3.5921,
-      "eval_samples_per_second": 272.266,
-      "eval_steps_per_second": 8.63,
       "step": 305
     }
   ],

 {
   "best_global_step": 305,
+  "best_metric": 0.7721518987341772,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-305",
   "epoch": 1.0,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.08196721311475409,
+      "grad_norm": 6.055062770843506,
       "learning_rate": 3.157894736842105e-06,
+      "loss": 0.62972900390625,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
+      "grad_norm": 10.6914701461792,
       "learning_rate": 6.447368421052632e-06,
+      "loss": 0.44850738525390627,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
+      "grad_norm": 6.670228481292725,
       "learning_rate": 9.736842105263159e-06,
+      "loss": 0.3566379165649414,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
+      "grad_norm": 2.589911937713623,
       "learning_rate": 1.3026315789473684e-05,
+      "loss": 0.2718839645385742,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
+      "grad_norm": 22.02676773071289,
       "learning_rate": 1.6315789473684213e-05,
+      "loss": 0.1922766876220703,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
+      "grad_norm": 2.6362855434417725,
       "learning_rate": 1.960526315789474e-05,
+      "loss": 0.1837622833251953,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
+      "grad_norm": 3.478484630584717,
       "learning_rate": 1.9679533867443555e-05,
+      "loss": 0.18766048431396484,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
+      "grad_norm": 8.077605247497559,
       "learning_rate": 1.9315367807720323e-05,
+      "loss": 0.23830581665039063,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
+      "grad_norm": 0.7427046298980713,
       "learning_rate": 1.8951201747997088e-05,
+      "loss": 0.30742517471313474,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
+      "grad_norm": 36.34975051879883,
       "learning_rate": 1.8587035688273852e-05,
+      "loss": 0.22336017608642578,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
+      "grad_norm": 5.215510845184326,
       "learning_rate": 1.822286962855062e-05,
+      "loss": 0.13779294967651368,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
+      "grad_norm": 3.551121950149536,
       "learning_rate": 1.7858703568827385e-05,
+      "loss": 0.19200111389160157,
       "step": 300
     },
     {
       "epoch": 1.0,
+      "eval_accuracy": 0.9631901840490797,
+      "eval_f1": 0.7721518987341772,
+      "eval_loss": 0.1292734444141388,
+      "eval_precision": 0.7721518987341772,
+      "eval_recall": 0.7721518987341772,
+      "eval_roc_auc": 0.9563720589684741,
+      "eval_runtime": 3.3396,
+      "eval_samples_per_second": 292.853,
+      "eval_steps_per_second": 9.283,
       "step": 305
     }
   ],

transformer/checkpoint-610/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7732d8cc10495269abb42e8c1bbdcf1de4f88f55ef74140711a8f112b4bf271e
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:8c2e8002a8b39d6b2b729d256b3d4cff3d522204ecb453b2bd5c433f9bd4944f
 size 1112205008

transformer/checkpoint-610/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:14f0aa6126dd6cda43941781c125a8571de85d7824445cbdb13fba8b9ae327e9
 size 2224532875

 version https://git-lfs.github.com/spec/v1
+oid sha256:36fd23804b528193a6fc5999a821ef8809fb31a6efd5c61fe763007795ad7dff
 size 2224532875

transformer/checkpoint-610/scaler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e3c3655285f7ee33ffd38e6285ffefb860743212f30b81fe2645995678e63f71
 size 1383

 version https://git-lfs.github.com/spec/v1
+oid sha256:50d9d499a5525a1f496c3b9a272dbba833f43becb5d780497724ade85d68372c
 size 1383

transformer/checkpoint-610/trainer_state.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "best_global_step": 305,
-  "best_metric": 0.7741935483870968,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-305",
   "epoch": 2.0,
   "eval_steps": 500,
@@ -11,196 +11,196 @@
   "log_history": [
     {
       "epoch": 0.08196721311475409,
-      "grad_norm": Infinity,
       "learning_rate": 3.157894736842105e-06,
-      "loss": 0.7012384033203125,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
-      "grad_norm": 11.968428611755371,
       "learning_rate": 6.447368421052632e-06,
-      "loss": 0.4254766845703125,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
-      "grad_norm": 19.943565368652344,
       "learning_rate": 9.736842105263159e-06,
-      "loss": 0.3554811859130859,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
-      "grad_norm": 5.117671489715576,
       "learning_rate": 1.3026315789473684e-05,
-      "loss": 0.3145046615600586,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
-      "grad_norm": 11.402164459228516,
       "learning_rate": 1.6315789473684213e-05,
-      "loss": 0.2773847770690918,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
-      "grad_norm": 8.077093124389648,
       "learning_rate": 1.960526315789474e-05,
-      "loss": 0.2556156539916992,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
-      "grad_norm": 2.0055508613586426,
       "learning_rate": 1.9679533867443555e-05,
-      "loss": 0.25026893615722656,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
-      "grad_norm": 0.9293265342712402,
       "learning_rate": 1.9315367807720323e-05,
-      "loss": 0.24691852569580078,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
-      "grad_norm": 1.4303064346313477,
       "learning_rate": 1.8951201747997088e-05,
-      "loss": 0.21208112716674804,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
-      "grad_norm": 4.035928249359131,
       "learning_rate": 1.8587035688273852e-05,
-      "loss": 0.17017290115356445,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
-      "grad_norm": 4.241303443908691,
       "learning_rate": 1.822286962855062e-05,
-      "loss": 0.16115386962890624,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
-      "grad_norm": 0.3990240693092346,
       "learning_rate": 1.7858703568827385e-05,
-      "loss": 0.18343988418579102,
       "step": 300
     },
     {
       "epoch": 1.0,
-      "eval_accuracy": 0.9642126789366053,
-      "eval_f1": 0.7741935483870968,
-      "eval_loss": 0.14546315371990204,
-      "eval_precision": 0.7894736842105263,
-      "eval_recall": 0.759493670886076,
-      "eval_roc_auc": 0.9131946888948339,
-      "eval_runtime": 3.5921,
-      "eval_samples_per_second": 272.266,
-      "eval_steps_per_second": 8.63,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
-      "grad_norm": 4.865096569061279,
       "learning_rate": 1.7494537509104153e-05,
-      "loss": 0.19335979461669922,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
-      "grad_norm": 1.9780689477920532,
       "learning_rate": 1.7130371449380918e-05,
-      "loss": 0.2344082260131836,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
-      "grad_norm": 1.3759413957595825,
       "learning_rate": 1.6766205389657686e-05,
-      "loss": 0.20309404373168946,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
-      "grad_norm": 0.30811628699302673,
       "learning_rate": 1.640203932993445e-05,
-      "loss": 0.224365234375,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
-      "grad_norm": 5.530014514923096,
       "learning_rate": 1.603787327021122e-05,
-      "loss": 0.14759160995483397,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
-      "grad_norm": 3.7750189304351807,
       "learning_rate": 1.5673707210487983e-05,
-      "loss": 0.14137668609619142,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
-      "grad_norm": 1.8209186792373657,
       "learning_rate": 1.530954115076475e-05,
-      "loss": 0.19394855499267577,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
-      "grad_norm": 0.4824683368206024,
       "learning_rate": 1.4945375091041516e-05,
-      "loss": 0.1700056266784668,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
-      "grad_norm": 0.5682937502861023,
       "learning_rate": 1.4581209031318282e-05,
-      "loss": 0.17243267059326173,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
-      "grad_norm": 2.2086634635925293,
       "learning_rate": 1.4217042971595047e-05,
-      "loss": 0.15430424690246583,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
-      "grad_norm": 6.93908166885376,
       "learning_rate": 1.3852876911871815e-05,
-      "loss": 0.10752416610717773,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
-      "grad_norm": 5.092395782470703,
       "learning_rate": 1.3488710852148582e-05,
-      "loss": 0.21721889495849608,
       "step": 600
     },
     {
       "epoch": 2.0,
-      "eval_accuracy": 0.9570552147239264,
-      "eval_f1": 0.7692307692307693,
-      "eval_loss": 0.12331932783126831,
-      "eval_precision": 0.6796116504854369,
-      "eval_recall": 0.8860759493670886,
-      "eval_roc_auc": 0.9679672209628138,
-      "eval_runtime": 3.5595,
-      "eval_samples_per_second": 274.754,
-      "eval_steps_per_second": 8.709,
       "step": 610
     }
   ],

 {
   "best_global_step": 305,
+  "best_metric": 0.7721518987341772,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-305",
   "epoch": 2.0,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.08196721311475409,
+      "grad_norm": 6.055062770843506,
       "learning_rate": 3.157894736842105e-06,
+      "loss": 0.62972900390625,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
+      "grad_norm": 10.6914701461792,
       "learning_rate": 6.447368421052632e-06,
+      "loss": 0.44850738525390627,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
+      "grad_norm": 6.670228481292725,
       "learning_rate": 9.736842105263159e-06,
+      "loss": 0.3566379165649414,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
+      "grad_norm": 2.589911937713623,
       "learning_rate": 1.3026315789473684e-05,
+      "loss": 0.2718839645385742,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
+      "grad_norm": 22.02676773071289,
       "learning_rate": 1.6315789473684213e-05,
+      "loss": 0.1922766876220703,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
+      "grad_norm": 2.6362855434417725,
       "learning_rate": 1.960526315789474e-05,
+      "loss": 0.1837622833251953,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
+      "grad_norm": 3.478484630584717,
       "learning_rate": 1.9679533867443555e-05,
+      "loss": 0.18766048431396484,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
+      "grad_norm": 8.077605247497559,
       "learning_rate": 1.9315367807720323e-05,
+      "loss": 0.23830581665039063,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
+      "grad_norm": 0.7427046298980713,
       "learning_rate": 1.8951201747997088e-05,
+      "loss": 0.30742517471313474,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
+      "grad_norm": 36.34975051879883,
       "learning_rate": 1.8587035688273852e-05,
+      "loss": 0.22336017608642578,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
+      "grad_norm": 5.215510845184326,
       "learning_rate": 1.822286962855062e-05,
+      "loss": 0.13779294967651368,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
+      "grad_norm": 3.551121950149536,
       "learning_rate": 1.7858703568827385e-05,
+      "loss": 0.19200111389160157,
       "step": 300
     },
     {
       "epoch": 1.0,
+      "eval_accuracy": 0.9631901840490797,
+      "eval_f1": 0.7721518987341772,
+      "eval_loss": 0.1292734444141388,
+      "eval_precision": 0.7721518987341772,
+      "eval_recall": 0.7721518987341772,
+      "eval_roc_auc": 0.9563720589684741,
+      "eval_runtime": 3.3396,
+      "eval_samples_per_second": 292.853,
+      "eval_steps_per_second": 9.283,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
+      "grad_norm": 0.5402449369430542,
       "learning_rate": 1.7494537509104153e-05,
+      "loss": 0.1241053295135498,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
+      "grad_norm": 4.476892948150635,
       "learning_rate": 1.7130371449380918e-05,
+      "loss": 0.20724605560302733,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
+      "grad_norm": 0.46729782223701477,
       "learning_rate": 1.6766205389657686e-05,
+      "loss": 0.13567353248596192,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
+      "grad_norm": 0.1852118819952011,
       "learning_rate": 1.640203932993445e-05,
+      "loss": 0.13295170783996582,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
+      "grad_norm": 1.2681413888931274,
       "learning_rate": 1.603787327021122e-05,
+      "loss": 0.2027936363220215,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
+      "grad_norm": 7.484091281890869,
       "learning_rate": 1.5673707210487983e-05,
+      "loss": 0.12364128112792969,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
+      "grad_norm": 0.46489500999450684,
       "learning_rate": 1.530954115076475e-05,
+      "loss": 0.14407362937927246,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
+      "grad_norm": 0.20967872440814972,
       "learning_rate": 1.4945375091041516e-05,
+      "loss": 0.12458925247192383,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
+      "grad_norm": 0.1643747240304947,
       "learning_rate": 1.4581209031318282e-05,
+      "loss": 0.21631996154785157,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
+      "grad_norm": 7.073329448699951,
       "learning_rate": 1.4217042971595047e-05,
+      "loss": 0.16043865203857421,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
+      "grad_norm": 1.744958758354187,
       "learning_rate": 1.3852876911871815e-05,
+      "loss": 0.0966644287109375,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
+      "grad_norm": 12.79035472869873,
       "learning_rate": 1.3488710852148582e-05,
+      "loss": 0.15884541511535644,
       "step": 600
     },
     {
       "epoch": 2.0,
+      "eval_accuracy": 0.9611451942740287,
+      "eval_f1": 0.7432432432432432,
+      "eval_loss": 0.13287827372550964,
+      "eval_precision": 0.7971014492753623,
+      "eval_recall": 0.6962025316455697,
+      "eval_roc_auc": 0.9594697343039381,
+      "eval_runtime": 3.2739,
+      "eval_samples_per_second": 298.727,
+      "eval_steps_per_second": 9.469,
       "step": 610
     }
   ],

transformer/checkpoint-915/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:250827bf49cc2428027e7841d0a3f651c274b7917ed1fbc9c81196269698a974
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:49a18c813f49f0f53eef5e1646a8e80f88eb366c956b09301312f1a23e9fe977
 size 1112205008

transformer/checkpoint-915/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a2fa7d37127b9b7cd1466132b3107ea1dd648d688b50858d287773be798818f6
 size 2224532875

 version https://git-lfs.github.com/spec/v1
+oid sha256:b2d30dd11d7303659faa787633c2b391e942f0078040c8d866d78e57de1a65f7
 size 2224532875

transformer/checkpoint-915/scaler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22f7d99f4415d4a26286d13914edfc607115fbcba314cd2c896416d1ea8f5425
 size 1383

 version https://git-lfs.github.com/spec/v1
+oid sha256:680cfaac80453c6e2276a5eeef2888cb64cee094a7610d7db58bd53646d2351a
 size 1383

transformer/checkpoint-915/trainer_state.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "best_global_step": 915,
-  "best_metric": 0.8143712574850299,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-915",
   "epoch": 3.0,
   "eval_steps": 500,
@@ -11,293 +11,293 @@
   "log_history": [
     {
       "epoch": 0.08196721311475409,
-      "grad_norm": Infinity,
       "learning_rate": 3.157894736842105e-06,
-      "loss": 0.7012384033203125,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
-      "grad_norm": 11.968428611755371,
       "learning_rate": 6.447368421052632e-06,
-      "loss": 0.4254766845703125,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
-      "grad_norm": 19.943565368652344,
       "learning_rate": 9.736842105263159e-06,
-      "loss": 0.3554811859130859,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
-      "grad_norm": 5.117671489715576,
       "learning_rate": 1.3026315789473684e-05,
-      "loss": 0.3145046615600586,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
-      "grad_norm": 11.402164459228516,
       "learning_rate": 1.6315789473684213e-05,
-      "loss": 0.2773847770690918,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
-      "grad_norm": 8.077093124389648,
       "learning_rate": 1.960526315789474e-05,
-      "loss": 0.2556156539916992,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
-      "grad_norm": 2.0055508613586426,
       "learning_rate": 1.9679533867443555e-05,
-      "loss": 0.25026893615722656,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
-      "grad_norm": 0.9293265342712402,
       "learning_rate": 1.9315367807720323e-05,
-      "loss": 0.24691852569580078,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
-      "grad_norm": 1.4303064346313477,
       "learning_rate": 1.8951201747997088e-05,
-      "loss": 0.21208112716674804,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
-      "grad_norm": 4.035928249359131,
       "learning_rate": 1.8587035688273852e-05,
-      "loss": 0.17017290115356445,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
-      "grad_norm": 4.241303443908691,
       "learning_rate": 1.822286962855062e-05,
-      "loss": 0.16115386962890624,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
-      "grad_norm": 0.3990240693092346,
       "learning_rate": 1.7858703568827385e-05,
-      "loss": 0.18343988418579102,
       "step": 300
     },
     {
       "epoch": 1.0,
-      "eval_accuracy": 0.9642126789366053,
-      "eval_f1": 0.7741935483870968,
-      "eval_loss": 0.14546315371990204,
-      "eval_precision": 0.7894736842105263,
-      "eval_recall": 0.759493670886076,
-      "eval_roc_auc": 0.9131946888948339,
-      "eval_runtime": 3.5921,
-      "eval_samples_per_second": 272.266,
-      "eval_steps_per_second": 8.63,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
-      "grad_norm": 4.865096569061279,
       "learning_rate": 1.7494537509104153e-05,
-      "loss": 0.19335979461669922,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
-      "grad_norm": 1.9780689477920532,
       "learning_rate": 1.7130371449380918e-05,
-      "loss": 0.2344082260131836,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
-      "grad_norm": 1.3759413957595825,
       "learning_rate": 1.6766205389657686e-05,
-      "loss": 0.20309404373168946,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
-      "grad_norm": 0.30811628699302673,
       "learning_rate": 1.640203932993445e-05,
-      "loss": 0.224365234375,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
-      "grad_norm": 5.530014514923096,
       "learning_rate": 1.603787327021122e-05,
-      "loss": 0.14759160995483397,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
-      "grad_norm": 3.7750189304351807,
       "learning_rate": 1.5673707210487983e-05,
-      "loss": 0.14137668609619142,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
-      "grad_norm": 1.8209186792373657,
       "learning_rate": 1.530954115076475e-05,
-      "loss": 0.19394855499267577,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
-      "grad_norm": 0.4824683368206024,
       "learning_rate": 1.4945375091041516e-05,
-      "loss": 0.1700056266784668,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
-      "grad_norm": 0.5682937502861023,
       "learning_rate": 1.4581209031318282e-05,
-      "loss": 0.17243267059326173,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
-      "grad_norm": 2.2086634635925293,
       "learning_rate": 1.4217042971595047e-05,
-      "loss": 0.15430424690246583,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
-      "grad_norm": 6.93908166885376,
       "learning_rate": 1.3852876911871815e-05,
-      "loss": 0.10752416610717773,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
-      "grad_norm": 5.092395782470703,
       "learning_rate": 1.3488710852148582e-05,
-      "loss": 0.21721889495849608,
       "step": 600
     },
     {
       "epoch": 2.0,
-      "eval_accuracy": 0.9570552147239264,
-      "eval_f1": 0.7692307692307693,
-      "eval_loss": 0.12331932783126831,
-      "eval_precision": 0.6796116504854369,
-      "eval_recall": 0.8860759493670886,
-      "eval_roc_auc": 0.9679672209628138,
-      "eval_runtime": 3.5595,
-      "eval_samples_per_second": 274.754,
-      "eval_steps_per_second": 8.709,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
-      "grad_norm": 12.040640830993652,
       "learning_rate": 1.3124544792425346e-05,
-      "loss": 0.11267939567565918,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
-      "grad_norm": 0.33291733264923096,
       "learning_rate": 1.2760378732702113e-05,
-      "loss": 0.16029356002807618,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
-      "grad_norm": 0.1562187671661377,
       "learning_rate": 1.239621267297888e-05,
-      "loss": 0.13354766845703125,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
-      "grad_norm": 0.39492854475975037,
       "learning_rate": 1.2032046613255645e-05,
-      "loss": 0.08748809814453125,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
-      "grad_norm": 0.22857463359832764,
       "learning_rate": 1.1667880553532412e-05,
-      "loss": 0.1255797290802002,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
-      "grad_norm": 42.23853302001953,
       "learning_rate": 1.1303714493809176e-05,
-      "loss": 0.09398910522460938,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
-      "grad_norm": 9.628519058227539,
       "learning_rate": 1.0939548434085944e-05,
-      "loss": 0.12067486763000489,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
-      "grad_norm": 8.281865119934082,
       "learning_rate": 1.057538237436271e-05,
-      "loss": 0.0960771656036377,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
-      "grad_norm": 0.2366073578596115,
       "learning_rate": 1.0211216314639475e-05,
-      "loss": 0.1477354335784912,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
-      "grad_norm": 2.127614974975586,
       "learning_rate": 9.847050254916243e-06,
-      "loss": 0.12143749237060547,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
-      "grad_norm": 0.1283058375120163,
       "learning_rate": 9.482884195193008e-06,
-      "loss": 0.0978905963897705,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
-      "grad_norm": 0.16377978026866913,
       "learning_rate": 9.118718135469774e-06,
-      "loss": 0.13501665115356445,
       "step": 900
     },
     {
       "epoch": 3.0,
-      "eval_accuracy": 0.9683026584867076,
-      "eval_f1": 0.8143712574850299,
-      "eval_loss": 0.11399859189987183,
-      "eval_precision": 0.7727272727272727,
-      "eval_recall": 0.8607594936708861,
-      "eval_roc_auc": 0.9685867560299067,
-      "eval_runtime": 3.583,
-      "eval_samples_per_second": 272.954,
-      "eval_steps_per_second": 8.652,
       "step": 915
     }
   ],

 {
   "best_global_step": 915,
+  "best_metric": 0.8220858895705522,
   "best_model_checkpoint": "/content/agri-utilization-classifier/transformer/checkpoint-915",
   "epoch": 3.0,
   "eval_steps": 500,
   "log_history": [
     {
       "epoch": 0.08196721311475409,
+      "grad_norm": 6.055062770843506,
       "learning_rate": 3.157894736842105e-06,
+      "loss": 0.62972900390625,
       "step": 25
     },
     {
       "epoch": 0.16393442622950818,
+      "grad_norm": 10.6914701461792,
       "learning_rate": 6.447368421052632e-06,
+      "loss": 0.44850738525390627,
       "step": 50
     },
     {
       "epoch": 0.2459016393442623,
+      "grad_norm": 6.670228481292725,
       "learning_rate": 9.736842105263159e-06,
+      "loss": 0.3566379165649414,
       "step": 75
     },
     {
       "epoch": 0.32786885245901637,
+      "grad_norm": 2.589911937713623,
       "learning_rate": 1.3026315789473684e-05,
+      "loss": 0.2718839645385742,
       "step": 100
     },
     {
       "epoch": 0.4098360655737705,
+      "grad_norm": 22.02676773071289,
       "learning_rate": 1.6315789473684213e-05,
+      "loss": 0.1922766876220703,
       "step": 125
     },
     {
       "epoch": 0.4918032786885246,
+      "grad_norm": 2.6362855434417725,
       "learning_rate": 1.960526315789474e-05,
+      "loss": 0.1837622833251953,
       "step": 150
     },
     {
       "epoch": 0.5737704918032787,
+      "grad_norm": 3.478484630584717,
       "learning_rate": 1.9679533867443555e-05,
+      "loss": 0.18766048431396484,
       "step": 175
     },
     {
       "epoch": 0.6557377049180327,
+      "grad_norm": 8.077605247497559,
       "learning_rate": 1.9315367807720323e-05,
+      "loss": 0.23830581665039063,
       "step": 200
     },
     {
       "epoch": 0.7377049180327869,
+      "grad_norm": 0.7427046298980713,
       "learning_rate": 1.8951201747997088e-05,
+      "loss": 0.30742517471313474,
       "step": 225
     },
     {
       "epoch": 0.819672131147541,
+      "grad_norm": 36.34975051879883,
       "learning_rate": 1.8587035688273852e-05,
+      "loss": 0.22336017608642578,
       "step": 250
     },
     {
       "epoch": 0.9016393442622951,
+      "grad_norm": 5.215510845184326,
       "learning_rate": 1.822286962855062e-05,
+      "loss": 0.13779294967651368,
       "step": 275
     },
     {
       "epoch": 0.9836065573770492,
+      "grad_norm": 3.551121950149536,
       "learning_rate": 1.7858703568827385e-05,
+      "loss": 0.19200111389160157,
       "step": 300
     },
     {
       "epoch": 1.0,
+      "eval_accuracy": 0.9631901840490797,
+      "eval_f1": 0.7721518987341772,
+      "eval_loss": 0.1292734444141388,
+      "eval_precision": 0.7721518987341772,
+      "eval_recall": 0.7721518987341772,
+      "eval_roc_auc": 0.9563720589684741,
+      "eval_runtime": 3.3396,
+      "eval_samples_per_second": 292.853,
+      "eval_steps_per_second": 9.283,
       "step": 305
     },
     {
       "epoch": 1.0655737704918034,
+      "grad_norm": 0.5402449369430542,
       "learning_rate": 1.7494537509104153e-05,
+      "loss": 0.1241053295135498,
       "step": 325
     },
     {
       "epoch": 1.1475409836065573,
+      "grad_norm": 4.476892948150635,
       "learning_rate": 1.7130371449380918e-05,
+      "loss": 0.20724605560302733,
       "step": 350
     },
     {
       "epoch": 1.2295081967213115,
+      "grad_norm": 0.46729782223701477,
       "learning_rate": 1.6766205389657686e-05,
+      "loss": 0.13567353248596192,
       "step": 375
     },
     {
       "epoch": 1.3114754098360657,
+      "grad_norm": 0.1852118819952011,
       "learning_rate": 1.640203932993445e-05,
+      "loss": 0.13295170783996582,
       "step": 400
     },
     {
       "epoch": 1.3934426229508197,
+      "grad_norm": 1.2681413888931274,
       "learning_rate": 1.603787327021122e-05,
+      "loss": 0.2027936363220215,
       "step": 425
     },
     {
       "epoch": 1.4754098360655736,
+      "grad_norm": 7.484091281890869,
       "learning_rate": 1.5673707210487983e-05,
+      "loss": 0.12364128112792969,
       "step": 450
     },
     {
       "epoch": 1.5573770491803278,
+      "grad_norm": 0.46489500999450684,
       "learning_rate": 1.530954115076475e-05,
+      "loss": 0.14407362937927246,
       "step": 475
     },
     {
       "epoch": 1.639344262295082,
+      "grad_norm": 0.20967872440814972,
       "learning_rate": 1.4945375091041516e-05,
+      "loss": 0.12458925247192383,
       "step": 500
     },
     {
       "epoch": 1.721311475409836,
+      "grad_norm": 0.1643747240304947,
       "learning_rate": 1.4581209031318282e-05,
+      "loss": 0.21631996154785157,
       "step": 525
     },
     {
       "epoch": 1.8032786885245902,
+      "grad_norm": 7.073329448699951,
       "learning_rate": 1.4217042971595047e-05,
+      "loss": 0.16043865203857421,
       "step": 550
     },
     {
       "epoch": 1.8852459016393444,
+      "grad_norm": 1.744958758354187,
       "learning_rate": 1.3852876911871815e-05,
+      "loss": 0.0966644287109375,
       "step": 575
     },
     {
       "epoch": 1.9672131147540983,
+      "grad_norm": 12.79035472869873,
       "learning_rate": 1.3488710852148582e-05,
+      "loss": 0.15884541511535644,
       "step": 600
     },
     {
       "epoch": 2.0,
+      "eval_accuracy": 0.9611451942740287,
+      "eval_f1": 0.7432432432432432,
+      "eval_loss": 0.13287827372550964,
+      "eval_precision": 0.7971014492753623,
+      "eval_recall": 0.6962025316455697,
+      "eval_roc_auc": 0.9594697343039381,
+      "eval_runtime": 3.2739,
+      "eval_samples_per_second": 298.727,
+      "eval_steps_per_second": 9.469,
       "step": 610
     },
     {
       "epoch": 2.0491803278688523,
+      "grad_norm": 17.520444869995117,
       "learning_rate": 1.3124544792425346e-05,
+      "loss": 0.08896012306213379,
       "step": 625
     },
     {
       "epoch": 2.1311475409836067,
+      "grad_norm": 0.16623224318027496,
       "learning_rate": 1.2760378732702113e-05,
+      "loss": 0.11752216339111328,
       "step": 650
     },
     {
       "epoch": 2.2131147540983607,
+      "grad_norm": 0.20762814581394196,
       "learning_rate": 1.239621267297888e-05,
+      "loss": 0.1193038272857666,
       "step": 675
     },
     {
       "epoch": 2.2950819672131146,
+      "grad_norm": 0.1500111073255539,
       "learning_rate": 1.2032046613255645e-05,
+      "loss": 0.0630855655670166,
       "step": 700
     },
     {
       "epoch": 2.3770491803278686,
+      "grad_norm": 0.17727839946746826,
       "learning_rate": 1.1667880553532412e-05,
+      "loss": 0.08730959892272949,
       "step": 725
     },
     {
       "epoch": 2.459016393442623,
+      "grad_norm": 4.3997321128845215,
       "learning_rate": 1.1303714493809176e-05,
+      "loss": 0.12114215850830078,
       "step": 750
     },
     {
       "epoch": 2.540983606557377,
+      "grad_norm": 34.47224044799805,
       "learning_rate": 1.0939548434085944e-05,
+      "loss": 0.11070786476135254,
       "step": 775
     },
     {
       "epoch": 2.6229508196721314,
+      "grad_norm": 25.977081298828125,
       "learning_rate": 1.057538237436271e-05,
+      "loss": 0.10845686912536621,
       "step": 800
     },
     {
       "epoch": 2.7049180327868854,
+      "grad_norm": 0.1657736450433731,
       "learning_rate": 1.0211216314639475e-05,
+      "loss": 0.1025285530090332,
       "step": 825
     },
     {
       "epoch": 2.7868852459016393,
+      "grad_norm": 34.05498504638672,
       "learning_rate": 9.847050254916243e-06,
+      "loss": 0.07825160026550293,
       "step": 850
     },
     {
       "epoch": 2.8688524590163933,
+      "grad_norm": 0.2868161201477051,
       "learning_rate": 9.482884195193008e-06,
+      "loss": 0.12041816711425782,
       "step": 875
     },
     {
       "epoch": 2.9508196721311473,
+      "grad_norm": 0.19192977249622345,
       "learning_rate": 9.118718135469774e-06,
+      "loss": 0.08709416389465333,
       "step": 900
     },
     {
       "epoch": 3.0,
+      "eval_accuracy": 0.9703476482617587,
+      "eval_f1": 0.8220858895705522,
+      "eval_loss": 0.11163181066513062,
+      "eval_precision": 0.7976190476190477,
+      "eval_recall": 0.8481012658227848,
+      "eval_roc_auc": 0.9661086157615353,
+      "eval_runtime": 3.1733,
+      "eval_samples_per_second": 308.193,
+      "eval_steps_per_second": 9.769,
       "step": 915
     }
   ],

transformer/config.json CHANGED Viewed

@@ -31,16 +31,16 @@
   "pad_token_id": 1,
   "position_embedding_type": "absolute",
   "problem_type": "single_label_classification",
-  "threshold": 0.4999122619628906,
   "tie_word_embeddings": true,
   "transformers_version": "5.9.0",
   "type_vocab_size": 1,
   "use_cache": false,
   "validation_threshold_report": {
-    "f1": 0.8484848484848485,
-    "precision": 0.813953488372093,
-    "recall": 0.8860759493670886,
-    "threshold": 0.4999122619628906
   },
   "vocab_size": 250002
 }

   "pad_token_id": 1,
   "position_embedding_type": "absolute",
   "problem_type": "single_label_classification",
+  "threshold": 0.4710787534713745,
   "tie_word_embeddings": true,
   "transformers_version": "5.9.0",
   "type_vocab_size": 1,
   "use_cache": false,
   "validation_threshold_report": {
+    "f1": 0.829268292682927,
+    "precision": 0.8,
+    "recall": 0.8607594936708861,
+    "threshold": 0.4710787534713745
   },
   "vocab_size": 250002
 }

transformer/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:572ab7a5b2bc6140bc72ff08cc111f90496ceab40c64330fc6373973a0b6830c
 size 1112205008

 version https://git-lfs.github.com/spec/v1
+oid sha256:49a18c813f49f0f53eef5e1646a8e80f88eb366c956b09301312f1a23e9fe977
 size 1112205008

transformer/test_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff

transformer/validation_predictions.csv CHANGED Viewed

The diff for this file is too large to render. See raw diff