LaProfeClaudis
/

LGBeTO_detection_Model

@@ -3,6 +3,8 @@ library_name: transformers
 base_model: dccuchile/bert-base-spanish-wwm-uncased
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 - f1
@@ -11,35 +13,58 @@ metrics:
 model-index:
 - name: LGBeTO_detection_Model
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # LGBeTO_detection_Model
-This model is a fine-tuned version of [dccuchile/bert-base-spanish-wwm-uncased](https://huggingface.co/dccuchile/bert-base-spanish-wwm-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5393
 - Accuracy: 0.835
 - F1: 0.8533
 - Precision: 0.8205
 - Recall: 0.8889
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -57,7 +82,7 @@ The following hyperparameters were used during training:
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
 | 0.4655        | 1.0   | 50   | 0.5517          | 0.755    | 0.7538 | 0.8242    | 0.6944 |
 | 0.1928        | 2.0   | 100  | 0.4830          | 0.825    | 0.8523 | 0.7829    | 0.9352 |
-| 0.0718        | 3.0   | 150  | 0.5393          | 0.835    | 0.8533 | 0.8205    | 0.8889 |
 ### Framework versions
@@ -65,4 +90,4 @@ The following hyperparameters were used during training:
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
 - Datasets 3.6.0
-- Tokenizers 0.21.1

 base_model: dccuchile/bert-base-spanish-wwm-uncased
 tags:
 - generated_from_trainer
+- hatetoLGBTcomunities
+- BETO
 metrics:
 - accuracy
 - f1
 model-index:
 - name: LGBeTO_detection_Model
   results: []
+license: cc-by-4.0
+language:
+- es
+pipeline_tag: text-classification
 ---
 # LGBeTO_detection_Model
+This model is LGBeTO model. Corresponding to a fine-tuned version of [dccuchile/bert-base-spanish-wwm-uncased](https://huggingface.co/dccuchile/bert-base-spanish-wwm-uncased) (Cañete et al., 2023).
 It achieves the following results on the evaluation set:
 - Accuracy: 0.835
 - F1: 0.8533
 - Precision: 0.8205
 - Recall: 0.8889
 ## Model description
+LGBeTO was designed to detect discriminatory or hateful language directed toward the LGBTQIA+ community, aiming to support safer and more inclusive online environments.
 ## Intended uses & limitations
+This model was created for a study that was conducted strictly for academic and research purposes. The target of hate speech has been anonymised, and there is no intent to harm the perpetrators
+in any way. We prioritize protecting the privacy and confidentiality of vulnerable individuals.
+We carefully remove identifying data, such as user IDs, phone numbers, and addresses, to safeguard privacy before
+sharing the data with our annotators. All data collected comes from public sources.
+As authors, we affirm our deep respect for all individuals and explicitly state that we have no intention of prejudicing,
+biasing, or disrespecting the LGBTQIA+ community or any group. Our work seeks to contribute constructively to inclusive
+and ethical research in artificial intelligence.
 ## Training and evaluation data
+LGBeTO was fine-tuned using comments collected from digital media, such as Twitter, Instagram, websites, and YouTube comments
+The dataset is available in the Zenodo Repository.
+Cite as:
+Martínez-Araneda, C., Maldonado Montiel, D., Gutiérrez Valenzuela, M., Gómez Meneses, P., Segura Navarrete, A.,
+& Vidal-Castro, C. (2025). LGBTQIAphobia dataset (augmented and balanced) [Data set]. Zenodo.
+https://doi.org/10.5281/zenodo.15385622
 ## Training procedure
+- step 1: Load the dataSet
+- step 2: Tokenization and model generation
+- step 3: Split train-validation
+- step 4: Training configuration
+- step 5: Training/Evaluation
 ### Training hyperparameters
 The following hyperparameters were used during training:
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
 | 0.4655        | 1.0   | 50   | 0.5517          | 0.755    | 0.7538 | 0.8242    | 0.6944 |
 | 0.1928        | 2.0   | 100  | 0.4830          | 0.825    | 0.8523 | 0.7829    | 0.9352 |
+##| 0.0718        | 3.0  | 150  | 0.5393          | 0.835    | 0.8533 | 0.8205    | 0.8889 |
 ### Framework versions
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
 - Datasets 3.6.0
+- Tokenizers 0.21.1