dpmendez commited on
Commit
156ed87
·
verified ·
1 Parent(s): 2ea677f

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -0
README.md ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - dpmendez/environmental-misinformation
4
+ language:
5
+ - en
6
+ base_model:
7
+ - distilbert/distilbert-base-uncased
8
+ ---
9
+ This model is a **DistilBERT-based transformer** fine-tuned for climate misinformation classification.
10
+ It predicts the veracity of individual climate-related claims using contextualized language representations.
11
+
12
+ The model was trained on a dataset combining:
13
+ * Climate Fever
14
+ * Science Feedback fact-checked claims
15
+
16
+ ## Model Details
17
+ * Model type: DistilBERT (distilbert-base-uncased)
18
+ * Task: Sequence classification
19
+ * Input: Single climate-related claim (text)
20
+ * Output: Claim label probabilities
21
+ * Framework: Hugging Face Transformers
22
+ * Model weights: Stored in model.safetensors
23
+
24
+ ## Labels
25
+ | Label | Description |
26
+ | ----------------- | --------------------------------------------- |
27
+ | `LIKELY_TRUE` | Claim is consistent with scientific consensus |
28
+ | `LIKELY_FALSE` | Claim contradicts scientific consensus |
29
+
30
+ Label mappings are defined in config.json and label_map.json.
31
+
32
+ ## Training Procedure
33
+ * Fine-tuned from distilbert-base-uncased
34
+ * Cross-entropy loss
35
+ * Class imbalance handled via training strategy (no oversampling)
36
+ * Inference threshold tuned post-training to decrease cost function (less false positives is better)
37
+
38
+ The selected inference threshold is stored in threshold.json.