Update README.md
README.md
CHANGED
@@ -117,7 +117,7 @@ This model uses **LoRA (Low-Rank Adaptation)** for parameter-efficient fine-tuni

 ### Data Curation

-Energy-labeled documents were sourced from four HuggingFace datasets (see above). Non-energy documents were sampled from a base document pipeline, with deduplication to ensure no overlap with energy documents (validated by both document ID and MD5 hash matching).
+Energy-labeled documents were sourced from four HuggingFace datasets (see above). Classification labels for the training data were created with the Mistral 3 Large model, and this classifier was distilled from that labeled data. Non-energy documents were sampled from a base document pipeline, with deduplication to ensure no overlap with energy documents (validated by both document ID and MD5 hash matching).

 ## 🎯 Use Cases

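
The overlap check mentioned in the changed paragraph (matching on both document ID and MD5 hash) could look roughly like the sketch below. It is illustrative only: the `id` and `text` field names and the `md5_of` and `remove_overlap` helpers are assumptions made for this example and do not come from the repository.

```python
import hashlib


def md5_of(text: str) -> str:
    """Return the hex MD5 digest of a document's UTF-8 encoded text."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def remove_overlap(energy_docs, candidate_docs):
    """Drop candidates that share an ID or an MD5 text hash with any energy document.

    Both arguments are lists of dicts with hypothetical "id" and "text" keys.
    """
    energy_ids = {doc["id"] for doc in energy_docs}
    energy_hashes = {md5_of(doc["text"]) for doc in energy_docs}
    return [
        doc
        for doc in candidate_docs
        if doc["id"] not in energy_ids
        and md5_of(doc["text"]) not in energy_hashes
    ]


# Toy data: one candidate collides by ID, one by content, one is clean.
energy = [{"id": "e1", "text": "Solar capacity grew last year."}]
candidates = [
    {"id": "e1", "text": "Different text, same ID."},
    {"id": "n1", "text": "Solar capacity grew last year."},
    {"id": "n2", "text": "A recipe for lentil soup."},
]
kept = remove_overlap(energy, candidates)
print([doc["id"] for doc in kept])
```

Checking both keys catches two different failure modes: the same document re-ingested under its original ID, and identical text that arrived under a new ID. MD5 only detects exact byte-level matches, so near-duplicates would need a separate fuzzy check.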