kdubovikov commited on
Commit
2f62684
·
verified ·
1 Parent(s): a6256cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -117,7 +117,7 @@ This model uses **LoRA (Low-Rank Adaptation)** for parameter-efficient fine-tuni
117
 
118
  ### Data Curation
119
 
120
- Energy-labeled documents were sourced from four HuggingFace datasets (see above). Non-energy documents were sampled from a base document pipeline, with deduplication to ensure no overlap with energy documents (validated by both document ID and MD5 hash matching).
121
 
122
  ## 🎯 Use Cases
123
 
 
117
 
118
  ### Data Curation
119
 
120
+ Energy-labeled documents were sourced from four HuggingFace datasets (see above). Classification labels for the training data were created with Mistral 3 Large model and this classifier was distilled from this data. Non-energy documents were sampled from a base document pipeline, with deduplication to ensure no overlap with energy documents (validated by both document ID and MD5 hash matching).
121
 
122
  ## 🎯 Use Cases
123