Update README.md
Browse filesLoads a dataset, presumably for Tamil language processing (prompt generation or translation-related).
Fine-tunes a pre-trained multilingual transformer model (MT5) on this dataset.
Preprocesses the data by combining "Act" and "Prompt" fields into the model input.
Trains the model using a standard training loop with a validation set.
Saves the fine-tuned model.
Includes a utility function to convert CSV data to JSON format for use with the training pipeline.
README.md
CHANGED
|
@@ -6,4 +6,10 @@ language:
|
|
| 6 |
- ta
|
| 7 |
base_model:
|
| 8 |
- google/mt5-small
|
| 9 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
- ta
|
| 7 |
base_model:
|
| 8 |
- google/mt5-small
|
| 9 |
+
---
|
| 10 |
+
Loads a dataset, presumably for Tamil language processing (prompt generation or translation-related).
|
| 11 |
+
Fine-tunes a pre-trained multilingual transformer model (MT5) on this dataset.
|
| 12 |
+
Preprocesses the data by combining "Act" and "Prompt" fields into the model input.
|
| 13 |
+
Trains the model using a standard training loop with a validation set.
|
| 14 |
+
Saves the fine-tuned model.
|
| 15 |
+
Includes a utility function to convert CSV data to JSON format for use with the training pipeline.
|