File size: 575 Bytes
0c94d57
 
 
 
 
 
 
 
c13c136
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
---
license: apache-2.0
datasets:
- fka/awesome-chatgpt-prompts
language:
- ta
base_model:
- google/mt5-small
---
Loads a dataset, presumably for Tamil language processing (prompt generation or translation-related).
Fine-tunes a pre-trained multilingual transformer model (MT5) on this dataset.
Preprocesses the data by combining "Act" and "Prompt" fields into the model input.
Trains the model using a standard training loop with a validation set.
Saves the fine-tuned model.
Includes a utility function to convert CSV data to JSON format for use with the training pipeline.