fka/prompts.chat
Viewer • Updated • 1.82k • 54.9k • 9.69k
Loads a dataset, presumably for Tamil language processing (prompt generation or translation-related). Fine-tunes a pre-trained multilingual transformer model (MT5) on this dataset. Preprocesses the data by combining "Act" and "Prompt" fields into the model input. Trains the model using a standard training loop with a validation set. Saves the fine-tuned model. Includes a utility function to convert CSV data to JSON format for use with the training pipeline.
Base model
google/mt5-small