PEAllm / README.md
jackyanghxc's picture
Update README.md
e404780 verified

A newer version of the Gradio SDK is available: 6.4.0

Upgrade
metadata
title: Thai Energy AI Ambassador
emoji: 
colorFrom: purple
colorTo: pink
sdk: gradio
python_version: '3.10'
app_file: app.py
license: mit
short_description: This is an AI ambassador for the Thai energy sector.
pinned: false
sdk_version: 5.47.2

Thai Energy AI Ambassador

This project demonstrates a secure, PDPA-compliant AI avatar that acts as a brand ambassador for the Thai energy sector. It utilizes a custom-trained Thai language model that is exclusively knowledgeable about documents from the MEA, PEA, EGAT, and other Thai energy authorities.

Project Phases

  • Phase 1: Data Pipeline: Automated web scraping of official websites to create a private and secure dataset.
  • Phase 2: LLM Training: Fine-tuning a small, efficient Thai language model on the collected data.
  • Phase 3: AI Avatar Development: Building a desktop application with a visual avatar and text-to-speech (TTS) capabilities.

How it Works

  1. Data Collection: A Zapier workflow scrapes public-facing documents and news from official Thai energy websites.
  2. Dataset Transformation: The raw data is automatically formatted into a structured dataset for model training.
  3. Model Training: The custom dataset is used to fine-tune a small language model on a Hugging Face Space.
  4. Secure Deployment: The trained model is served privately from the Hugging Face Space to a local Windows application, ensuring no data leaves the controlled environment.

Getting Started

To get started with this project, ensure you have the required files in your repository:

  • README.md (this file)
  • requirements.txt
  • app.py