Spaces:

haneph033
/

ersi

Sleeping

App Files Files Community

haneph033 commited on Oct 5, 2025

Commit

2c4cdea

1 Parent(s): efc0d7e

update app first

Browse files

Files changed (4) hide show

.gitignore +140 -0
README.md +114 -5
app.py +303 -0
requirements.txt +10 -0

.gitignore ADDED Viewed

	@@ -0,0 +1,140 @@

+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+# PyInstaller
+*.manifest
+*.spec
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+.hypothesis/
+.pytest_cache/
+# Translations
+*.mo
+*.pot
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+# Flask stuff:
+instance/
+.webassets-cache
+# Scrapy stuff:
+.scrapy
+# Sphinx documentation
+docs/_build/
+# PyBuilder
+target/
+# Jupyter Notebook
+.ipynb_checkpoints
+# pyenv
+.python-version
+# celery beat schedule file
+celerybeat-schedule
+# SageMath parsed files
+*.sage.py
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+# Spyder project settings
+.spyderproject
+.spyproject
+# Rope project settings
+.ropeproject
+# mkdocs documentation
+/site
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+# Temporary files
+*.tmp
+*.temp
+temp/
+tmp/
+# Audio files
+*.mp3
+*.wav
+*.ogg
+*.m4a
+# Model files (if too large)
+*.bin
+*.safetensors
+# Hugging Face cache
+.cache/
+huggingface/
+# OS generated files
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+# IDE files
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+# Logs
+logs/
+*.log

README.md CHANGED Viewed

@@ -1,12 +1,121 @@
 ---
-title: Ersi
-emoji: 🌖
-colorFrom: yellow
 colorTo: purple
 sdk: gradio
-sdk_version: 5.49.0
 app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: Health Article Generator
+emoji: 🏥
+colorFrom: blue
 colorTo: purple
 sdk: gradio
+sdk_version: 5.43.1
 app_file: app.py
 pinned: false
 ---
+# Health Article Generator
+Aplikasi AI untuk generate artikel kesehatan menggunakan model Meta Llama 3.1 8B Instruct dengan fitur text-to-speech dan download audio MP3.
+## 🚀 Features
+- **AI Text Generation**: Menggunakan Meta Llama 3.1 8B Instruct untuk generate artikel kesehatan berkualitas tinggi
+- **Topik Kesehatan**: 20+ topik kesehatan yang dapat dipilih
+- **Customizable Length**: Pilihan panjang artikel (Pendek, Sedang, Panjang)
+- **Subtopik**: Hingga 5 subtopik opsional untuk fokus artikel
+- **Text-to-Speech**: Konversi artikel ke audio dengan Google TTS
+- **Download Audio**: Download hasil audio dalam format MP3
+- **Modern UI**: Interface yang user-friendly dengan Gradio 5.43.1
+## 📋 Requirements
+- Python 3.8+
+- CUDA (opsional, untuk GPU acceleration)
+- 8GB+ RAM (untuk model Llama 3.1 8B)
+## 🛠️ Installation
+1. Clone repository ini:
+```bash
+git clone <repository-url>
+cd ersi
+```
+2. Install dependencies:
+```bash
+pip install -r requirements.txt
+```
+3. Run aplikasi:
+```bash
+python app.py
+```
+## 🚀 Deployment ke Hugging Face Spaces
+1. Buat akun di [Hugging Face](https://huggingface.co)
+2. Buat Space baru dengan tipe "Gradio"
+3. Upload semua file ke repository Space
+4. Set environment variables jika diperlukan
+5. Space akan otomatis deploy
+### File yang diperlukan untuk deployment:
+- `app.py` - Aplikasi utama
+- `requirements.txt` - Dependencies
+- `README.md` - Dokumentasi
+- `.gitignore` - Git ignore file
+## 📖 Cara Penggunaan
+1. **Pilih Topik**: Pilih topik kesehatan dari dropdown
+2. **Set Panjang**: Pilih panjang artikel yang diinginkan
+3. **Tambah Subtopik** (Opsional): Masukkan hingga 5 subtopik untuk fokus artikel
+4. **Generate**: Klik tombol "Generate Article"
+5. **Convert to Speech**: Klik "Convert to Speech" untuk generate audio
+6. **Download**: Download file MP3 yang dihasilkan
+## 🎯 Topik Kesehatan yang Tersedia
+- Nutrisi dan Diet Sehat
+- Olahraga dan Kebugaran
+- Kesehatan Mental
+- Penyakit Jantung
+- Diabetes dan Gula Darah
+- Kesehatan Pencernaan
+- Kesehatan Kulit
+- Kesehatan Mata
+- Kesehatan Gigi dan Mulut
+- Kesehatan Reproduksi
+- Kesehatan Anak
+- Kesehatan Lansia
+- Pencegahan Kanker
+- Kesehatan Tulang dan Sendi
+- Kesehatan Pernapasan
+- Kesehatan Hati
+- Kesehatan Ginjal
+- Kesehatan Saraf
+- Kesehatan Kardiovaskular
+- Kesehatan Imunitas
+## 🔧 Technical Details
+- **Model**: Meta-Llama-3.1-8B-Instruct
+- **Framework**: Gradio 5.43.1
+- **TTS Engine**: Google Text-to-Speech (gTTS)
+- **Audio Format**: MP3
+- **Language**: Indonesian
+## 📝 Notes
+- Model akan di-download otomatis saat pertama kali dijalankan
+- Untuk performa terbaik, gunakan GPU dengan CUDA
+- Audio generation membutuhkan koneksi internet untuk Google TTS
+- File audio temporary akan dihapus otomatis
+## 🤝 Contributing
+Pull requests dan suggestions sangat diterima! Untuk perubahan besar, silakan buka issue terlebih dahulu.
+## 📄 License
+MIT License - lihat file LICENSE untuk detail.
+## 🆘 Support
+Jika mengalami masalah, silakan buat issue di repository ini atau hubungi developer.

app.py ADDED Viewed

	@@ -0,0 +1,303 @@

+import gradio as gr
+import torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from gtts import gTTS
+import tempfile
+import os
+import json
+from typing import List, Optional
+class HealthTextGenerator:
+    def __init__(self):
+        self.model = None
+        self.tokenizer = None
+        self.device = "cuda" if torch.cuda.is_available() else "cpu"
+    def load_model(self):
+        """Load the Llama 3.1 8B Instruct model"""
+        if self.model is None:
+            print("Loading Llama 3.1 8B Instruct model...")
+            model_name = "meta-llama/Meta-Llama-3.1-8B-Instruct"
+            self.tokenizer = AutoTokenizer.from_pretrained(model_name)
+            self.model = AutoModelForCausalLM.from_pretrained(
+                model_name,
+                torch_dtype=torch.float16 if self.device == "cuda" else torch.float32,
+                device_map="auto" if self.device == "cuda" else None,
+                low_cpu_mem_usage=True
+            )
+            if self.device == "cpu":
+                self.model = self.model.to(self.device)
+            print("Model loaded successfully!")
+    def generate_health_text(self, topic: str, text_length: str, subtopics: List[str]) -> str:
+        """Generate health-related text based on topic and subtopics"""
+        if self.model is None:
+            self.load_model()
+        # Prepare subtopics text
+        subtopics_text = ""
+        if subtopics and any(subtopic.strip() for subtopic in subtopics):
+            valid_subtopics = [s.strip() for s in subtopics if s.strip()]
+            if valid_subtopics:
+                subtopics_text = f" dengan fokus pada: {', '.join(valid_subtopics)}"
+        # Create prompt based on text length
+        length_instructions = {
+            "Pendek (100-200 kata)": "Buatlah artikel kesehatan yang singkat dan padat",
+            "Sedang (300-500 kata)": "Buatlah artikel kesehatan yang informatif dan detail",
+            "Panjang (600-1000 kata)": "Buatlah artikel kesehatan yang komprehensif dan mendalam"
+        }
+        prompt = f"""Buatlah artikel kesehatan tentang {topic}{subtopics_text}.
+        {length_instructions.get(text_length, "Buatlah artikel kesehatan yang informatif")}.
+        Pastikan artikel:
+        - Berisi informasi yang akurat dan bermanfaat
+        - Menggunakan bahasa Indonesia yang mudah dipahami
+        - Menyertakan tips praktis jika relevan
+        - Memiliki struktur yang jelas dengan paragraf yang terorganisir
+        Artikel:"""
+        # Tokenize and generate
+        inputs = self.tokenizer(prompt, return_tensors="pt").to(self.device)
+        with torch.no_grad():
+            outputs = self.model.generate(
+                **inputs,
+                max_new_tokens=1024,
+                temperature=0.7,
+                do_sample=True,
+                pad_token_id=self.tokenizer.eos_token_id,
+                eos_token_id=self.tokenizer.eos_token_id,
+                repetition_penalty=1.1
+            )
+        # Decode the generated text
+        generated_text = self.tokenizer.decode(outputs[0], skip_special_tokens=True)
+        # Extract only the generated part (remove the prompt)
+        generated_text = generated_text[len(prompt):].strip()
+        return generated_text
+    def text_to_speech(self, text: str, language: str = "id") -> str:
+        """Convert text to speech and return the audio file path"""
+        if not text.strip():
+            return None
+        try:
+            # Create temporary file
+            with tempfile.NamedTemporaryFile(delete=False, suffix=".mp3") as tmp_file:
+                temp_path = tmp_file.name
+            # Generate speech
+            tts = gTTS(text=text, lang=language, slow=False)
+            tts.save(temp_path)
+            return temp_path
+        except Exception as e:
+            print(f"Error in text-to-speech: {e}")
+            return None
+# Initialize the generator
+generator = HealthTextGenerator()
+# Health topics
+HEALTH_TOPICS = [
+    "Nutrisi dan Diet Sehat",
+    "Olahraga dan Kebugaran",
+    "Kesehatan Mental",
+    "Penyakit Jantung",
+    "Diabetes dan Gula Darah",
+    "Kesehatan Pencernaan",
+    "Kesehatan Kulit",
+    "Kesehatan Mata",
+    "Kesehatan Gigi dan Mulut",
+    "Kesehatan Reproduksi",
+    "Kesehatan Anak",
+    "Kesehatan Lansia",
+    "Pencegahan Kanker",
+    "Kesehatan Tulang dan Sendi",
+    "Kesehatan Pernapasan",
+    "Kesehatan Hati",
+    "Kesehatan Ginjal",
+    "Kesehatan Saraf",
+    "Kesehatan Kardiovaskular",
+    "Kesehatan Imunitas"
+]
+def generate_article(topic, text_length, subtopic1, subtopic2, subtopic3, subtopic4, subtopic5):
+    """Generate health article with given parameters"""
+    subtopics = [subtopic1, subtopic2, subtopic3, subtopic4, subtopic5]
+    subtopics = [s for s in subtopics if s and s.strip()]
+    try:
+        article = generator.generate_health_text(topic, text_length, subtopics)
+        return article, None
+    except Exception as e:
+        return f"Error generating article: {str(e)}", None
+def convert_to_speech(text):
+    """Convert generated text to speech"""
+    if not text or not text.strip():
+        return None
+    try:
+        audio_path = generator.text_to_speech(text)
+        return audio_path
+    except Exception as e:
+        print(f"Error converting to speech: {e}")
+        return None
+# Create Gradio interface
+with gr.Blocks(
+    title="Health Article Generator",
+    theme=gr.themes.Soft(),
+    css="""
+    .gradio-container {
+        max-width: 1200px !important;
+        margin: auto !important;
+    }
+    .main-header {
+        text-align: center;
+        background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
+        color: white;
+        padding: 2rem;
+        border-radius: 10px;
+        margin-bottom: 2rem;
+    }
+    """
+) as app:
+    gr.HTML("""
+    <div class="main-header">
+        <h1>🏥 Health Article Generator</h1>
+        <p>Generate comprehensive health articles using AI and convert them to speech</p>
+    </div>
+    """)
+    with gr.Row():
+        with gr.Column(scale=1):
+            gr.Markdown("### ⚙️ Settings")
+            topic = gr.Dropdown(
+                choices=HEALTH_TOPICS,
+                label="Pilih Topik Kesehatan",
+                value=HEALTH_TOPICS[0],
+                interactive=True
+            )
+            text_length = gr.Radio(
+                choices=["Pendek (100-200 kata)", "Sedang (300-500 kata)", "Panjang (600-1000 kata)"],
+                label="Panjang Artikel",
+                value="Sedang (300-500 kata)",
+                interactive=True
+            )
+            gr.Markdown("### 📝 Subtopik (Opsional)")
+            gr.Markdown("Tambahkan hingga 5 subtopik untuk fokus artikel:")
+            subtopic1 = gr.Textbox(
+                label="Subtopik 1",
+                placeholder="Contoh: Tips diet sehat",
+                interactive=True
+            )
+            subtopic2 = gr.Textbox(
+                label="Subtopik 2",
+                placeholder="Contoh: Makanan yang harus dihindari",
+                interactive=True
+            )
+            subtopic3 = gr.Textbox(
+                label="Subtopik 3",
+                placeholder="Contoh: Jadwal makan yang baik",
+                interactive=True
+            )
+            subtopic4 = gr.Textbox(
+                label="Subtopik 4",
+                placeholder="Contoh: Suplemen yang direkomendasikan",
+                interactive=True
+            )
+            subtopic5 = gr.Textbox(
+                label="Subtopik 5",
+                placeholder="Contoh: Olahraga pendukung",
+                interactive=True
+            )
+            generate_btn = gr.Button(
+                "🚀 Generate Article",
+                variant="primary",
+                size="lg"
+            )
+        with gr.Column(scale=2):
+            gr.Markdown("### 📄 Generated Article")
+            article_output = gr.Textbox(
+                label="Artikel yang Dihasilkan",
+                lines=15,
+                max_lines=20,
+                interactive=False,
+                show_copy_button=True
+            )
+            with gr.Row():
+                tts_btn = gr.Button(
+                    "🔊 Convert to Speech",
+                    variant="secondary"
+                )
+                download_audio = gr.File(
+                    label="Download Audio (MP3)",
+                    visible=False
+                )
+            gr.Markdown("### 🔊 Audio Player")
+            audio_player = gr.Audio(
+                label="Generated Audio",
+                type="filepath",
+                visible=False
+            )
+    # Event handlers
+    generate_btn.click(
+        fn=generate_article,
+        inputs=[topic, text_length, subtopic1, subtopic2, subtopic3, subtopic4, subtopic5],
+        outputs=[article_output, audio_player]
+    )
+    tts_btn.click(
+        fn=convert_to_speech,
+        inputs=[article_output],
+        outputs=[audio_player]
+    )
+    # Show download button when audio is generated
+    audio_player.change(
+        fn=lambda x: gr.update(visible=True, value=x) if x else gr.update(visible=False),
+        inputs=[audio_player],
+        outputs=[download_audio]
+    )
+    # Footer
+    gr.HTML("""
+    <div style="text-align: center; margin-top: 2rem; padding: 1rem; background-color: #f8f9fa; border-radius: 10px;">
+        <p><strong>Health Article Generator</strong> - Powered by Meta Llama 3.1 8B Instruct</p>
+        <p>Generate comprehensive health articles and convert them to speech for better accessibility</p>
+    </div>
+    """)
+if __name__ == "__main__":
+    app.launch(
+        server_name="0.0.0.0",
+        server_port=7860,
+        share=False,
+        show_error=True
+    )

requirements.txt ADDED Viewed

	@@ -0,0 +1,10 @@

+gradio==5.43.1
+transformers==4.45.2
+torch==2.4.1
+accelerate==0.33.2
+sentencepiece==0.2.0
+protobuf==4.25.5
+gtts==2.5.1
+pydub==0.25.1
+requests==2.32.3
+numpy==1.26.4