v2.0: Update README.md
Browse files
README.md
CHANGED
|
@@ -1,138 +1,35 @@
|
|
| 1 |
---
|
| 2 |
-
title: Architech
|
| 3 |
emoji: 🏗️
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: purple
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version:
|
| 8 |
app_file: app.py
|
| 9 |
-
pinned:
|
| 10 |
-
license:
|
|
|
|
| 11 |
---
|
| 12 |
|
| 13 |
-
# 🏗️ Architech
|
| 14 |
|
| 15 |
-
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
##
|
| 20 |
-
- Generate high-quality training data from simple descriptions
|
| 21 |
-
- Support for multiple domains: Technology, Healthcare, Finance, Education
|
| 22 |
-
- Multiple format types: Conversational, Instruction-following
|
| 23 |
-
- 50-500 examples per dataset
|
| 24 |
|
| 25 |
-
|
| 26 |
-
- Fine-tune
|
| 27 |
-
-
|
| 28 |
-
-
|
| 29 |
-
-
|
|
|
|
| 30 |
|
| 31 |
-
##
|
| 32 |
-
- Load and test your trained models instantly
|
| 33 |
-
- Interactive inference with adjustable parameters
|
| 34 |
-
- Real-time generation with temperature and length controls
|
| 35 |
|
| 36 |
-
|
| 37 |
-
- **Rate Limiting**: Fair usage for all users
|
| 38 |
-
- Dataset Generation: 10/hour
|
| 39 |
-
- Model Training: 3/hour
|
| 40 |
-
- Model Inference: 50/hour
|
| 41 |
-
- **Token Authentication**: Secure HuggingFace integration
|
| 42 |
-
- **Error Handling**: Comprehensive error messages and recovery
|
| 43 |
|
| 44 |
-
##
|
| 45 |
|
| 46 |
-
|
| 47 |
-
1. Go to the **"Generate Dataset"** tab
|
| 48 |
-
2. Describe your task (e.g., "Customer support chatbot for tech products")
|
| 49 |
-
3. Select domain and size
|
| 50 |
-
4. Click **"Generate Dataset"**
|
| 51 |
-
|
| 52 |
-
### 2. Train Your Model
|
| 53 |
-
1. Go to the **"Train Model"** tab
|
| 54 |
-
2. Enter your model name and HuggingFace token
|
| 55 |
-
3. Choose to use synthetic data or provide your own
|
| 56 |
-
4. Click **"Train Model"**
|
| 57 |
-
5. Wait for training to complete (5-15 minutes)
|
| 58 |
-
|
| 59 |
-
### 3. Test Your Model
|
| 60 |
-
1. Go to the **"Test Model"** tab
|
| 61 |
-
2. Enter your model name and token
|
| 62 |
-
3. Click **"Load Model"**
|
| 63 |
-
4. Enter a test prompt and generate!
|
| 64 |
-
|
| 65 |
-
## 📋 Requirements
|
| 66 |
-
|
| 67 |
-
- HuggingFace account with **write** token
|
| 68 |
-
- For training: GPU recommended (CPU works but slower)
|
| 69 |
-
- Patience during training (coffee break recommended ☕)
|
| 70 |
-
|
| 71 |
-
## 🎯 Use Cases
|
| 72 |
-
|
| 73 |
-
- **Customer Support Bots**: Train chatbots for specific products/services
|
| 74 |
-
- **Content Generation**: Create domain-specific text generators
|
| 75 |
-
- **Educational Tools**: Build tutoring and explanation systems
|
| 76 |
-
- **Creative Writing**: Fine-tune for specific writing styles
|
| 77 |
-
- **Technical Documentation**: Generate code explanations and docs
|
| 78 |
-
|
| 79 |
-
## ⚙️ Technical Details
|
| 80 |
-
|
| 81 |
-
### Supported Base Models
|
| 82 |
-
- `distilgpt2` (fastest, smallest)
|
| 83 |
-
- `gpt2` (balanced)
|
| 84 |
-
- `microsoft/DialoGPT-small` (conversational)
|
| 85 |
-
|
| 86 |
-
### Training Features
|
| 87 |
-
- Gradient accumulation for memory efficiency
|
| 88 |
-
- Mixed precision training (FP16)
|
| 89 |
-
- Automatic learning rate optimization
|
| 90 |
-
- Smart tokenization and padding
|
| 91 |
-
|
| 92 |
-
### Synthetic Data Quality
|
| 93 |
-
- Domain-specific vocabulary
|
| 94 |
-
- Natural language variations
|
| 95 |
-
- Contextually relevant examples
|
| 96 |
-
- Edge case handling
|
| 97 |
-
|
| 98 |
-
## 🛠️ Troubleshooting
|
| 99 |
-
|
| 100 |
-
### "GPU Memory Overflow"
|
| 101 |
-
- Reduce batch size to 1
|
| 102 |
-
- Use smaller base model (distilgpt2)
|
| 103 |
-
- Reduce dataset size
|
| 104 |
-
|
| 105 |
-
### "Permission Denied"
|
| 106 |
-
- Check your HuggingFace token has **WRITE** access
|
| 107 |
-
- Generate new token at: https://huggingface.co/settings/tokens
|
| 108 |
-
|
| 109 |
-
### "Rate Limit Exceeded"
|
| 110 |
-
- Wait for the cooldown period
|
| 111 |
-
- Check remaining requests in error message
|
| 112 |
-
|
| 113 |
-
## 📚 Best Practices
|
| 114 |
-
|
| 115 |
-
1. **Start Small**: Begin with 100 examples and 3 epochs
|
| 116 |
-
2. **Be Specific**: Detailed task descriptions yield better results
|
| 117 |
-
3. **Test First**: Use the Test tab before deploying
|
| 118 |
-
4. **Iterate**: Train multiple versions with different parameters
|
| 119 |
-
5. **Monitor**: Watch training logs for issues
|
| 120 |
-
|
| 121 |
-
## 🤝 Contributing
|
| 122 |
-
|
| 123 |
-
Found a bug? Have a feature request? Open an issue!
|
| 124 |
-
|
| 125 |
-
## 📜 License
|
| 126 |
-
|
| 127 |
-
MIT License - feel free to use and modify!
|
| 128 |
-
|
| 129 |
-
## 🙏 Acknowledgments
|
| 130 |
-
|
| 131 |
-
Built with:
|
| 132 |
-
- [Gradio](https://gradio.app/) - Interface
|
| 133 |
-
- [Transformers](https://huggingface.co/transformers/) - Models
|
| 134 |
-
- [HuggingFace](https://huggingface.co/) - Infrastructure
|
| 135 |
-
|
| 136 |
-
---
|
| 137 |
-
|
| 138 |
-
*No PhD required. Just ideas.* ✨
|
|
|
|
| 1 |
---
|
| 2 |
+
title: Architech — CognoSphere Model Factory
|
| 3 |
emoji: 🏗️
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: purple
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: 5.23.0
|
| 8 |
app_file: app.py
|
| 9 |
+
pinned: true
|
| 10 |
+
license: apache-2.0
|
| 11 |
+
short_description: Build, train, and deploy CSUMLM-class language models
|
| 12 |
---
|
| 13 |
|
| 14 |
+
# 🏗️ Architech — CognoSphere Model Factory
|
| 15 |
|
| 16 |
+
> Build, train, and deploy CSUMLM-class language models
|
| 17 |
|
| 18 |
+
**By [Or4cl3 AI Solutions](https://github.com/or4cl3-ai-1)**
|
| 19 |
|
| 20 |
+
## Features
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
+
- 📊 **Synthetic Data Generation** — Domain-specific training data
|
| 23 |
+
- 🚀 **Model Training** — Fine-tune with LoRA on modern base models (Gemma 4, Llama 3, TinyLlama, etc.)
|
| 24 |
+
- 🧪 **Model Testing** — Interactive inference and evaluation
|
| 25 |
+
- 💾 **Model Management** — Upload, download, organize models
|
| 26 |
+
- 📄 **Documentation** — Auto-generated model cards and research papers
|
| 27 |
+
- 💬 **Repository Chat** — Manage HuggingFace repos conversationally
|
| 28 |
|
| 29 |
+
## Part of the CognoSphere CSUMLM Ecosystem
|
|
|
|
|
|
|
|
|
|
| 30 |
|
| 31 |
+
Architech is the model factory for the **CognoSphere Unified Multimodal Language Model (CSUMLM)** — a unified AI system integrating the CognoSphere Multimodal AI Engine (CSMAE) and CognoSphere Large Language Model (CSLLM).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
|
| 33 |
+
## License
|
| 34 |
|
| 35 |
+
Apache 2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|