vectorplasticity's picture
Add comprehensive README.md with documentation
a6d556d verified
|
Raw
History Blame Contribute Delete
2.75 kB
---
title: Universal Model Trainer
emoji: πŸš€
colorFrom: blue
colorTo: purple
sdk: docker
pinned: true
license: mit
short_description: Universal ML training dashboard for HuggingFace
tag:
- training
- machine-learning
- transformers
- pytorch
- docker
---
# πŸš€ Universal Model Trainer
A comprehensive, production-ready dashboard for training machine learning models on the HuggingFace ecosystem. Supports multiple task types, datasets, and model architectures with full user control.
## ✨ Features
### Supported Task Types
- **Causal Language Modeling** - Text generation (GPT-style)
- **Masked Language Modeling** - BERT-style pretraining
- **Sequence-to-Sequence** - Translation, summarization
- **Token Classification** - NER, POS tagging
- **Text Classification** - Sentiment, topic classification
- **Question Answering** - Extractive QA
- **Image Classification** - Vision tasks
- **Audio Classification** - Sound classification
### Training Features
- **PEFT/LoRA Support** - Memory-efficient fine-tuning
- **DeepSpeed Integration** - Distributed training
- **Quantization Options** - 4-bit, 8-bit training
- **Custom Hyperparameters** - Full control over training config
- **Real-time Progress** - Live training metrics
- **Job Queue System** - Manage multiple training jobs
- **Model Versioning** - Track experiment history
### Dataset Support
- **HuggingFace Hub** - 100,000+ datasets
- **Custom Upload** - CSV, JSON, JSONL support
- **Dataset Preprocessing** - Automatic tokenization
## πŸ”§ API Endpoints
### Training
```
POST /api/train/start - Start a new training job
GET /api/train/status - Get job status
POST /api/train/stop - Stop running job
GET /api/train/history - View training history
```
### Models & Datasets
```
GET /api/models/search - Search HuggingFace models
GET /api/datasets/search - Search HuggingFace datasets
GET /api/models/info - Get model info
```
### System
```
GET /api/system/status - System health and resources
GET /api/system/gpus - Available GPU info
```
## πŸ—οΈ Architecture
- **FastAPI Backend** - Modern async Python web framework
- **Redis Queue** - Background job processing
- **SQLite/PostgreSQL** - Job and experiment persistence
- **HuggingFace Hub** - Model and dataset hosting
## πŸ“ Configuration
Set the following environment variables:
- `HF_TOKEN` - Your HuggingFace API token (required for pushing models)
- `WANDB_API_KEY` - Weights & Biases API key (optional, for experiment tracking)
## πŸ“š Documentation
Full API documentation available at `/docs` when the Space is running.
## 🀝 Contributing
Contributions welcome! See issues for planned features.
## πŸ“„ License
MIT License - See LICENSE file for details.