Spaces:

vectorplasticity
/

universal-model-trainer

Sleeping

App Files Files Community

universal-model-trainer / README.md

vectorplasticity

Add comprehensive README.md with documentation

a6d556d verified 3 months ago

preview code

Raw

History Blame Contribute Delete

2.75 kB

	---
	title: Universal Model Trainer
	emoji: 🚀
	colorFrom: blue
	colorTo: purple
	sdk: docker
	pinned: true
	license: mit
	short_description: Universal ML training dashboard for HuggingFace
	tag:
	- training
	- machine-learning
	- transformers
	- pytorch
	- docker
	---

	# 🚀 Universal Model Trainer

	A comprehensive, production-ready dashboard for training machine learning models on the HuggingFace ecosystem. Supports multiple task types, datasets, and model architectures with full user control.

	## ✨ Features

	### Supported Task Types
	- Causal Language Modeling - Text generation (GPT-style)
	- Masked Language Modeling - BERT-style pretraining
	- Sequence-to-Sequence - Translation, summarization
	- Token Classification - NER, POS tagging
	- Text Classification - Sentiment, topic classification
	- Question Answering - Extractive QA
	- Image Classification - Vision tasks
	- Audio Classification - Sound classification

	### Training Features
	- PEFT/LoRA Support - Memory-efficient fine-tuning
	- DeepSpeed Integration - Distributed training
	- Quantization Options - 4-bit, 8-bit training
	- Custom Hyperparameters - Full control over training config
	- Real-time Progress - Live training metrics
	- Job Queue System - Manage multiple training jobs
	- Model Versioning - Track experiment history

	### Dataset Support
	- HuggingFace Hub - 100,000+ datasets
	- Custom Upload - CSV, JSON, JSONL support
	- Dataset Preprocessing - Automatic tokenization

	## 🔧 API Endpoints

	### Training
	```
	POST /api/train/start - Start a new training job
	GET /api/train/status - Get job status
	POST /api/train/stop - Stop running job
	GET /api/train/history - View training history
	```

	### Models & Datasets
	```
	GET /api/models/search - Search HuggingFace models
	GET /api/datasets/search - Search HuggingFace datasets
	GET /api/models/info - Get model info
	```

	### System
	```
	GET /api/system/status - System health and resources
	GET /api/system/gpus - Available GPU info
	```

	## 🏗️ Architecture

	- FastAPI Backend - Modern async Python web framework
	- Redis Queue - Background job processing
	- SQLite/PostgreSQL - Job and experiment persistence
	- HuggingFace Hub - Model and dataset hosting

	## 📝 Configuration

	Set the following environment variables:
	- `HF_TOKEN` - Your HuggingFace API token (required for pushing models)
	- `WANDB_API_KEY` - Weights & Biases API key (optional, for experiment tracking)

	## 📚 Documentation

	Full API documentation available at `/docs` when the Space is running.

	## 🤝 Contributing

	Contributions welcome! See issues for planned features.

	## 📄 License

	MIT License - See LICENSE file for details.