---
license: apache-2.0
---
# KerdosAI - Universal LLM Training Agent
[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](LICENSE)
[![Tests](https://img.shields.io/badge/tests-passing-brightgreen.svg)](tests/)
## Overview
KerdosAI is a production-ready, universal LLM training agent that streamlines training and deploying large language models. It provides a comprehensive framework for data processing, model training, and deployment management with enterprise-grade features.
### Key Features
- πŸš€ **Easy to Use**: Simple CLI and Python API
- ⚑ **Efficient Training**: LoRA and quantization support (4-bit/8-bit)
- πŸ”§ **Configurable**: YAML-based configuration with validation
- πŸ“Š **Monitoring**: W&B and TensorBoard integration
- 🐳 **Docker Ready**: Production-ready containerization
- πŸ§ͺ **Well Tested**: Comprehensive test suite with 90%+ coverage
- 🎨 **Beautiful CLI**: Rich terminal output with progress bars
- πŸ“¦ **Type Safe**: Full type hints and mypy support
## Quick Start
### Installation
```bash
# Clone repository
git clone https://github.com/bhaskarvilles/kerdosai.git
cd kerdosai
# Create virtual environment
python3 -m venv venv
source venv/bin/activate
# Install dependencies
pip install -r requirements.txt
```
### Basic Usage
```bash
# Train a model
python cli.py train \
    --model gpt2 \
    --data ./data/train.json \
    --output ./output \
    --epochs 3

# Generate text
python cli.py generate ./output \
    --prompt "Once upon a time" \
    --max-length 200
```
### Using Configuration Files
```bash
# Train with configuration
python cli.py train --config configs/default.yaml
# Validate configuration
python cli.py validate-config configs/default.yaml
```
## Architecture Overview
```mermaid
graph TD
    A[CLI/API] --> B[KerdosAgent]
    B --> C[DataProcessor]
    B --> D[Trainer]
    B --> E[Deployer]
    C --> F[Processed Data]
    D --> G[Trained Model]
    E --> H[Deployed Service]
    I[Config Manager] --> B
    J[Monitoring] --> D
    K[Checkpoint Manager] --> D
```
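The components in the diagram compose roughly as follows. This is a structural sketch only; the real classes in `agent.py`, `trainer.py`, and `deployer.py` carry far more state, and the string return values here are placeholders:

```python
class DataProcessor:
    def process(self, path: str) -> str:
        # Load and tokenize the raw training file.
        return f"processed({path})"

class Trainer:
    def train(self, data: str) -> str:
        # Run the fine-tuning loop and return a model handle.
        return f"model({data})"

class Deployer:
    def deploy(self, model: str) -> str:
        # Package the trained model behind a serving endpoint.
        return f"service({model})"

class KerdosAgent:
    """Orchestrates the pipeline shown in the diagram."""

    def __init__(self) -> None:
        self.processor = DataProcessor()
        self.trainer = Trainer()
        self.deployer = Deployer()

    def run(self, data_path: str) -> str:
        data = self.processor.process(data_path)
        model = self.trainer.train(data)
        return self.deployer.deploy(model)
```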
## Features
### Configuration Management
- YAML-based configuration with Pydantic validation
- Environment variable substitution
- Training presets for common scenarios
- Configuration inheritance and overrides
```yaml
base_model: "gpt2"
lora:
enabled: true
r: 16
alpha: 64
training:
epochs: 5
batch_size: 8
learning_rate: 0.00001
```
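Environment variable substitution is listed above; a minimal sketch of how `${VAR}` references in a YAML string could be expanded before parsing. The actual implementation in `kerdosai/config.py` may differ (this version leaves unknown variables untouched):

```python
import os
import re

_VAR = re.compile(r"\$\{(\w+)\}")

def expand_env(text: str) -> str:
    """Replace ${NAME} with the value of environment variable NAME,
    leaving references to undefined variables as-is."""
    def repl(match: re.Match) -> str:
        name = match.group(1)
        return os.environ.get(name, match.group(0))
    return _VAR.sub(repl, text)

os.environ["MODEL_NAME"] = "gpt2"
print(expand_env('base_model: "${MODEL_NAME}"'))  # base_model: "gpt2"
```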
### Efficient Training
- **LoRA**: Parameter-efficient fine-tuning
- **Quantization**: 4-bit and 8-bit support
- **Mixed Precision**: FP16/BF16 training
- **Gradient Accumulation**: Train larger models
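Gradient accumulation lets the effective batch size exceed what fits in memory by summing gradients over several micro-batches before each optimizer step. A framework-free sketch of the idea (the real trainer does this with PyTorch tensors, but the arithmetic is the same):

```python
def train_with_accumulation(micro_batch_grads, accum_steps):
    """Accumulate per-micro-batch gradients and take an optimizer step
    every accum_steps batches. A 'gradient' here is just a float to
    keep the sketch framework-free."""
    steps_taken = 0
    accumulated = 0.0
    for i, grad in enumerate(micro_batch_grads, start=1):
        # Scale each micro-batch gradient so the sum matches one large batch.
        accumulated += grad / accum_steps
        if i % accum_steps == 0:
            # optimizer.step() would run here; we just count the steps.
            steps_taken += 1
            accumulated = 0.0
    return steps_taken

# 8 micro-batches accumulated 4 at a time -> 2 optimizer steps,
# with an effective batch size 4x the micro-batch size.
```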
### Enhanced CLI
```bash
# Rich terminal output with progress bars
python cli.py train --config configs/default.yaml
# Model information
python cli.py info ./output
# Configuration validation
python cli.py validate-config configs/default.yaml
```
### Testing Infrastructure
```bash
# Run tests
pytest
# With coverage
pytest --cov=kerdosai --cov-report=html
# Specific tests
pytest tests/test_config.py -v
```
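A test in the suite might look like the following. This is a hypothetical sketch; the actual validator lives in `kerdosai/config.py` behind Pydantic, and the real test names and schema may differ:

```python
def validate_lora_rank(r: int) -> int:
    """Reject non-positive LoRA ranks, mirroring Pydantic-style validation."""
    if r <= 0:
        raise ValueError("lora.r must be a positive integer")
    return r

def test_valid_rank():
    assert validate_lora_rank(16) == 16

def test_invalid_rank():
    try:
        validate_lora_rank(0)
    except ValueError:
        pass
    else:
        raise AssertionError("expected ValueError for non-positive rank")
```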
### Docker Deployment
```bash
# Build and run
docker-compose up
# Training service
docker-compose run kerdosai-train
# API service
docker-compose up kerdosai-api
# TensorBoard
docker-compose up tensorboard
```
## Python API
```python
from kerdosai.agent import KerdosAgent
from kerdosai.config import load_config

# Load configuration
config = load_config("configs/default.yaml")

# Initialize agent
agent = KerdosAgent(
    base_model="gpt2",
    training_data="./data/train.json",
)

# Prepare for training
agent.prepare_for_training(
    use_lora=True,
    lora_r=8,
    use_4bit=True,
)

# Train
metrics = agent.train(
    epochs=3,
    batch_size=4,
    learning_rate=2e-5,
)

# Save and generate
agent.save("./output")
output = agent.generate("Hello, AI!", max_length=100)
```
## Documentation
- [Quick Start Guide](docs/QUICKSTART.md)
- [Configuration Reference](configs/default.yaml)
- [API Documentation](https://kerdos.in/docs)
- [Contributing Guidelines](CONTRIBUTING.md)
## Project Structure
```
kerdosai/
β”œβ”€β”€ agent.py             # Main agent implementation
β”œβ”€β”€ trainer.py           # Training logic
β”œβ”€β”€ deployer.py          # Deployment management
β”œβ”€β”€ data_processor.py    # Data processing
β”œβ”€β”€ config.py            # Configuration management
β”œβ”€β”€ exceptions.py        # Custom exceptions
β”œβ”€β”€ cli.py               # Enhanced CLI
β”œβ”€β”€ configs/             # Configuration files
β”‚   β”œβ”€β”€ default.yaml
β”‚   └── training_presets.yaml
β”œβ”€β”€ tests/               # Test suite
β”‚   β”œβ”€β”€ test_config.py
β”‚   β”œβ”€β”€ test_exceptions.py
β”‚   └── ...
β”œβ”€β”€ docs/                # Documentation
└── requirements.txt     # Dependencies
```
## Requirements
- Python 3.8+
- PyTorch 2.0+
- Transformers 4.30+
- See [requirements.txt](requirements.txt) for full list
## Development
```bash
# Install development dependencies
pip install pytest pytest-cov black ruff mypy rich typer
# Format code
black .
ruff check .
# Type checking
mypy .
# Run tests
pytest --cov=kerdosai
```
## Contributing
We welcome contributions! Please see [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
## License
This project is licensed under the Apache License 2.0 - see [LICENSE](LICENSE) for details.
## Citation
```bibtex
@software{kerdosai2024,
  title     = {KerdosAI: Universal LLM Training Agent},
  author    = {KerdosAI Team},
  year      = {2024},
  version   = {0.2.0},
  publisher = {GitHub},
  url       = {https://github.com/bhaskarvilles/kerdosai}
}
```
## Contact
- Website: [https://kerdos.in](https://kerdos.in)
- Email: support@kerdos.in
- GitHub: [bhaskarvilles/kerdosai](https://github.com/bhaskarvilles/kerdosai)
## Acknowledgments
Built with:
- [PyTorch](https://pytorch.org/)
- [Transformers](https://huggingface.co/transformers/)
- [PEFT](https://github.com/huggingface/peft)
- [Rich](https://rich.readthedocs.io/)
- [Typer](https://typer.tiangolo.com/)