UltraThinking-LLM-Training / CONTRIBUTING.md

Upload folder using huggingface_hub

54c5666 verified 4 months ago

8.58 kB

	# Contributing to ULTRATHINK

	Thank you for your interest in contributing to ULTRATHINK! This document provides guidelines and information for contributors.

	## 🤝 How to Contribute

	### Reporting Issues

	Before creating an issue, please:

	1. Search existing issues to avoid duplicates
	2. Use the issue templates when available
	3. Provide detailed information including:
	- Python version and OS
	- PyTorch version
	- Full error traceback
	- Steps to reproduce
	- Expected vs actual behavior

	### Suggesting Features

	We welcome feature suggestions! Please:

	1. Check existing feature requests first
	2. Describe the use case and motivation
	3. Provide implementation ideas if possible
	4. Consider backwards compatibility

	### Pull Requests

	#### Before You Start

	1. Fork the repository and create a feature branch
	2. Check existing PRs to avoid duplicate work
	3. Discuss major changes in an issue first

	#### Development Setup

	```bash
	# Clone your fork
	git clone https://github.com/yourusername/ultrathink.git
	cd ultrathink

	# Create virtual environment
	python -m venv .venv
	source .venv/bin/activate # Windows: .venv\Scripts\activate

	# Install in development mode
	pip install -e .
	pip install -r requirements.txt

	# Install development tools
	pip install pre-commit pytest pytest-cov black flake8 mypy

	# Setup pre-commit hooks
	pre-commit install
	```

	#### Code Standards

	We maintain high code quality standards:

	Code Formatting
	```bash
	# Format code with Black
	black src/ tests/ scripts/

	# Check formatting
	black --check src/ tests/ scripts/
	```

	Linting
	```bash
	# Run flake8
	flake8 src/ tests/ scripts/

	# Run mypy for type checking
	mypy src/
	```

	Testing
	```bash
	# Run all tests
	python -m pytest tests/

	# Run with coverage
	python -m pytest tests/ --cov=src --cov-report=html

	# Run smoke test
	python -m tests.smoke_test
	```

	#### Code Style Guidelines

	1. Follow PEP 8 with Black formatting
	2. Use type hints for all functions and methods
	3. Write docstrings for public APIs (Google style)
	4. Keep functions focused and reasonably sized
	5. Use meaningful variable names
	6. Add comments for complex logic

	Example:
	```python
	def train_model(
	model: torch.nn.Module,
	dataloader: DataLoader,
	optimizer: torch.optim.Optimizer,
	device: torch.device,
	epochs: int = 10
	) -> Dict[str, float]:
	"""Train a PyTorch model.

	Args:
	model: The model to train
	dataloader: Training data loader
	optimizer: Optimizer for training
	device: Device to train on
	epochs: Number of training epochs

	Returns:
	Dictionary containing training metrics

	Raises:
	ValueError: If epochs is not positive
	"""
	if epochs <= 0:
	raise ValueError("Epochs must be positive")

	# Training implementation...
	return {"loss": final_loss, "accuracy": final_acc}
	```

	#### Commit Guidelines

	Commit Message Format
	```
	<type>(<scope>): <description>

	[optional body]

	[optional footer]
	```

	Types:
	- `feat`: New feature
	- `fix`: Bug fix
	- `docs`: Documentation changes
	- `style`: Code style changes (formatting, etc.)
	- `refactor`: Code refactoring
	- `test`: Adding or updating tests
	- `chore`: Maintenance tasks

	Examples:
	```
	feat(models): add flash attention support

	fix(training): resolve gradient accumulation bug

	docs(readme): update installation instructions

	test(models): add unit tests for MoE routing
	```

	#### Pull Request Process

	1. Create a feature branch from `main`
	```bash
	git checkout -b feature/your-feature-name
	```

	2. Make your changes following the guidelines above

	3. Write or update tests for your changes

	4. Update documentation if needed

	5. Run the full test suite
	```bash
	python -m pytest tests/
	python -m tests.smoke_test
	```

	6. Run pre-commit checks
	```bash
	pre-commit run --all-files
	```

	7. Push to your fork and create a pull request

	8. Fill out the PR template completely

	9. Respond to review feedback promptly

	#### PR Review Process

	- All PRs require at least one review
	- Automated checks must pass
	- Documentation must be updated for user-facing changes
	- Breaking changes require discussion and migration guide

	## 🏗️ Architecture Guidelines

	### Adding New Models

	When adding new model components:

	1. Follow the existing patterns in `src/models/`
	2. Inherit from appropriate base classes
	3. Add comprehensive docstrings
	4. Include configuration classes
	5. Add unit tests
	6. Update integration tests

	### Adding New Training Features

	For training enhancements:

	1. Consider backwards compatibility
	2. Add configuration options
	3. Include proper logging
	4. Add evaluation metrics
	5. Document hyperparameter effects

	### Adding New Data Loaders

	For data pipeline additions:

	1. Support streaming when possible
	2. Include quality filtering options
	3. Add proper error handling
	4. Support multiple formats
	5. Include data validation

	## 🧪 Testing Guidelines

	### Test Structure

	```
	tests/
	├── unit/ # Unit tests for individual components
	├── integration/ # Integration tests for workflows
	├── fixtures/ # Test data and fixtures
	└── conftest.py # Pytest configuration
	```

	### Writing Tests

	1. Use descriptive test names
	```python
	def test_transformer_block_forward_pass_with_attention_mask():
	```

	2. Test edge cases and error conditions
	```python
	def test_model_raises_error_with_invalid_vocab_size():
	```

	3. Use fixtures for common setup
	```python
	@pytest.fixture
	def small_model_config():
	return ModelConfig(vocab_size=1000, n_embd=128, n_layer=2)
	```

	4. Mock external dependencies
	```python
	@patch('wandb.init')
	def test_training_without_wandb(mock_wandb):
	```

	### Performance Tests

	Include performance benchmarks for:
	- Model forward/backward pass timing
	- Memory usage patterns
	- Throughput measurements

	## 📚 Documentation

	### Code Documentation

	- Docstrings: Use Google style for all public APIs
	- Type hints: Required for all function signatures
	- Comments: Explain complex algorithms and business logic

	### User Documentation

	- README updates: For user-facing changes
	- Configuration docs: For new parameters
	- Examples: Include usage examples
	- Migration guides: For breaking changes

	## 🚀 Release Process

	### Version Numbering

	We follow [Semantic Versioning](https://semver.org/):
	- `MAJOR.MINOR.PATCH`
	- Major: Breaking changes
	- Minor: New features (backwards compatible)
	- Patch: Bug fixes

	### Release Checklist

	1. Update version in `setup.py`
	2. Update `CHANGELOG.md`
	3. Run full test suite
	4. Update documentation
	5. Create release PR
	6. Tag release after merge
	7. Update GitHub release notes

	## 🎯 Priority Areas

	We're particularly interested in contributions to:

	1. Performance optimizations
	- Memory efficiency improvements
	- Training speed optimizations
	- Inference acceleration

	2. New model architectures
	- Novel attention mechanisms
	- Advanced MoE strategies
	- Multi-modal improvements

	3. Training improvements
	- Better data loading
	- Advanced RLHF techniques
	- Distributed training optimizations

	4. Evaluation and benchmarking
	- New benchmark integrations
	- Evaluation metrics
	- Analysis tools

	5. Documentation and examples
	- Tutorial notebooks
	- Use case examples
	- API documentation

	## ❓ Questions?

	- General questions: Open a [Discussion](https://github.com/yourusername/ultrathink/discussions)
	- Bug reports: Open an [Issue](https://github.com/yourusername/ultrathink/issues)
	- Feature requests: Open an [Issue](https://github.com/yourusername/ultrathink/issues) with the feature template
	- Security issues: Email security@ultrathink.ai

	## 🙏 Recognition

	Contributors will be:
	- Listed in the README
	- Mentioned in release notes
	- Invited to the contributors team (for significant contributions)

	Thank you for helping make ULTRATHINK better! 🚀