A CPU-friendly system for fine-tuning small language models on your own conversation data and deploying them for efficient local inference.
-
Base model