# Extending MK-LLM

This guide shows how to plug in different base models, datasets, and adapters.
## Swap base model

- Set `MODEL_PATH` in `.env` to a local directory or an HF repo id.
- If using an HF repo, set `TRUST_REMOTE_CODE=true` when the model requires custom code.
- Low-VRAM: set `LOAD_IN_4BIT=true` (or `LOAD_IN_8BIT=true`).
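These settings can be translated into model-loading keyword arguments. The helper below is a sketch: the function name `model_load_kwargs` and the mapping to `transformers`-style kwargs are assumptions for illustration, not the project's actual loader.

```python
import os


def model_load_kwargs(env=os.environ):
    """Translate .env settings into keyword arguments for model loading.

    Hypothetical helper: the variable names match this guide, but the
    target loader API (e.g. AutoModelForCausalLM.from_pretrained) is an
    assumption.
    """
    kwargs = {"pretrained_model_name_or_path": env.get("MODEL_PATH", "")}
    if env.get("TRUST_REMOTE_CODE", "").lower() == "true":
        kwargs["trust_remote_code"] = True
    # 4-bit takes precedence over 8-bit if both are set.
    if env.get("LOAD_IN_4BIT", "").lower() == "true":
        kwargs["load_in_4bit"] = True
    elif env.get("LOAD_IN_8BIT", "").lower() == "true":
        kwargs["load_in_8bit"] = True
    return kwargs
```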
## Add datasets

- Place cleaned text into `data/cleaned/*.txt`, or generate `data/cleaned/mk_combined_data.txt` via `python -m data.process_all_data`.
- The trainer uses `examples/data_loader.load_mk_dataset()`, which prefers the combined file.
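The preference order above can be sketched as follows. This is an illustrative stand-in for the real `examples/data_loader.load_mk_dataset()`, whose exact return type and behavior may differ:

```python
from pathlib import Path


def load_cleaned_text(data_dir="data/cleaned"):
    """Sketch of the loading order described above: prefer the combined
    file, otherwise concatenate the individual cleaned .txt files."""
    root = Path(data_dir)
    combined = root / "mk_combined_data.txt"
    if combined.exists():
        return combined.read_text(encoding="utf-8")
    parts = sorted(root.glob("*.txt"))
    return "\n".join(p.read_text(encoding="utf-8") for p in parts)
```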
## Instruction tuning

- Convert text into chat turns and use `tokenizer.apply_chat_template` in the training collator.
- Provide Macedonian system prompts and stop sequences as needed.
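A minimal sketch of the conversion step, assuming a simple single-turn format; the helper name and the Macedonian system prompt are placeholders:

```python
def to_chat_turns(user_text, system_prompt="Ти си корисен асистент."):
    """Wrap raw text into the messages list that
    tokenizer.apply_chat_template expects. Single-turn only; extend as
    needed for multi-turn data."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_text},
    ]
```

In the collator, the resulting list would then be rendered with something like `tokenizer.apply_chat_template(turns, tokenize=False, add_generation_prompt=True)`.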
## Custom inference params

- Use `POST /v1/chat/completions` with `temperature`, `top_p`, `max_tokens`, and `stream`.
- Configure defaults via `.env`.
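As an illustration, a request body for this endpoint can be assembled like so; the default values shown here are placeholders, not the server's actual `.env` defaults:

```python
import json


def chat_request(messages, temperature=0.7, top_p=0.9,
                 max_tokens=512, stream=False):
    """Build the JSON body for POST /v1/chat/completions.

    The parameter names follow this guide; the defaults are illustrative.
    """
    return json.dumps({
        "messages": messages,
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
        "stream": stream,
    })
```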
## Contribute plugins

- Add new data collectors under `data/` and document their flags in the README.
- Add new generation strategies or safety middlewares in `inference/`.
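One possible shape for a safety-middleware chain in `inference/` is sketched below; the interface (plain callables applied in order) is a suggestion, not the project's actual plugin API:

```python
import re


def apply_middlewares(text, middlewares):
    """Run generated text through a list of middlewares, in order.
    Each middleware takes a string and returns a (possibly modified)
    string."""
    for mw in middlewares:
        text = mw(text)
    return text


def redact_emails(text):
    # Example safety middleware: mask anything that looks like an e-mail.
    return re.sub(r"\S+@\S+", "[redacted]", text)
```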