boxwrench
/

potable-lm

Text Generation

water-treatment

critical-infrastructure

Model card Files Files and versions

potable-lm / README.md

boxwrench's picture

Tighten model card for project stage

c5daafc verified about 2 months ago

|

history blame contribute delete

2.72 kB

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: text-generation
	tags:
	- water-treatment
	- drinking-water
	- critical-infrastructure
	- gemma
	- fine-tuning
	---

	# PotableLM

	## Model Summary

	PotableLM is a planned domain-adapted model family for drinking water treatment operations, built on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable) — an expert-curated corpus of operational water treatment knowledge.

	Two tracks are planned: a municipal track for licensed plant operators (on-premises deployable) and a developing regions track for community water workers (offline-capable, fully open).

	No model weights have been released yet. This page establishes the project's intended scope while development continues.

	## Intended Use

	The model is intended as a technical assistant for:

	- licensed operators
	- utility staff
	- trainers and technical reviewers
	- researchers evaluating domain adaptation in critical infrastructure

	Primary target behaviors:

	- practical operational reasoning
	- troubleshooting support
	- calculation walkthroughs
	- technically grounded explanations in operator voice

	## Out-of-Scope Use

	- direct control of treatment processes
	- fully autonomous safety-critical decision-making
	- compliance interpretation without human review
	- replacement for plant procedures, regulations, or licensed judgment

	## Base Model

	Base model selection is ongoing. The project prioritizes permissive licensing, local deployment potential, and strong fine-tuning characteristics.

	## Training Data

	The model will be trained on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable), an expert-curated corpus covering treatment process knowledge, plant operations, troubleshooting, calculations, and regulatory context. Every example is authored or reviewed by a licensed operator.

	## Training Procedure

	Training procedure will be documented with the first checkpoint release.

	## Evaluation

	No benchmark results are published yet. Evaluation details will accompany each released checkpoint.

	## Risks and Limitations

	- Water treatment advice is context-dependent and should not be generalized blindly across plants.
	- Model outputs can be plausible and still wrong.
	- The model must be treated as an assistant, not an authority.
	- Current and local regulations always override model output.

	## License

	License will be specified with each released checkpoint.

	## Contact

	Keith Wilkinson
	Operational Inference — [operationalinference.com](https://operationalinference.com)
	GitHub: [boxwrench](https://github.com/boxwrench)
	Writing: [title22.org](https://title22.org)