---
license: apache-2.0
language:
- en
pipeline_tag: text-generation
tags:
- water-treatment
- drinking-water
- critical-infrastructure
- gemma
- fine-tuning
---

# PotableLM

## Model Summary

PotableLM is a planned domain-adapted model family for drinking water treatment operations, built on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable) — an expert-curated corpus of operational water treatment knowledge.

Two tracks are planned: a municipal track for licensed plant operators (on-premises deployable) and a developing regions track for community water workers (offline-capable, fully open).

No model weights have been released yet. This page establishes the project's intended scope while development continues.

## Intended Use

The model is intended as a technical assistant for:

- licensed operators
- utility staff
- trainers and technical reviewers
- researchers evaluating domain adaptation in critical infrastructure

Primary target behaviors:

- practical operational reasoning
- troubleshooting support
- calculation walkthroughs
- technically grounded explanations in operator voice

## Out-of-Scope Use

- direct control of treatment processes
- fully autonomous safety-critical decision-making
- compliance interpretation without human review
- replacement for plant procedures, regulations, or licensed judgment

## Base Model

Base model selection is ongoing. The project prioritizes permissive licensing, local deployment potential, and strong fine-tuning characteristics.

## Training Data

The model will be trained on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable), an expert-curated corpus covering treatment process knowledge, plant operations, troubleshooting, calculations, and regulatory context. Every example is authored or reviewed by a licensed operator.

## Training Procedure

Training procedure will be documented with the first checkpoint release.

## Evaluation

No benchmark results are published yet. Evaluation details will accompany each released checkpoint.

## Risks and Limitations

- Water treatment advice is context-dependent and should not be generalized blindly across plants.
- Model outputs can be plausible and still wrong.
- The model must be treated as an assistant, not an authority.
- Current and local regulations always override model output.

## License

License will be specified with each released checkpoint.

## Contact

Keith Wilkinson
Operational Inference — [operationalinference.com](https://operationalinference.com)
GitHub: [boxwrench](https://github.com/boxwrench)
Writing: [title22.org](https://title22.org)