| ---
|
| license: apache-2.0
|
| language:
|
| - en
|
| pipeline_tag: text-generation
|
| tags:
|
| - water-treatment
|
| - drinking-water
|
| - critical-infrastructure
|
| - gemma
|
| - fine-tuning
|
| ---
|
|
|
| # PotableLM
|
|
|
| ## Model Summary
|
|
|
| PotableLM is a planned domain-adapted model family for drinking water treatment operations, built on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable) — an expert-curated corpus of operational water treatment knowledge.
|
|
|
| Two tracks are planned: a municipal track for licensed plant operators (on-premises deployable) and a developing regions track for community water workers (offline-capable, fully open).
|
|
|
| No model weights have been released yet. This page establishes the project's intended scope while development continues.
|
|
|
| ## Intended Use
|
|
|
| The model is intended as a technical assistant for:
|
|
|
| - licensed operators
|
| - utility staff
|
| - trainers and technical reviewers
|
| - researchers evaluating domain adaptation in critical infrastructure
|
|
|
| Primary target behaviors:
|
|
|
| - practical operational reasoning
|
| - troubleshooting support
|
| - calculation walkthroughs
|
| - technically grounded explanations in operator voice
|
|
|
| ## Out-of-Scope Use
|
|
|
| - direct control of treatment processes
|
| - fully autonomous safety-critical decision-making
|
| - compliance interpretation without human review
|
| - replacement for plant procedures, regulations, or licensed judgment
|
|
|
| ## Base Model
|
|
|
| Base model selection is ongoing. The project prioritizes permissive licensing, local deployment potential, and strong fine-tuning characteristics.
|
|
|
| ## Training Data
|
|
|
| The model will be trained on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable), an expert-curated corpus covering treatment process knowledge, plant operations, troubleshooting, calculations, and regulatory context. Every example is authored or reviewed by a licensed operator.
|
|
|
| ## Training Procedure
|
|
|
| Training procedure will be documented with the first checkpoint release.
|
|
|
| ## Evaluation
|
|
|
| No benchmark results are published yet. Evaluation details will accompany each released checkpoint.
|
|
|
| ## Risks and Limitations
|
|
|
| - Water treatment advice is context-dependent and should not be generalized blindly across plants.
|
| - Model outputs can be plausible and still wrong.
|
| - The model must be treated as an assistant, not an authority.
|
| - Current and local regulations always override model output.
|
|
|
| ## License
|
|
|
| License will be specified with each released checkpoint.
|
|
|
| ## Contact
|
|
|
| Keith Wilkinson
|
| Operational Inference — [operationalinference.com](https://operationalinference.com)
|
| GitHub: [boxwrench](https://github.com/boxwrench)
|
| Writing: [title22.org](https://title22.org)
|
|
|