potable-lm / README.md
boxwrench's picture
Tighten model card for project stage
c5daafc verified
---
license: apache-2.0
language:
- en
pipeline_tag: text-generation
tags:
- water-treatment
- drinking-water
- critical-infrastructure
- gemma
- fine-tuning
---
# PotableLM
## Model Summary
PotableLM is a planned domain-adapted model family for drinking water treatment operations, built on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable) — an expert-curated corpus of operational water treatment knowledge.
Two tracks are planned: a municipal track for licensed plant operators (on-premises deployable) and a developing regions track for community water workers (offline-capable, fully open).
No model weights have been released yet. This page establishes the project's intended scope while development continues.
## Intended Use
The model is intended as a technical assistant for:
- licensed operators
- utility staff
- trainers and technical reviewers
- researchers evaluating domain adaptation in critical infrastructure
Primary target behaviors:
- practical operational reasoning
- troubleshooting support
- calculation walkthroughs
- technically grounded explanations in operator voice
## Out-of-Scope Use
- direct control of treatment processes
- fully autonomous safety-critical decision-making
- compliance interpretation without human review
- replacement for plant procedures, regulations, or licensed judgment
## Base Model
Base model selection is ongoing. The project prioritizes permissive licensing, local deployment potential, and strong fine-tuning characteristics.
## Training Data
The model will be trained on the [Potable Dataset](https://huggingface.co/datasets/boxwrench/potable), an expert-curated corpus covering treatment process knowledge, plant operations, troubleshooting, calculations, and regulatory context. Every example is authored or reviewed by a licensed operator.
## Training Procedure
Training procedure will be documented with the first checkpoint release.
## Evaluation
No benchmark results are published yet. Evaluation details will accompany each released checkpoint.
## Risks and Limitations
- Water treatment advice is context-dependent and should not be generalized blindly across plants.
- Model outputs can be plausible and still wrong.
- The model must be treated as an assistant, not an authority.
- Current and local regulations always override model output.
## License
License will be specified with each released checkpoint.
## Contact
Keith Wilkinson
Operational Inference — [operationalinference.com](https://operationalinference.com)
GitHub: [boxwrench](https://github.com/boxwrench)
Writing: [title22.org](https://title22.org)