README.md · Ch3DS/ssi-bert-v1 at main

ssi-bert-v1 / README.md

Ch3w3y

Update README.md

9288b34 verified 3 months ago

preview code

raw

history blame contribute delete

9.16 kB

	---
	license: apache-2.0
	language:
	- en
	metrics:
	- accuracy
	base_model:
	- google-bert/bert-base-uncased
	pipeline_tag: text-classification
	tags:
	- SSI
	- Health
	- Surgery
	- Infection
	- AMR
	- Automation
	- Surveillance
	- Epidemiology
	- Clinical
	- Raw
	- Text
	---
	# Model Card: SSI-BERT-v1

	Surgical Site Infection Detection from Clinical Notes

	---

	## Model Details

	### Model Architecture
	- Base Model: BERT (Bidirectional Encoder Representations from Transformers)
	- Model Name: `bert-base-uncased`
	- HuggingFace ID: `google-bert/bert-base-uncased`
	- Task: Binary Classification (SSI Detection)
	- Fine-tuned: Yes
	- Model Size: 340 MB
	- Parameters: 110M

	### Training Configuration
	- Framework: PyTorch 2.7.0+
	- GPU: NVIDIA RTX 5070 Ti (16GB VRAM)
	- Precision: BF16 (Mixed precision)
	- Optimizer: AdamW
	- Learning Rate: 2e-5
	- Batch Size: 32 (with gradient accumulation)
	- Epochs: 3
	- Training Time: ~5-6 hours
	- Training Date: 2025-01-15

	### Tokenizer
	- Type: WordPiece
	- Vocabulary Size: 30,522
	- Max Sequence Length: 512 tokens
	- Special Tokens: [CLS], [SEP], [PAD], [UNK]

	---

	## Intended Use

	### Primary Use Case
	Epidemiological surveillance of surgical site infections (SSI) from clinical notes in healthcare systems. This model is designed for monitoring and trend detection, not clinical decision support.

	### Use Context
	- Post-operative clinical notes (0-30 days post-surgery)
	- Batch processing of clinical documentation
	- Surveillance alert generation
	- Procedure-specific SSI rate tracking

	### Appropriate Uses
	- ✓ Identifying potential SSI cases for further review
	- ✓ Tracking SSI incidence trends across departments/procedures
	- ✓ Flagging high-risk cases for clinician review
	- ✓ Epidemiological research and surveillance

	### Inappropriate Uses
	- ✗ Standalone clinical diagnosis
	- ✗ Real-time patient triage decisions
	- ✗ Treatment recommendations
	- ✗ Automated patient management without human review

	---

	## Performance Metrics

	### Validation Results (Synthetic Data)
	- Accuracy: 0.8900
	- Precision: 0.8500
	- Recall (Sensitivity): 0.8800
	- Specificity: ~0.8500
	- F1 Score: 0.8650
	- AUC-ROC: 0.9200
	- Dataset Size: 200,000 test samples

	### Performance Notes
	- Metrics calculated on held-out test set (synthetic data)
	- Real-world performance expected to vary (±5-10%)
	- Model optimized for recall (catching SSI cases)
	- 12% false negative rate acceptable for surveillance use
	- 15% false positive rate manageable with clinician review

	### Threshold Analysis
	- Default Threshold: 0.50
	- Surveillance Threshold: 0.45 (optimized for sensitivity)
	- Conservative Threshold: 0.60 (high precision)

	---

	## Training Data

	### Data Source
	Synthetic clinical notes generated for validation purposes

	### Data Composition
	- Total Training Samples: 1,000,000
	- SSI Cases (Positive): 150,000 (15%)
	- Normal Cases (Negative): 850,000 (85%)
	- Train/Val/Test Split: 70% / 15% / 15%

	### Data Characteristics
	- Post-operative clinical notes (0-30 days post-surgery)
	- 12 surgical procedure types represented
	- Clinical terminology and medical abbreviations
	- Vital signs and clinical findings
	- Note length: 300-2000 characters (avg 800)

	### Limitations
	- Synthetic Generation: Notes generated using templates
	- Not Trained on Real Clinical Data: Performance on real clinical notes may differ
	- English Only: No multi-language support
	- US Healthcare Context: Terminology based on US clinical practice

	---

	## Limitations and Bias

	### Known Limitations
	1. Domain Shift: Trained on synthetic data; real clinical text may have different patterns
	2. Class Imbalance: Model trained with 15% SSI prevalence (real ~5-10%)
	3. Language: English-only, US healthcare context
	4. Temporal Bias: No temporal ordering in training (shuffled data)
	5. Procedure Coverage: Limited to 12 procedure types
	6. Post-operative Window: Optimized for 0-30 days post-op only

	### Potential Biases
	- Clinical Documentation Style: Model may perform differently across hospitals with different documentation practices
	- Terminology Variation: May struggle with rare/novel clinical abbreviations
	- Provider Bias: Performance may vary by note author/department

	### Generalization
	- Not validated on external datasets
	- Expected performance drop on out-of-distribution data
	- Requires validation on real clinical data before deployment

	---

	## Ethical Considerations

	### Fairness
	- Model developed for epidemiological surveillance, not individual diagnosis
	- Not intended for resource allocation decisions
	- Should not be sole factor in clinical decisions

	### Transparency
	- Decision threshold can be adjusted for sensitivity/specificity trade-off
	- Model provides probability scores for human interpretation
	- Predictions should always be reviewed by clinicians

	### Safety
	- Model designed as surveillance tool, not clinical decision support
	- Includes explicit warnings against standalone clinical use
	- Requires human-in-the-loop for alert validation

	### Privacy
	- Model does not store patient data
	- De-identified text input only
	- No identifiable information in model outputs

	---

	## Model Inputs and Outputs

	### Input
	```json
	{
	"text": "Clinical note text here...",
	"threshold": 0.5
	}
	```

	### Output
	```json
	{
	"ssi_probability": 0.8234,
	"label": 1,
	"prediction": "SSI",
	"threshold": 0.5,
	"timestamp": "2025-01-15T16:04:43"
	}
	```

	### Input Constraints
	- Text length: 50-10,000 characters
	- Language: English only
	- Format: Plain text clinical notes
	- Context: Post-operative (0-30 days)

	### Output Interpretation
	- `ssi_probability` (0-1): Confidence score for SSI presence
	- `label` (0 or 1): Binary classification
	- `prediction`: Human-readable class label
	- Scores <0.4: Likely negative
	- Scores 0.4-0.6: Uncertain (requires review)
	- Scores >0.6: Likely positive

	---

	## Training and Evaluation

	### Training Parameters
	```yaml
	Model: bert-base-uncased
	Optimizer: adamw_torch
	Learning Rate: 2e-5
	Batch Size: 32
	Gradient Accumulation: 2
	Gradient Checkpointing: True
	Mixed Precision: BF16
	Warmup Steps: 100
	Max Grad Norm: 1.0
	Weight Decay: 0.01
	```

	### Evaluation Methodology
	- Stratified train/val/test split (70/15/15)
	- Class-weighted metrics due to imbalance
	- Threshold optimization on validation set
	- Held-out test set evaluation

	### Hardware
	- GPU: NVIDIA RTX 5070 Ti (16GB VRAM)
	- CPU: Multi-core processor
	- RAM: 16GB+
	- Storage: 1GB for model + dependencies

	---

	## Model Versioning

	### Version: 1.0.0
	- Release Date: 2025-01-15
	- Status: Beta/Prototype
	- Base Model: bert-base-uncased
	- Training Epochs: 3
	- Data: Synthetic (1M samples)
	- Validation: Synthetic test set

	### Future Versions
	- v1.1: Expected after real clinical data validation
	- v2.0: Planned with ClinicalBERT base model
	- v2.1: Multi-procedure optimization

	---

	## How to Use

	### Installation
	```bash
	pip install transformers torch
	```

	### Local Inference
	```python
	from transformers import BertTokenizer, BertForSequenceClassification
	import torch

	model_path = "output/models/ssi-bert-pipeline/initial/final"
	tokenizer = BertTokenizer.from_pretrained(model_path)
	model = BertForSequenceClassification.from_pretrained(model_path)

	text = "Clinical note here..."
	inputs = tokenizer(text, return_tensors='pt', truncation=True, max_length=512)

	with torch.no_grad():
	outputs = model(**inputs)
	probability = torch.softmax(outputs.logits, dim=1)[0, 1].item()
	```

	### Via API
	```bash
	curl -X POST "http://localhost:8000/predict" \
	-H "Content-Type: application/json" \
	-d '{"text":"Clinical note text", "threshold":0.5}'
	```

	### Batch Processing
	```bash
	python cli.py monitor \
	--model-path output/models/ssi-bert-pipeline/initial/final \
	--data data/clinical_notes.csv \
	--period january_2024 \
	--save-predictions
	```

	---

	## Citation

	If you use this model, please cite:

	```bibtex
	@model{ssi_bert_v1_2025,
	title={SSI-BERT-v1: BERT-based Surgical Site Infection Detection},
	author={Daryn Sutton/Ch3DS},
	year={2025},
	month={January},
	note={Trained on synthetic clinical notes for epidemiological surveillance}
	}
	```

	---

	## License

	Model: Apache 2.0 (inherited from BERT-base-uncased)
	Documentation: CC-BY-4.0

	---

	## Changelog

	### Version 1.0.0 (2025-01-15)
	- Initial release
	- Trained on 1M synthetic clinical notes
	- Validated on 200k test samples
	- Performance: 89% accuracy, 92% AUC-ROC

	---

	## Contact and Support

	For questions or issues:
	- Documentation: See README.md
	- Issue Tracking: github.com/Ch3w3y/SSIBERT
	- Email: darynsutton@hotmail.com

	---

	## Disclaimer

	This model is provided for research and surveillance purposes only. It is not intended for clinical diagnosis or treatment decisions. Always consult with qualified healthcare professionals for clinical decisions. The developers assume no liability for misuse or unintended consequences.

	---

	Model Card Last Updated: 2025-01-15
	Model Version: 1.0.0
	Status: Beta (Pre-production)