File size: 5,461 Bytes
3e25607 6c8e05e 3e25607 6c8e05e 3e25607 6c8e05e 3e25607 6c8e05e 3e25607 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 |
---
base_model:
- unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
pipeline_tag: text-generation
library_name: transformers
language:
- en
license: apache-2.0
datasets:
- ClinVar
- COSMIC
tags:
- medical
- genomics
- cancer
- oncology
- mutation-analysis
- precision-medicine
- GGUF
- Ollama
model_type: gemma3n
quantized_by: OncoScope
---
# OncoScope Cancer Genomics Analysis Model
OncoScope is a specialized AI model fine-tuned for cancer genomics analysis and precision oncology. Built on Google's Gemma 3n architecture, this model provides expert-level analysis of cancer mutations, risk assessments, and therapeutic recommendations while maintaining complete privacy through on-device inference.
## Model Details
- **Base Model**: Google Gemma 3n 2B E4B Chat IT
- **Parameters**: 6.9B (quantized from fine-tuned model)
- **Architecture**: Gemma3n
- **Quantization**: Q8_0 GGUF format
- **Context Length**: 32,768 tokens
- **Embedding Length**: 2,048
## Key Features
- **Cancer Mutation Analysis**: Pathogenicity assessment using ACMG/AMP guidelines
- **Risk Stratification**: Hereditary cancer syndrome evaluation
- **Therapeutic Recommendations**: Evidence-based drug target identification
- **Privacy-First**: Designed for on-device inference with Ollama
- **Clinical Guidelines**: Incorporates established medical standards
- **Multi-mutation Analysis**: Complex genomic interaction assessment
## Training Data
The model was fine-tuned on a curated dataset of 5,998 cancer genomics examples from:
- **ClinVar**: Clinical variant database
- **COSMIC Top 50**: Cancer mutation signatures
- **Expert-curated**: Clinical oncology cases
## Usage
### With Ollama
1. **Download the model files**:
- `oncoscope-gemma-3n-merged.Q8_0.gguf` (6.8GB)
- `Modelfile`
2. **Create the model**:
```bash
ollama create oncoscope -f Modelfile
```
3. **Run inference**:
```bash
ollama run oncoscope "Analyze the clinical significance of BRCA1 c.5266dupC mutation"
```
### Example Usage
```bash
ollama run oncoscope "Patient: 45-year-old female with family history of breast cancer.
Mutation: BRCA1 c.68_69delAG (p.Glu23ValfsTer17).
Please provide pathogenicity assessment and recommendations."
```
**Example Response**:
```json
{
"mutation_analysis": {
"gene": "BRCA1",
"variant": "c.68_69delAG",
"protein_change": "p.Glu23ValfsTer17",
"pathogenicity": "Pathogenic",
"confidence_score": 0.95,
"acmg_classification": "PVS1, PM2, PP3"
},
"clinical_significance": {
"cancer_risk": "High",
"associated_cancers": ["Breast", "Ovarian"],
"lifetime_risk": {
"breast_cancer": "55-85%",
"ovarian_cancer": "15-40%"
}
},
"recommendations": {
"genetic_counseling": "Strongly recommended",
"screening": "Enhanced surveillance starting age 25",
"prevention": "Consider prophylactic surgery",
"family_testing": "Cascade testing recommended"
}
}
```
## Model Capabilities
- **Pathogenicity Assessment**: ACMG/AMP guideline compliance
- **Risk Calculation**: Quantitative cancer risk estimates
- **Drug Recommendations**: FDA-approved targeted therapies
- **Family History Analysis**: Hereditary pattern recognition
- **Genetic Counseling**: Evidence-based guidance
- **Multi-lingual Support**: Medical terminology in multiple languages
## Limitations
- **Medical Disclaimer**: This model is for research and educational purposes only. Always consult qualified healthcare professionals for medical decisions.
- **Training Cutoff**: Knowledge based on training data through early 2024
- **Quantization**: Some precision loss due to Q8_0 quantization
- **Context Window**: Limited to 4,096 tokens for optimal performance
## Technical Specifications
- **Model Size**: 6.8GB (GGUF Q8_0)
- **Memory Requirements**: 8GB+ RAM recommended
- **Hardware**: CPU inference optimized, GPU acceleration supported
- **Operating Systems**: Cross-platform (macOS, Linux, Windows)
## Performance
The model demonstrates expert-level performance on:
- Variant pathogenicity classification (>90% accuracy vs. clinical consensus)
- Cancer risk assessment correlation with established guidelines
- Therapeutic recommendation alignment with FDA approvals
- Response time: 20-40 seconds for complex genomic analysis
## Privacy & Security
- **On-Device Inference**: No data transmitted to external servers
- **HIPAA Compliance**: Suitable for clinical environments
- **Offline Operation**: Full functionality without internet connection
- **Data Security**: Patient genetic information remains local
## Citation
If you use this model in your research, please cite:
```bibtex
@misc{oncoscope2025,
title={OncoScope: Privacy-First Cancer Genomics Analysis with Gemma 3n},
author={Sheldon Aristide},
year={2025},
url={https://huggingface.co/Zero21/OncoScope}
}
```
## License
This model is released under the Apache 2.0 license, consistent with the base Gemma model licensing.
## Support & Contact
For questions, issues, or contributions:
- GitHub: [OncoScope Project](https://github.com/Aristide021/OncoScope)
- Issues: Please report bugs or feature requests via GitHub Issues
## Disclaimer
This AI model is intended for research and educational purposes only. It should not be used as a substitute for professional medical advice, diagnosis, or treatment. Always seek the advice of qualified healthcare professionals regarding any medical condition or genetic testing decisions. |