Add paper link and update model card metadata
#1
by nielsr HF Staff - opened
README.md
CHANGED
@@ -1,19 +1,22 @@
 ---
-license: cc-by-nc-4.0
 language:
 - en
+library_name: transformers
+license: cc-by-nc-4.0
+pipeline_tag: text-generation
 tags:
 - medical
 - text-generation
 - language-model
 - biopan
 - jepa
-
-pipeline_tag: text-generation
+arxiv: 2601.22128
 ---
 
 # SMB-v1-8B-Structure
 
+This repository contains the model weights for **SMB-v1-8B-Structure**, as introduced in the paper [The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR](https://huggingface.co/papers/2601.22128).
+
 ## Documentation & Quickstart
 
 For a comprehensive guide on getting started, architecture details, and advanced usage, please visit our official documentation: [**📖 SMB-v1 Quickstart Guide**](https://docs.standardmodel.bio/get-started/quickstart)
@@ -29,7 +32,7 @@ For a comprehensive guide on getting started, architecture details, and advanced
 
 ## Model Description
 
-**SMB-v1-
+**SMB-v1-8B-Structure** is the initial release of the SMB-v1 family, specifically engineered to model the complex, time-varying dynamics of cancer biology through structured clinical signals. It treats structured clinical data as a multimodal environment, fusing heterogeneous data streams into a unified patient state representation.
 
 Unlike general-purpose models, SMB-v1 is designed to ingest and synthesize diverse structured modalities across the patient journey, including:
 
@@ -74,7 +77,7 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 from smb_biopan_utils import process_ehr_info
 
 # 1. Load Model and Tokenizer
-model_id = "standardmodelbio/SMB-v1-
+model_id = "standardmodelbio/SMB-v1-8B-Structure"
 tokenizer = AutoTokenizer.from_pretrained(model_id)
 model = AutoModelForCausalLM.from_pretrained(
     model_id,
@@ -114,11 +117,10 @@ print(f"Patient Representation Shape: {patient_embedding.shape}")
 If you use this model in your research or application, please cite:
 
 ```bibtex
-@
-
-
-
-
-  url = {https://huggingface.co/standardmodelbio/SMB-v1-8B-Structure}
+@article{chen2025patient,
+  title={The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR},
+  author={Chen, Jiuhai and Yang, Jianwei and Wu, Haiping and Li, Dianqi and Gao, Jianfeng and Zhou, Tianyi and Xiao, Bin},
+  journal={arXiv preprint arXiv:2601.22128},
+  year={2025}
 }
 ```
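The front matter after this change can be checked locally before merging. The following is a minimal sketch and not part of the PR: `parse_front_matter` is a hypothetical helper that reads only flat `key: value` pairs using the standard library (a real check would more likely use a full YAML parser), and `README_HEAD` is copied from the new side of the diff above.

```python
def parse_front_matter(readme_text):
    """Return top-level `key: value` pairs from a README's YAML front matter.

    Deliberately simplified (hypothetical helper): list items and nested
    structures are skipped, so this is not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    metadata = {}
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter reached
            return metadata
        # Skip list items, indented lines, comments, and lines without a key.
        if line.startswith(("-", " ", "#")) or ":" not in line:
            continue
        key, _, value = line.partition(":")
        metadata[key.strip()] = value.strip()
    return {}  # opening delimiter without a closing one


# New README header as it appears in the diff above.
README_HEAD = """---
language:
- en
library_name: transformers
license: cc-by-nc-4.0
pipeline_tag: text-generation
tags:
- medical
- text-generation
- language-model
- biopan
- jepa
arxiv: 2601.22128
---

# SMB-v1-8B-Structure
"""

metadata = parse_front_matter(README_HEAD)

# Keys this PR adds or moves in the metadata block.
expected_keys = ("library_name", "license", "pipeline_tag", "arxiv")
missing = [key for key in expected_keys if key not in metadata]

print("library_name:", metadata.get("library_name"))
print("missing keys:", missing)
```

Keys that introduce a list, such as `language` and `tags`, come back with an empty string as their value in this sketch, since the list items themselves are skipped.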