Add paper link and update model card metadata

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +13 -11
README.md CHANGED
@@ -1,19 +1,22 @@
1
  ---
2
- license: cc-by-nc-4.0
3
  language:
4
  - en
 
 
 
5
  tags:
6
  - medical
7
  - text-generation
8
  - language-model
9
  - biopan
10
  - jepa
11
- library_name: transformers
12
- pipeline_tag: text-generation
13
  ---
14
 
15
  # SMB-v1-8B-Structure
16
 
 
 
17
  ## Documentation & Quickstart
18
 
19
  For a comprehensive guide on getting started, architecture details, and advanced usage, please visit our official documentation: [**📖 SMB-v1 Quickstart Guide**](https://docs.standardmodel.bio/get-started/quickstart)
@@ -29,7 +32,7 @@ For a comprehensive guide on getting started, architecture details, and advanced
29
 
30
  ## Model Description
31
 
32
- **SMB-v1-1.7B-Structure** is the initial release of the SMB-v1 family, specifically engineered to model the complex, time-varying dynamics of cancer biology through structured clinical signals. It treats structured clinical data as a multimodal environment, fusing heterogeneous data streams into a unified patient state representation.
33
 
34
  Unlike general-purpose models, SMB-v1 is designed to ingest and synthesize diverse structured modalities across the patient journey, including:
35
 
@@ -74,7 +77,7 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
74
  from smb_biopan_utils import process_ehr_info
75
 
76
  # 1. Load Model and Tokenizer
77
- model_id = "standardmodelbio/SMB-v1-1.7B-Structure"
78
  tokenizer = AutoTokenizer.from_pretrained(model_id)
79
  model = AutoModelForCausalLM.from_pretrained(
80
  model_id,
@@ -114,11 +117,10 @@ print(f"Patient Representation Shape: {patient_embedding.shape}")
114
  If you use this model in your research or application, please cite:
115
 
116
  ```bibtex
117
- @misc{biopan_omni,
118
- author = {standardmodelbio},
119
- title = {SMB-v1-8B-Structure},
120
- year = {2025},
121
- publisher = {HuggingFace},
122
- url = {https://huggingface.co/standardmodelbio/SMB-v1-8B-Structure}
123
  }
124
  ```
 
1
  ---
 
2
  language:
3
  - en
4
+ library_name: transformers
5
+ license: cc-by-nc-4.0
6
+ pipeline_tag: text-generation
7
  tags:
8
  - medical
9
  - text-generation
10
  - language-model
11
  - biopan
12
  - jepa
13
+ arxiv: 2601.22128
 
14
  ---
15
 
16
  # SMB-v1-8B-Structure
17
 
18
+ This repository contains the model weights for **SMB-v1-8B-Structure**, as introduced in the paper [The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR](https://huggingface.co/papers/2601.22128).
19
+
20
  ## Documentation & Quickstart
21
 
22
  For a comprehensive guide on getting started, architecture details, and advanced usage, please visit our official documentation: [**📖 SMB-v1 Quickstart Guide**](https://docs.standardmodel.bio/get-started/quickstart)
 
32
 
33
  ## Model Description
34
 
35
+ **SMB-v1-8B-Structure** is the initial release of the SMB-v1 family, specifically engineered to model the complex, time-varying dynamics of cancer biology through structured clinical signals. It treats structured clinical data as a multimodal environment, fusing heterogeneous data streams into a unified patient state representation.
36
 
37
  Unlike general-purpose models, SMB-v1 is designed to ingest and synthesize diverse structured modalities across the patient journey, including:
38
 
 
77
  from smb_biopan_utils import process_ehr_info
78
 
79
  # 1. Load Model and Tokenizer
80
+ model_id = "standardmodelbio/SMB-v1-8B-Structure"
81
  tokenizer = AutoTokenizer.from_pretrained(model_id)
82
  model = AutoModelForCausalLM.from_pretrained(
83
  model_id,
 
117
  If you use this model in your research or application, please cite:
118
 
119
  ```bibtex
120
+ @article{chen2025patient,
121
+ title={The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR},
122
+ author={Chen, Jiuhai and Yang, Jianwei and Wu, Haiping and Li, Dianqi and Gao, Jianfeng and Zhou, Tianyi and Xiao, Bin},
123
+ journal={arXiv preprint arXiv:2601.22128},
124
+ year={2025}
 
125
  }
126
  ```