Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,44 @@
|
|
| 1 |
-
---
|
| 2 |
-
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
|
| 5 |
+
license: mit
|
| 6 |
+
tags:
|
| 7 |
+
- biology
|
| 8 |
+
- genomics
|
| 9 |
+
- transcriptomics
|
| 10 |
+
- cancer
|
| 11 |
+
- bulk-rnaseq
|
| 12 |
+
- foundation-model
|
| 13 |
+
- masked-reconstruction
|
| 14 |
+
- performer
|
| 15 |
+
- gcn
|
| 16 |
+
- pytorch
|
| 17 |
+
|
| 18 |
+
library_name: pytorch
|
| 19 |
+
pipeline_tag: feature-extraction
|
| 20 |
+
|
| 21 |
+
model_name: CancerTranscriptome-Mini-48M
|
| 22 |
+
model_type: transformer
|
| 23 |
+
|
| 24 |
+
datasets:
|
| 25 |
+
- ARCHS4
|
| 26 |
+
|
| 27 |
+
papers:
|
| 28 |
+
- https://doi.org/10.1101/2025.06.11.659222
|
| 29 |
+
|
| 30 |
+
authors:
|
| 31 |
+
- name: Walter Alvarado
|
| 32 |
+
affiliation: NASA Ames Research Center
|
| 33 |
+
github: https://github.com/alwalt
|
| 34 |
+
|
| 35 |
+
model_size:
|
| 36 |
+
total_params: 48336162
|
| 37 |
+
|
| 38 |
+
description: >
|
| 39 |
+
CancerTranscriptome-Mini-48M is a small, proof-of-concept BulkFormer-inspired model
|
| 40 |
+
trained on cancer-only bulk RNA-seq (ARCHS4, TCGA, GEO). It integrates ESM2 gene
|
| 41 |
+
identity embeddings, Rotary Expression Embeddings (REE), GCN message passing, local
|
| 42 |
+
bin-based Performer attention, and global Performer attention. This model is designed
|
| 43 |
+
as a research prototype showing that BulkFormer-like architectures can be trained and
|
| 44 |
+
used end-to-end on a single consumer GPU.
|