Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ctheodoris
/
Geneformer
like
274
Fill-Mask
Transformers
Safetensors
ctheodoris/Genecorpus-30M
bert
single-cell
genomics
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
582
Deploy
Use this model
main
Geneformer
/
geneformer
11.5 MB
20 contributors
History:
169 commits
ctheodoris
Update geneformer/mtl/train.py (
#582
)
05fcbeb
6 days ago
gene_dictionaries_30m
Update geneformer/tokenizer.py (#415)
over 1 year ago
mtl
Update geneformer/mtl/train.py (#582)
6 days ago
__init__.py
1.65 kB
update with V2 models
8 months ago
classifier.py
67.4 kB
track metadata in predictions
3 months ago
classifier_utils.py
24.4 kB
default id_class_dict to None for gene classifiers
2 months ago
collator_for_classification.py
31.7 kB
silence tensor copy warning
8 months ago
emb_extractor.py
35.4 kB
add emb extractor option for saving all gene embs
3 months ago
ensembl_mapping_dict_gc104M.pkl
3.96 MB
xet
add V2 models
8 months ago
evaluation_utils.py
10.5 kB
track metadata in predictions
3 months ago
gene_median_dictionary_gc104M.pkl
1.51 MB
xet
add V2 models
8 months ago
gene_name_id_dict_gc104M.pkl
1.66 MB
xet
add V2 models
8 months ago
in_silico_perturber.py
67.5 kB
update V1 token dict usage to self attr
8 months ago
in_silico_perturber_stats.py
45.9 kB
add pickle suffix option to isp stats
3 months ago
mtl_classifier.py
14.5 kB
fully qualified imports to resolve name-space conflicts (#532)
9 months ago
perturber_utils.py
32.5 kB
Fix TypeError in make_perturbation_batch_special (#567)
3 months ago
pretrainer.py
29.5 kB
remove unused imports while no longer using distributed sampler
about 1 year ago
token_dictionary_gc104M.pkl
426 kB
xet
add V2 models
8 months ago
tokenizer.py
34.7 kB
add empty tokenized_counts for loom to pass until keep_counts implemented
4 months ago