Commit History
tensorboard add to reqs (#407) e8fa43a verified
dictionaries from parent dir (#405) 85f295e verified
remove token dictionary and unpickling from init (#403) 7eca269 verified
move dict loading to function in eval utils 57bc17e
edit doc formatting fce3f6e
edit docs formatting ef094b2
add mtl_classifier to docs bedb3b7
add input size tip to instructions 2732369 verified
update tokenizer to defaults for 95M models for special token and input size da8cf3d verified
update instructions to include reminder about token dictionary cb1b0d5 verified
pointing dictionaries from the mtl module's init (#397) 7470753 verified
Refactored token dictionary loading and encapsulated dictionary (#398) beb62a4 verified
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395) 2e06f1a verified
sync token_dictionary variable name w/ classifier a021deb verified
update setup with req and manifest with updated filenames a34fbc2
fix imports mtl/eval_utils eab1878
allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390) 47e0ef8 verified
peft>=0.11.1 (#387) 4c9dda5 verified
"save_model_without_heads" is redundant (#385) de10ab0 verified
comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382) 22bf20f verified
fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380) fe1640b verified
Update README.md 11bcee7 verified
precommit formatting f07bfd7
update with 12L and 20L i4096 gc95M models, multitask and quantiz code 933ca80
rename for consistency ec19834 verified
delete old gene name dict 817eca2 verified
update to only have gene names as keys in gene_name_id_dict e61485e verified
Add function for summing of Ensembl IDs (#377) 1e18102 verified
save pval b07f4b1
add typing list import 42053dc
move dicts to init ea428cb
add random state to umap eb2a04b
update get_embs with token_gene_dict arg ace12e9 verified
update refs to get_model_emb_dims 3fe35ba verified
embs_df with all model embeddings (#363) 2e64874 verified
Add function to get number of model embeddings (#364) c90d791 verified
clone embs_i to resolve memory leak in cls embs 57f02a4
update perturber stats to reflect cos sim and emb_extractor to suppress warnings for non-cls 25dd1da
update to account for set of perturbed genes with aggregate_data eb038a6
update to enable cls emb b2bbd7c
update tokenizer to include eos token ead0550
Christina Theodoris commited on
Update geneformer/emb_extractor.py (#350) 471eefc verified
Upload in_silico_perturber_stats.py (#313) 8aee0ff verified
fix cell state gene embeddings bug (#345) c0e7b19 verified
patch datasets save_to_disk 75c67a1
Christina Theodoris commited on
update kwargs for pretrainer fb130e6
Christina Theodoris commited on
refer to token dictionary in self 86fe0dd
Christina Theodoris commited on