ctheodoris
/

Geneformer

Model card Files Files and versions

Commit History

adding tensorboard to setup.py

0560619
verified

madhavanvenkatesh commited on Sep 5, 2024

tensorboard add to reqs (#407)

e8fa43a
verified

madhavanvenkatesh commited on Sep 5, 2024

dictionaries from parent dir (#405)

85f295e
verified

madhavanvenkatesh commited on Sep 3, 2024

remove token dictionary and unpickling from init (#403)

7eca269
verified

madhavanvenkatesh commited on Sep 3, 2024

move dict loading to function in eval utils

57bc17e

ctheodoris commited on Sep 2, 2024

edit doc formatting

fce3f6e

ctheodoris commited on Sep 2, 2024

edit docs formatting

ef094b2

ctheodoris commited on Sep 2, 2024

add mtl_classifier to docs

bedb3b7

ctheodoris commited on Sep 2, 2024

add input size tip to instructions

2732369
verified

ctheodoris commited on Sep 2, 2024

update tokenizer to defaults for 95M models for special token and input size

da8cf3d
verified

ctheodoris commited on Sep 2, 2024

update instructions to include reminder about token dictionary

cb1b0d5
verified

ctheodoris commited on Sep 2, 2024

pointing dictionaries from the mtl module's init (#397)

7470753
verified

madhavanvenkatesh commited on Aug 28, 2024

Refactored token dictionary loading and encapsulated dictionary (#398)

beb62a4
verified

madhavanvenkatesh commited on Aug 28, 2024

Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395)

2e06f1a
verified

madhavanvenkatesh commited on Aug 28, 2024

sync token_dictionary variable name w/ classifier

a021deb
verified

ctheodoris commited on Aug 26, 2024

update setup with req and manifest with updated filenames

a34fbc2

ctheodoris commited on Aug 21, 2024

fix imports mtl/eval_utils

eab1878

ctheodoris commited on Aug 20, 2024

allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390)

47e0ef8
verified

madhavanvenkatesh commited on Aug 21, 2024

peft>=0.11.1 (#387)

4c9dda5
verified

madhavanvenkatesh commited on Aug 21, 2024

"save_model_without_heads" is redundant (#385)

de10ab0
verified

madhavanvenkatesh commited on Aug 21, 2024

comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382)

22bf20f
verified

madhavanvenkatesh commited on Aug 21, 2024

fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380)

fe1640b
verified

madhavanvenkatesh commited on Aug 21, 2024

Update README.md

11bcee7
verified

ctheodoris commited on Aug 19, 2024

precommit formatting

f07bfd7

ctheodoris commited on Aug 15, 2024

update with 12L and 20L i4096 gc95M models, multitask and quantiz code

933ca80

ctheodoris commited on Aug 15, 2024

rename for consistency

ec19834
verified

ctheodoris commited on Aug 11, 2024

delete old gene name dict

817eca2
verified

ctheodoris commited on Aug 11, 2024

update to only have gene names as keys in gene_name_id_dict

e61485e
verified

ctheodoris commited on Aug 11, 2024

Add function for summing of Ensembl IDs (#377)

1e18102
verified

hchen725 commited on Aug 11, 2024

save pval

b07f4b1

ctheodoris commited on Jul 15, 2024

add typing list import

42053dc

ctheodoris commited on Jul 15, 2024

move dicts to init

ea428cb

ctheodoris commited on Jul 13, 2024

add random state to umap

eb2a04b

ctheodoris commited on Jul 13, 2024

update get_embs with token_gene_dict arg

ace12e9
verified

ctheodoris commited on Jul 10, 2024

update refs to get_model_emb_dims

3fe35ba
verified

ctheodoris commited on Jul 10, 2024

embs_df with all model embeddings (#363)

2e64874
verified

hchen725 commited on Jul 9, 2024

Add function to get number of model embeddings (#364)

c90d791
verified

hchen725 commited on Jul 9, 2024

clone embs_i to resolve memory leak in cls embs

57f02a4

ctheodoris commited on Jul 8, 2024

update perturber stats to reflect cos sim and emb_extractor to suppress warnings for non-cls

25dd1da

ctheodoris commited on Jul 7, 2024

update to account for set of perturbed genes with aggregate_data

eb038a6

ctheodoris commited on Jul 2, 2024

update to enable cls emb

b2bbd7c

ctheodoris commited on Jun 30, 2024

update tokenizer to include eos token

ead0550

Christina Theodoris commited on Jun 19, 2024

Update geneformer/emb_extractor.py (#350)

471eefc
verified

hchen725 commited on Jun 13, 2024

Upload in_silico_perturber_stats.py (#313)

8aee0ff
verified

davidjwen commited on Jun 7, 2024

fix cell state gene embeddings bug (#345)

c0e7b19
verified

ctheodoris commited on Jun 7, 2024

patch datasets save_to_disk

75c67a1

Christina Theodoris commited on May 21, 2024

update kwargs for pretrainer

fb130e6

Christina Theodoris commited on Apr 21, 2024

refer to token dictionary in self

86fe0dd

Christina Theodoris commited on Apr 14, 2024

Update for gene classification (#330)

94095d1
verified

hchen725 commited on Apr 8, 2024

Update with gene classifier, custom token dict, and str validate options (#329)

0568479
verified

hchen725 commited on Apr 8, 2024