NTv3 Models Text & DNA Collection DNA Models are trained on single bp MLM on OpenGenome2. Text models are trained on single character MLM on Wikipedia. Models trained on 40B tokens. • 6 items • Updated 22 days ago
BERT Models Text & DNA Collection Models trained for the paper "Entropy, Disagreement, and the Limits of Foundation Models in Genomics". • 15 items • Updated Mar 3 • 1