| license: apache-2.0 | |
| llama-gene train datasets | |
| # DNA | |
| dna/ | |
| dna_seq.txt dna sequence | |
| sft_dna_eva.json,sft_dna_train.json dna instruction finetune data | |
| # protein data | |
| protein/ | |
| uni_16.fasta.line protein sequence | |
| protein_sft_train.json, sft_protein_eva.json protein instruction finetune data | |