Jspinad committed
Commit 38d1860 (verified) · 1 Parent(s): 75ca407

Add model card for binary

Files changed (1)
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
+ ---
+ language:
+ - dna
+ tags:
+ - biology
+ - genomics
+ - transposable-elements
+ - dnabert
+ - bilstm
+ - sequence-classification
+ license: mit
+ ---
+
+ # TE-GER — Binary Detection
+
+ Part of the **TE-GER** (Transposable Elements Genomic Entity Recognition) toolkit.
+
+ TE-GER binary model: detects presence/absence of Transposable Elements (TE vs Background) in genomic sequences. Architecture: DNABERT-2 + BiLSTM hybrid. Labels: Background, TE.
+
+ ## Model Architecture
+
+ - **Base:** [DNABERT-2](https://huggingface.co/zhihan1996/DNABERT-2-117M) (DNA language model)
+ - **Head:** Bidirectional LSTM + Linear Classifier
+ - **Input:** 512 bp sliding windows over raw FASTA sequences
+ - **Task:** Sequence classification (token-level TE annotation)
+
+ ## Usage
+
+ Use this model via the [TE-GER CLI](https://github.com/johanpina/te-ger):
+
+ ```bash
+ python Te_annotator.py genome.fasta output.gff3 --level binary
+ ```
+
+ ## Labels
+
+ - `0`: Background
+ - `1`: TE
+
+ ## Citation
+
+ Developed by Johan S. Piña — 2025
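
Reviewer note: the "512 bp sliding windows over raw FASTA sequences" input listed under Model Architecture could be sketched roughly as below. This is a minimal illustration under stated assumptions, not the TE-GER implementation. The function names `read_fasta` and `sliding_windows`, the stride of 256 bp, and the choice to keep a shorter final window are all hypothetical; the model card specifies only the 512 bp window size.

```python
from typing import Dict, Iterator, List, Tuple

WINDOW_SIZE = 512  # window length in bp, as stated in the model card


def read_fasta(lines: List[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into {sequence_id: uppercase sequence}."""
    records: Dict[str, List[str]] = {}
    current = None
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # The sequence ID is the first whitespace-delimited header token.
            header = line[1:].split()
            current = header[0] if header else ""
            records[current] = []
        elif current is not None:
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def sliding_windows(
    seq: str, window: int = WINDOW_SIZE, stride: int = 256
) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end, subsequence) using 0-based, end-exclusive coordinates.

    The stride is an assumption (not given in the model card). The final
    window may be shorter than `window` when the sequence length is not
    covered exactly.
    """
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    start = 0
    while start < len(seq):
        end = min(start + window, len(seq))
        yield start, end, seq[start:end]
        if end == len(seq):
            break  # the sequence end has been reached
        start += stride


if __name__ == "__main__":
    demo = read_fasta([">chr1 demo record", "ACGT" * 250])
    for start, end, _ in sliding_windows(demo["chr1"]):
        print(start, end)
```

Each window would then be passed to the classifier, which per the Labels section assigns `0` (Background) or `1` (TE). How overlapping window predictions are merged into GFF3 features is not described in the model card and is not covered by this sketch.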