File size: 852 Bytes
b97b478
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
298699b
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
# Gene Extraction Model

This model is fine-tuned for gene extraction using BERT-CRF architecture.

## Usage

```python
from transformers import AutoTokenizer, AutoModelForTokenClassification
from transformers import pipeline

# Load model and tokenizer
model_name = "RaduGabriel/gene-entity-recognition"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name)

# Create NER pipeline
ner_pipeline = pipeline(
    "ner",
    model=model,
    tokenizer=tokenizer,
    aggregation_strategy="simple"
)

# Example usage
text = "The BRCA1 gene is associated with breast cancer."
results = ner_pipeline(text)
```

## Labels
- O
- B-GENE
- I-GENE
- E-GENE
- S-GENE

## Model Details
- Architecture: BERT-CRF
- Base Model: answerdotai/ModernBERT-large
- Number of Labels: 5
- CRF Layer: Enabled