File size: 2,264 Bytes
c05929b
 
887d47e
c05929b
 
 
 
 
f2aacf6
c05929b
 
 
f2aacf6
c05929b
 
 
 
887d47e
 
c05929b
 
887d47e
 
 
 
 
 
 
c05929b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
887d47e
 
 
 
 
 
 
 
 
 
 
c05929b
 
 
 
 
 
 
283c607
451d2a9
88bc88f
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
---
language: en
license: apache-2.0
library_name: transformers
pipeline_tag: token-classification
tags:
- veterinary
- nlp
- ner
- named-entity-recognition
- biomedical
- feline
- bert
---

# Feline-NER

Named Entity Recognition model for feline veterinary medicine, trained on 1,300 annotated sentences from PubMed literature.

## Model Description

**Feline-NER** is a token classification model fine-tuned for extracting clinical entities from feline veterinary scientific literature.

**Model lineage:**
- Base: **BERT** (Devlin et al., 2019)
- Domain-adapted: **BioBERT v1.2** (Lee et al., 2019) - biomedical literature
- Further adapted: **Feline-BERT** - 11,830 feline PubMed articles (MLM fine-tuning)
- Task-specific: **Feline-NER** - 1,300 manually annotated sentences (token classification)

This model is intended **solely for research and educational use**.

## Task
Span-level named entity recognition using a BIO tagging scheme over five entity types:
- DISEASE
- SYMPTOM
- MEDICATION
- PROCEDURE
- ANATOMY

## Training Data
Fine-tuned on a manually annotated dataset of **1,300 sentences** extracted from feline-related PubMed Central articles.
Annotations were produced by a non-veterinary researcher using an iterative human-in-the-loop workflow with LLM-assisted pre-labeling.

## Evaluation
- **Macro F1:** ~0.65  
- **Micro F1:** ~0.64  
(Evaluated on a 150-sentence held-out test set)

Performance varies by entity type; PROCEDURE and ANATOMY remain challenging due to boundary ambiguity.

## Intended Use
- Veterinary NLP research
- Information extraction from feline scientific literature
- Educational demonstrations

## Usage

This model can be used with the Hugging Face `pipeline` API:

```python
from transformers import pipeline
ner = pipeline("ner", model="Statistical-Impossibility/Feline-NER", aggregation_strategy="simple")
print(ner("The cat was diagnosed with FIV and treated with prednisolone."))
```


## ⚠️ Limitations & Warnings
- **NOT FOR CLINICAL USE**
- Not validated for diagnosis or treatment
- Annotation noise and boundary ambiguity are present
- Single-annotator dataset

## Author
**Statistical-Impossibility**

Project repository: [Feline-Project](https://github.com/Statistical-Impossibility/Feline-Project)