Statistical-Impossibility commited on
Commit
c05929b
·
verified ·
1 Parent(s): 1d18970

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: cc-by-nc-4.0
4
+ library_name: transformers
5
+ pipeline_tag: token-classification
6
+ tags:
7
+ - veterinary
8
+ - nlp
9
+ - named-entity-recognition
10
+ - biomedical
11
+ - feline
12
+ ---
13
+
14
+ # Feline-NER
15
+
16
+ ## Model Description
17
+ **Feline-NER** is a transformer-based named entity recognition (NER) model for **feline veterinary scientific literature**.
18
+
19
+ The model follows the BERT architecture and was initialized from **BioBERT v1.2**, further domain-adapted on feline-related PubMed Central articles (*Feline-BERT*), and fine-tuned for NER (*Feline-NER*).
20
+
21
+ This model is intended **solely for research and educational use**.
22
+
23
+ ## Task
24
+ Span-level named entity recognition using a BIO tagging scheme over five entity types:
25
+ - DISEASE
26
+ - SYMPTOM
27
+ - MEDICATION
28
+ - PROCEDURE
29
+ - ANATOMY
30
+
31
+ ## Training Data
32
+ Fine-tuned on a manually annotated dataset of **1,300 sentences** extracted from feline-related PubMed Central articles.
33
+ Annotations were produced by a non-veterinary researcher using an iterative human-in-the-loop workflow with LLM-assisted pre-labeling.
34
+
35
+ ## Evaluation
36
+ - **Macro F1:** ~0.65
37
+ - **Micro F1:** ~0.64
38
+ (Evaluated on a 150-sentence held-out test set)
39
+
40
+ Performance varies by entity type; PROCEDURE and ANATOMY remain challenging due to boundary ambiguity.
41
+
42
+ ## Intended Use
43
+ - Veterinary NLP research
44
+ - Information extraction from feline scientific literature
45
+ - Educational demonstrations
46
+
47
+ ## ⚠️ Limitations & Warnings
48
+ - **NOT FOR CLINICAL USE**
49
+ - Not validated for diagnosis or treatment
50
+ - Annotation noise and boundary ambiguity are present
51
+ - Single-annotator dataset
52
+
53
+ ## Author
54
+ Statistical-Impossibility