christine-withers committed on
Commit 2967af6 · verified · 1 Parent(s): 8fb0ee8

Update README.md

Files changed (1)
  1. README.md +23 -4
README.md CHANGED
@@ -1,10 +1,29 @@
  ---
  title: README
- emoji: 📈
- colorFrom: gray
- colorTo: gray
+ emoji: 🐠
+ colorFrom: pink
+ colorTo: purple
  sdk: static
  pinned: false
  ---
 
- Edit this `README.md` markdown file to author your organization card.
+ # OTAR3088 NLP-model organisation
+
+ ## Work Package 1 - Knowledge Extraction (NLP)
+
+ _**Background**_
+
+ Within this working group of the wider _OTAR3088, 'Automating Knowledge Management'_ project, we aim to modernise and extend the current named entity recognition workflows of EuropePMC / Open Targets to cover a range of entity types relevant to drug discovery (such as variants, biomarkers, tissues/cell types, adverse events, and assay conditions). These new entities will provide higher confidence in the relevance of a target-disease association.
+
+ Since NLP models are constantly updated and fine-tuned, we have created a modular, flexible framework that facilitates the creation of new NLP models.
+
+ _**OTAR3088 HuggingFace**_
+
+ This organisation space documents the project's data development and model generation. Data is grouped by the broader entity type the group is studying at a given time, and the sources of the data are described in the data cards. Output models are also shared here.
+
+
+ _**Learn more about our project and related resources:**_
+
+ * [OTAR3088 - The project](https://aws.amazon.com/ec2/instance-types/g4/?trk=5767130d-03c1-443e-908e-5202162d30d3&sc_channel=el)
+ * [Our flexible NLP-model production pipeline](https://github.com/ML4LitS/OTAR3088)
+ * [Helpful wider-context reading](https://www.tandfonline.com/doi/full/10.1080/17460441.2025.2490835)