RohanKarthikeyan commited on
Commit
404cda1
·
verified ·
1 Parent(s): 27163e0

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -15,6 +15,16 @@ tags:
15
 
16
  A MCP server for extracting and normalizing domain-specific entities from biomedical text. We leverage OpenAI LLMs to identify entities and match them to standardized terminology.
17
 
 
 
 
 
 
 
 
 
 
 
18
  ## Installation
19
 
20
  This project uses `uv` from Astral for dependency management. Follow these steps to set up the project:
@@ -81,3 +91,11 @@ The server exposes an MCP-compatible endpoint that can be used by AI agents. The
81
  - `target_entity`: Type of entity to extract ("Disease", "Tissue", or "Cell Type")
82
 
83
  and returns a list of normalized entities.
 
 
 
 
 
 
 
 
 
15
 
16
  A MCP server for extracting and normalizing domain-specific entities from biomedical text. We leverage OpenAI LLMs to identify entities and match them to standardized terminology.
17
 
18
+ ## Motivation
19
+
20
+ Biomedical text normalization addresses a critical challenge in healthcare informatics: approximately 80% of electronic health record (EHR) data exists as unstructured medical text. Such text often contains abbreviations, misspellings, and non-standardized terminology, creating barriers to effective data utilization. This variability hinders leveraging clinical narratives for:
21
+
22
+ - **Clinical decision support** at the point of care
23
+ - **Patient comprehension** of their own medical records
24
+ - **Biomedical research** including cohort identification and pharmacovigilance
25
+
26
+ By implementing named entity recognition and normalization to controlled vocabularies like SNOMED-CT, our MCP server enables downstream applications to process biomedical text with greater accuracy, bridging the gap between natural clinical language and structured data requirements of modern healthcare systems.
27
+
28
  ## Installation
29
 
30
  This project uses `uv` from Astral for dependency management. Follow these steps to set up the project:
 
91
  - `target_entity`: Type of entity to extract ("Disease", "Tissue", or "Cell Type")
92
 
93
  and returns a list of normalized entities.
94
+
95
+ ## Future Improvements
96
+
97
+ Our biomedical text normalization MCP server can be enhanced in several ways:
98
+
99
+ - Expanded Entity Coverage: Extend beyond the current entity types (Disease, Tissue, Cell Type) to include medications, procedures, laboratory tests, and genomic entities.
100
+ - User Feedback Loop: Implement a mechanism for users to correct normalization errors, creating a dataset for continuous model improvement.
101
+ - Multilingual Support: Expand capabilities to handle medical text in languages beyond English.