---
license: cc-by-nc-4.0
datasets:
- kartoun/Alcohol_Use_Clinical_Notes_GPT4
metrics:
- precision
- recall
- accuracy
- f1
language:
- en
base_model:
- UFNLP/gatortron-base
pipeline_tag: text-classification
tags:
- emrs
- ehrs
- clinical
- alcohol
- liver
- hepatology
---
**Acknowledgment:** This project utilizes the dataset and fine-tuned model developed by Dr. Uri Kartoun.

**Article:** [Leveraging Large Language Models for Enhanced Clinical Narrative Analysis: An Application in Alcohol Use Detection](https://www.linkedin.com/pulse/leveraging-large-language-models-enhanced-clinical-uri-w6qye/?trackingId=06CMNcZa95lObWau2Ha%2FEg%3D%3D)

**Overview:** This repository hosts a model fine-tuned to detect expressions of alcohol use in clinical narratives. It was fine-tuned on 1,000 simulated expressions, each labeled as either 'inappropriate use of alcohol' or 'no use or acceptable use of alcohol'. It may be particularly useful for studies that treat alcohol consumption as a significant covariate, such as liver disease studies that exclude patients from a cohort because of their alcohol use.

**Model Description:** The base model, UFNLP/gatortron-base, has been fine-tuned to classify expressions related to alcohol use into the two categories above. The adaptation targets nuanced medical texts in which a patient's alcohol use status is relevant.

**Performance:** The fine-tuned model classifies alcohol-related expressions with high accuracy on a held-out set; the ROC curve below summarizes its classification performance.

**Classification performance using a held-out set:**

![ROC curve](https://github.com/kartoun/alcohol_use_classification_llms/blob/main/images/ROC%20Feb%209%202025.png?raw=true)
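The metadata for this model lists precision, recall, accuracy, and F1 as evaluation metrics. As a minimal sketch of how these can be computed for a binary classifier on a held-out set, the helper below works from plain Python lists. The function name `binary_metrics` and the encoding (1 for 'inappropriate use of alcohol', 0 for 'no use or acceptable use of alcohol') are assumptions made for illustration, and the labels in the usage note are toy values, not results from this model.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute precision, recall, accuracy, and F1 for binary labels.

    Assumed encoding (illustrative only): 1 = 'inappropriate use of alcohol',
    0 = 'no use or acceptable use of alcohol'.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Confusion-matrix counts, treating label 1 as the positive class.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)

    # Guard every ratio against a zero denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    accuracy = (tp + tn) / len(y_true) if len(y_true) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"precision": precision, "recall": recall, "accuracy": accuracy, "f1": f1}
```

For example, `binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])` counts two true positives, one false positive, one false negative, and one true negative, giving an accuracy of 0.6 and precision, recall, and F1 of 2/3 each.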

**Getting Started:** To use or further fine-tune the model with your own dataset of clinical expressions, please refer to the source code: https://github.com/kartoun/alcohol_use_classification_llms. The code provides all necessary instructions to replicate the fine-tuning process or to adapt it to new datasets potentially drawn from real healthcare systems.
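For quick inference without the full training code, a sketch along the following lines may work with the Hugging Face `transformers` text-classification pipeline. It is not taken from the linked repository: `MODEL_ID` is a hypothetical placeholder that must be replaced with the actual Hub id or local path of the fine-tuned model, the helper names are illustrative, and the label strings returned depend on the model's configuration.

```python
from typing import Callable, Dict, List

# Hypothetical placeholder: replace with the real Hub id or local path of the model.
MODEL_ID = "your-namespace/your-fine-tuned-gatortron"


def build_classifier(model_id: str = MODEL_ID):
    """Load a text-classification pipeline (requires `transformers` and `torch`)."""
    # Imported lazily so classify_notes can be used and tested without transformers.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id)


def classify_notes(notes: List[str], classifier: Callable) -> List[Dict]:
    """Run the classifier on each note and pair the note with its top prediction."""
    predictions = classifier(notes)
    return [
        {"text": note, "label": pred["label"], "score": pred["score"]}
        for note, pred in zip(notes, predictions)
    ]
```

A typical call would be `classify_notes(["Patient reports drinking heavily every night."], build_classifier())`. Each result carries the original text, the predicted label, and its score; mapping label names to the two alcohol-use categories should be checked against the model's `id2label` configuration rather than assumed.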