jcho02
/

whisper_cleft

Model card Files Files and versions

whisper_cleft / README.md

jcho02's picture

Update README.md

8a8b48a verified about 2 years ago

|

2.54 kB

	---
	license: apache-2.0
	language:
	- en
	tags:
	- medical
	---

	# Model Card for Fine-Tuned Whisper Model: Hypernasality Identification

	This model card details the Whisper model, fine-tuned to identify hypernasality in patients from audio recordings. The model is developed to assist in the diagnosis and monitoring of speech disorders.

	- Model Interactive Demo: https://huggingface.co/spaces/jcho02/Transformers_whisper_cleft
	## Model Details

	### Model Description
	- Developed by: The Data Science team at the Vanderbilt Data Science Institute
	- Funded by: This project was supported by healthcare-focused research grants and institutional funding.
	- Shared by: The model is shared by Vanderbilt University with the aim to assist clinicians and speech therapists.
	- Model type: Deep Learning, Neural Network (Fine-Tuned Whisper Model)
	- Language(s) (NLP): The model primarily focuses on English language audio samples but has the potential to be adapted for multilingual use.
	- License: The model is released under an open-source license for non-commercial use, details of which can be found in the repository.
	- Finetuned from model: The base model is OpenAI's Whisper, which has been fine-tuned using a specialized dataset for hypernasality detection.


	### Model Sources
	- Repository: https://github.com/jaewoocho/Transformers_whisper_cleft/tree/main?tab=readme-ov-file
	- Demo : https://huggingface.co/spaces/jcho02/Transformers_whisper_cleft

	## Uses

	### Direct Use
	- This model can be directly used by healthcare professionals to analyze patient speech samples for signs of hypernasality, aiding in diagnosis and treatment planning.

	### Downstream Use
	- Potential use in automated speech therapy tools or in research settings for studying speech disorders.

	### Out-of-Scope Use
	- Not intended for non-medical use or as a sole diagnostic tool without clinical oversight.

	## Bias, Risks, and Limitations
	- The model might have limitations in accurately identifying hypernasality across diverse accents and languages. User discretion is advised, and results should be considered alongside clinical assessments.

	## Recommendations
	- Healthcare professionals should use this tool as a supplement to traditional diagnostic methods. Continuous updating with diverse datasets is recommended to improve accuracy.

	## How to Get Started with the Model
	- Use the following code snippet to get started with the model:
	- https://github.com/jaewoocho/Transformers_whisper_cleft/tree/main?tab=readme-ov-file