Spaces:

GenSEC-LLM
/

README

No application file

README / README.md

Update README.md

69df3f8 verified 11 months ago

1.26 kB

	---
	title: README
	emoji: 📈
	colorFrom: pink
	colorTo: red
	sdk: streamlit
	pinned: false
	sdk_version: 1.43.2
	thumbnail: >-
	https://cdn-uploads.huggingface.co/production/uploads/629e1b71bb6419817ed7566c/jeUU2sPSuMRP9IIqVnufk.png
	---

	- GenSEC: Text-based Generative Audio & Speech Recognition with Cascaded ASR-LLMs
	- Task 1: ASR N-best hypotheses correction
	- Task 2: Speaker Tagging from N-best hypotheses
	- Task 3: Emotion Recognition from N-best hypotheses

	- Open Source Model

	- Llama-7b pre-training for ASR correction
	- https://huggingface.co/GenSEC-LLM/SLT-Task1-Llama2-7b-HyPo-baseline


	- IEEE SLT 2024, References [Paper](https://arxiv.org/abs/2409.09785). See below resources for baseline models and datasets.

	```bib
	@inproceedings{yang2024large,
	title={Large language model based generative error correction: A challenge and baselines for speech recognition, speaker tagging, and emotion recognition},
	author={Yang, Chao-Han Huck and Park, Taejin and Gong, Yuan and Li, Yuanchao and Chen, Zhehuai and Lin, Yen-Ting and Chen, Chen and Hu, Yuchen and Dhawan, Kunal and {\.Z}elasko, Piotr and others},
	booktitle={2024 IEEE Spoken Language Technology Workshop (SLT)},
	pages={371--378},
	year={2024},
	organization={IEEE}
	}
	```