Korean Speech Suite: STT + Speaker Diarization
batisay (STT, what was said) + batispeak (speaker diarization, who said it) = per-speaker transcription of calls and meetings. Runs on-device on a 16GB Mac.
Korean fine-tuned Whisper Large v3 Turbo (809M). Apache 2.0 · no gating · commercial use allowed, including external sales. A free base model that is robust on real-world (long-form) calls; it is the default call model in the BatiFlow App.
| Measurement | batisay-ko-base | Comparison |
|---|---|---|
| General speech (KsponSpeech clean, RTZR-strict, N=500) | 7.77% | Whisper Large v3 raw 17.03% / RTZR API 5.91-6.18% |
| Real-world calls (long-form, 2-51 min, 5 recordings) | 19.11% | Stable across varied speakers and environments (turbo 1.1 scores 14.85%) |
The strength of base is robust generalization to real calls. It was trained mostly on general speech, but because it is not overfit, it holds up reasonably well on real-world calls.
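To put the figures in the table above in proportion, the short sketch below computes relative error reduction. The card does not name the metric behind the percentages (character or word error rate), so the arithmetic is written to work for either; the function name `relative_reduction` is illustrative and not part of any BatiAI tooling.

```python
def relative_reduction(baseline: float, improved: float) -> float:
    """Relative error reduction, as a fraction of the baseline error rate."""
    if baseline <= 0:
        raise ValueError("baseline error rate must be positive")
    return (baseline - improved) / baseline


# Error rates (%) copied from the table above.
general_raw_whisper = 17.03  # Whisper Large v3 raw, general speech
general_base = 7.77          # batisay-ko-base, general speech
calls_base = 19.11           # batisay-ko-base, real-world calls
calls_turbo = 14.85          # batisay-ko-turbo 1.1, real-world calls

print(f"base vs raw Whisper, general speech: "
      f"{relative_reduction(general_raw_whisper, general_base):.1%}")
print(f"turbo 1.1 vs base, real-world calls: "
      f"{relative_reduction(calls_base, calls_turbo):.1%}")
```

On these numbers, base cuts the general-speech error of raw Whisper Large v3 by roughly 54%, and turbo 1.1 cuts the real-call error of base by roughly 22%.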
ggml-batisay-ko-base.bin 1.6 GB (F32, highest quality)
ggml-batisay-ko-base-q5_0.bin 547 MB (Q5, balanced) ⭐ recommended
ggml-batisay-ko-base-q4_0.bin 452 MB (Q4, for 8GB Macs)
model.safetensors 1.6 GB (transformers / MLX source)
huggingface-cli download batiai/batisay-ko-base ggml-batisay-ko-base-q5_0.bin --local-dir .
./whisper-cli -m ggml-batisay-ko-base-q5_0.bin -l ko -f audio.wav --output-txt
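The command above takes a WAV file. As a hedged sketch, the helper below checks the input before calling it, assuming the whisper.cpp build in use expects 16 kHz, 16-bit, mono WAV as its README describes (newer builds may accept more formats). The function names `is_whisper_ready` and `build_command` are hypothetical; only the flags already shown above are used.

```python
import wave


def is_whisper_ready(path: str) -> bool:
    """Return True if the WAV file is 16 kHz, mono, 16-bit PCM."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000
            and wav.getnchannels() == 1
            and wav.getsampwidth() == 2
        )


def build_command(model: str, audio: str, binary: str = "./whisper-cli") -> list[str]:
    """Build the argument list for the whisper-cli call shown above."""
    return [binary, "-m", model, "-l", "ko", "-f", audio, "--output-txt"]


# Typical use (not executed here, since it needs the binary and model file):
#   import subprocess
#   if is_whisper_ready("audio.wav"):
#       subprocess.run(
#           build_command("ggml-batisay-ko-base-q5_0.bin", "audio.wav"),
#           check=True,
#       )
```

If the check fails, one common fix is to resample first, for example with `ffmpeg -i input.m4a -ar 16000 -ac 1 -c:a pcm_s16le audio.wav`.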
from transformers import WhisperForConditionalGeneration, WhisperProcessor
model = WhisperForConditionalGeneration.from_pretrained('batiai/batisay-ko-base')
processor = WhisperProcessor.from_pretrained('batiai/batisay-ko-base', language='Korean', task='transcribe')
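The lines above only load the model and processor. The sketch below extends them into a minimal transcription loop, under several assumptions: the audio is already a 1-D float array of mono samples at 16 kHz, the installed transformers version accepts `language` and `task` in `generate`, and naive fixed 30-second windows are acceptable (they can split a word at a boundary). The helpers `chunk_bounds` and `transcribe` are illustrative names, not part of the model's own tooling.

```python
def chunk_bounds(n_samples: int, sample_rate: int = 16000,
                 chunk_s: float = 30.0) -> list[tuple[int, int]]:
    """Split a sample count into consecutive (start, end) windows of chunk_s seconds."""
    step = int(sample_rate * chunk_s)
    return [(start, min(start + step, n_samples))
            for start in range(0, n_samples, step)]


def transcribe(audio, sample_rate: int = 16000) -> str:
    """Transcribe a 1-D float array of mono 16 kHz samples, window by window."""
    # Imported lazily so chunk_bounds stays usable without the heavy dependencies.
    import torch
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    repo = "batiai/batisay-ko-base"
    processor = WhisperProcessor.from_pretrained(repo, language="Korean", task="transcribe")
    model = WhisperForConditionalGeneration.from_pretrained(repo)
    model.eval()

    pieces = []
    for start, end in chunk_bounds(len(audio), sample_rate):
        # The processor pads each window to Whisper's 30-second input length.
        inputs = processor(audio[start:end], sampling_rate=sample_rate,
                           return_tensors="pt")
        with torch.no_grad():
            ids = model.generate(inputs.input_features,
                                 language="ko", task="transcribe")
        text = processor.batch_decode(ids, skip_special_tokens=True)[0]
        pieces.append(text.strip())
    return " ".join(pieces)
```

Fixed windows keep the example short; for long calls, overlapping windows or a voice-activity-based splitter would likely give cleaner boundaries.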
Apache 2.0 (BatiAI Open Tier 1): no gating, and no restrictions on commercial or external SaaS use.
| Model | Version | License | Notes |
|---|---|---|---|
| batisay-ko-base (this model) | 1.0 | Apache 2.0 (free) | Free and robust. General speech 7.77% / real calls 19.11% |
| batisay-ko-turbo | 1.1 | Community v2 (gated) | Strengthened for multi-domain use: calls, meetings, conversation. Real calls 14.85% (ahead of base), general speech 6.99% |
| batisay-ko-large | 1.0 | Community v2 (gated) | Specialized for high-quality document/clean transcription |
→ Free or external SaaS use = base (this model) / call and multi-domain accuracy = turbo 1.1 / document transcription = large
Contact: support@bati.ai · https://flow.bati.ai
Base model
openai/whisper-large-v3