TinyLlama-1.1B-Phonetic-Liaison-Katakana-Generator
This model is a fine-tuned version of TinyLlama/TinyLlama-1.1B-Chat-v1.0 designed to predict connected phoneme sequences and rhythm-optimized Katakana. It focuses on capturing real-world auditory phenomena like liaison, reduction, and flapping.
๐ The Concept: "Phonetic Bridge for Natural Speech"
Traditional G2P (Grapheme-to-Phoneme) converters often treat words in isolation. This model serves as a Phonetic Bridge, predicting how sounds change in continuous speech.
For Global Developers (The "Connected Phonemes" Advantage)
While the model outputs Katakana, its core intelligence lies in generating Connected Phoneme Sequences (ARPAbet).
- TTS Frontend: Use the linked phoneme output to improve the prosody of your Text-to-Speech engines.
- ESL Tools: Visualize for learners how "Take it" becomes
/t ey1 k ih1 t/instead of two separate words.
For Japanese Learners ("The Training Wheels")
I am a firm believer that English should ideally be learned through ears, not Katakana. However, beginners often face a "fear of the written word." This model provides **"Supportive Katakana"**โnot a translation, but a phonetic map that mimics native rhythm, acting as training wheels for the ear.
โจ Key Features
- Connected Phonemes (ARPAbet): Outputs the exact phonetic string including liaison (e.g.,
a little bit->AH0 L IH1 D AH0 L B IH1 T). - Liaison & Flapping: Naturally handles
TtoDtransformations and word-to-word connections. - Silent Letters: Intelligently ignores non-vocalized consonants.
- Modern ESL Approach: Designed for high-speed inference on mobile devices (ready for GGUF/on-device PoC).
๐ Comparison: Beyond Dictionary Rules
| English Phrase | Dictionary Phonemes | This Model (Linked Phonemes) | Supportive Katakana |
|---|---|---|---|
| A little bit | [AH0] [L IH1 T AH0 L] [B IH1 T] |
AH0 L IH1 D AH0 L B IH1 T |
ใขใชใญใใ |
| Check it out | [CH EH1 K] [IH1 T] [AW1 T] |
CH EH1 K IH1 T AW1 T |
ใใงใญใฉใ |
| Middle of the night | [M IH1 D AH0 L] [AH1 V]... |
M IH1 D AH0 L AH1 V DH AH0 N AY1 T |
ใใใญใดใถใใคใ |
๐ Prompt Format
To extract both Katakana and the connected phoneme sequence, use the following format:
่ฑ่ชใจใใฎๅ่ชๅไฝใฎ้ณ็ด ใใใใชใจใพใณใ่ๆ
ฎใใใซใฟใซใใจ็นใใฃใ้ณ็ด ๅใ็ๆใใฆใใ ใใใ
่ฑ่ช: take it easy
ๅ่ช้ณ็ด : [T EY1 K] [IH1 T] [IY1 Z IY0]
ใซใฟใซใ: ใใคใญใใใคใผใธใผ
็นใใฃใ้ณ็ด : T EY1 K IH1 T IY1 Z IY0
่ฑ่ช: {Your Phrase}
ๅ่ช้ณ็ด : {Standard G2P Output}
ใซใฟใซใ:
๐ Technical Specs & Dataset
- Dataset: 1,200+ hand-curated pairs of English phrases and their auditory-correct phonetic mappings.
- Evaluation: Currently being benchmarked against the
speechocean762dataset for pronunciation scoring PoC. - Architecture: LoRA fine-tuning on TinyLlama 1.1B.
- Optimization: Highly compatible with GGUF for ultra-lightweight mobile app integration (MFCC/DTW based evaluation).
โ ๏ธ Limitations & Bias
- Model Size: 1.1B parameters. While fast, it may hallucinate on rare proper nouns.
- Accent: Optimized for General American English (GenAm) commonly found in global pop music and media.
๐ License
Apache 2.0
- Downloads last month
- 19
Model tree for pyon0024/tinyllama-katakana-converter
Base model
TinyLlama/TinyLlama-1.1B-Chat-v1.0