|
|
--- |
|
|
license: apache-2.0 |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
- de |
|
|
metrics: |
|
|
- accuracy |
|
|
--- |
|
|
|
|
|
# Lattice-1 |
|
|
|
|
|
**Release Date:** November 30, 2025 |
|
|
|
|
|
## Model Description |
|
|
|
|
|
Lattice-1 is an advanced forced alignment model designed for precise word-level alignment between audio and text. This model provides state-of-the-art performance in aligning speech with its corresponding word transcription, enabling accurate time-stamping of words in spoken language. |
|
|
|
|
|
## Key Features |
|
|
|
|
|
- **High-Precision Alignment**: Achieves accurate word-level forced alignment with millisecond precision |
|
|
- **Multilingual Support**: Supports English, Chinese, and German, with support for mixed-language content |
|
|
- **Efficient Processing**: Optimized for fast inference with ONNX runtime support |
|
|
- **Flexible Integration**: Easy to integrate into various speech processing pipelines |
|
|
|
|
|
## Supported Languages |
|
|
|
|
|
Lattice-1 supports the following languages: |
|
|
- 🇺🇸 English |
|
|
- 🇨🇳 Chinese (䏿–‡) |
|
|
- 🇩🇪 German (Deutsch) |
|
|
|
|
|
## Quick Start with Python SDK |
|
|
|
|
|
The easiest way to use Lattice-1 is through the **[Lattifai Python SDK](https://github.com/lattifai/lattifai-python)**. |
|
|
|
|
|
For more detailed usage instructions and API documentation, please visit: |
|
|
**[Lattifai Python SDK Documentation](https://github.com/lattifai/lattifai-python)** |
|
|
|
|
|
## Applications |
|
|
|
|
|
- **Speech Data Processing**: Creating high-quality training data for ASR, TTS, and Speech LLM systems |
|
|
- **Pronunciation Assessment**: Evaluating learner pronunciation accuracy |
|
|
- **Audio-Text Synchronization**: Subtitle generation and synchronization |
|
|
- **Voice Conversion**: Time-aligned feature extraction for voice transformation |
|
|
|
|
|
## Performance |
|
|
|
|
|
Lattice-1 has been evaluated on standard benchmarks and demonstrates: |
|
|
- High accuracy in word boundary detection |
|
|
- Robust performance across different speaking styles and languages |
|
|
- Fast inference speed suitable for real-time applications |
|
|
|
|
|
## Stay Connected |
|
|
|
|
|
Follow us on Twitter/X for the latest updates: [@Lattifai_HQ](https://x.com/Lattifai_HQ) |
|
|
|
|
|
## Citation |
|
|
|
|
|
If you use Lattice-1 in your research, please cite: |
|
|
|
|
|
```bibtex |
|
|
@misc{lattice1, |
|
|
title={Lattice-1: Multilingual High-Precision Word-Level Forced Alignment}, |
|
|
author={LattifAI Team}, |
|
|
year={2025}, |
|
|
publisher={Hugging Face}, |
|
|
howpublished={\url{https://huggingface.co/lattifai/Lattice-1}} |
|
|
} |
|
|
``` |
|
|
|
|
|
## License |
|
|
|
|
|
Please refer to the license file for usage terms and conditions. |
|
|
|
|
|
## Support |
|
|
|
|
|
For issues, questions, or feature requests, please visit: |
|
|
- GitHub: [https://github.com/lattifai/lattifai-python](https://github.com/lattifai/lattifai-python) |
|
|
- Documentation: [LattifAI Python SDK Documentation](https://github.com/lattifai/lattifai-python?tab=readme-ov-file#quick-start) |
|
|
|
|
|
## Acknowledgments |
|
|
|
|
|
This model is developed and maintained by the LattifAI team. |
|
|
|