File size: 2,777 Bytes
1e1d51a
 
 
 
 
 
17f73c6
 
d0ad17b
 
 
 
 
 
17f73c6
 
 
 
 
 
 
 
 
 
 
d0ad17b
 
 
 
 
 
 
17f73c6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d0ad17b
 
 
17f73c6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
---
license: apache-2.0
language:
- en
- zh
- de
metrics:
- accuracy
---

# Lattice-1

**Release Date:** November 30, 2025

## Model Description

Lattice-1 is an advanced forced alignment model designed for precise word-level alignment between audio and text. This model provides state-of-the-art performance in aligning speech with its corresponding word transcription, enabling accurate time-stamping of words in spoken language.

## Key Features

- **High-Precision Alignment**: Achieves accurate word-level forced alignment with millisecond precision
- **Multilingual Support**: Supports English, Chinese, and German, with support for mixed-language content
- **Efficient Processing**: Optimized for fast inference with ONNX runtime support
- **Flexible Integration**: Easy to integrate into various speech processing pipelines

## Supported Languages

Lattice-1 supports the following languages:
- 🇺🇸 English
- 🇨🇳 Chinese (中文)
- 🇩🇪 German (Deutsch)

## Quick Start with Python SDK

The easiest way to use Lattice-1 is through the **[Lattifai Python SDK](https://github.com/lattifai/lattifai-python)**.

For more detailed usage instructions and API documentation, please visit:
**[Lattifai Python SDK Documentation](https://github.com/lattifai/lattifai-python)**

## Applications

- **Speech Data Processing**: Creating high-quality training data for ASR, TTS, and Speech LLM systems
- **Pronunciation Assessment**: Evaluating learner pronunciation accuracy
- **Audio-Text Synchronization**: Subtitle generation and synchronization
- **Voice Conversion**: Time-aligned feature extraction for voice transformation

## Performance

Lattice-1 has been evaluated on standard benchmarks and demonstrates:
- High accuracy in word boundary detection
- Robust performance across different speaking styles and languages
- Fast inference speed suitable for real-time applications

## Stay Connected

Follow us on Twitter/X for the latest updates: [@Lattifai_HQ](https://x.com/Lattifai_HQ)

## Citation

If you use Lattice-1 in your research, please cite:

```bibtex
@misc{lattice1,
  title={Lattice-1: Multilingual High-Precision Word-Level Forced Alignment},
  author={LattifAI Team},
  year={2025},
  publisher={Hugging Face},
  howpublished={\url{https://huggingface.co/lattifai/Lattice-1}}
}
```

## License

Please refer to the license file for usage terms and conditions.

## Support

For issues, questions, or feature requests, please visit:
- GitHub: [https://github.com/lattifai/lattifai-python](https://github.com/lattifai/lattifai-python)
- Documentation: [LattifAI Python SDK Documentation](https://github.com/lattifai/lattifai-python?tab=readme-ov-file#quick-start)

## Acknowledgments

This model is developed and maintained by the LattifAI team.