NeuraCraft commited on
Commit
4514b3d
Β·
1 Parent(s): 35d23e6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -17
README.md CHANGED
@@ -8,7 +8,6 @@ tags:
8
  - asr
9
  - pytorch
10
  - transformer
11
- - lance-ai
12
  license: apache-2.0
13
  ---
14
 
@@ -65,30 +64,45 @@ print(f"Transcription: {transcription}")
65
 
66
  ---
67
 
68
- ## πŸ“Š Model Architecture
69
 
70
- Lance ASR is built on a robust Transformer backbone:
71
- - **Audio Front-end**: Dual `Conv1d` layers with GELU activation and stride-2 subsampling.
72
- - **Encoder**: 4-layer `TransformerEncoder` with 12 attention heads.
73
- - **Decoder**: 4-layer `TransformerDecoder` with cross-attention to encoder states.
74
- - **Hidden Size**: 768
75
- - **Vocab Size**: ~100k (Tiktoken)
 
 
 
 
76
 
77
  ---
78
 
79
- ## πŸš€ Training
80
 
81
- The model is trained using the `PolyAI/minds14` dataset (or custom datasets) using the Hugging Face `Trainer` API. The training script (`main.py`) supports `bf16` and automatic uploading to the Hugging Face Hub.
 
82
 
83
- ```bash
84
- python main.py
85
- ```
 
 
 
 
86
 
87
  ---
88
 
89
- ## πŸ— Development & Contributions
 
 
 
 
 
 
 
 
90
 
91
- Lance ASR is developed by **NeuraCraft**. We welcome contributions to improve the efficiency and accuracy of the model!
92
 
93
- **Project Status**: 🚧 In Active Development
94
- **Developer**: NeuraCraft
 
8
  - asr
9
  - pytorch
10
  - transformer
 
11
  license: apache-2.0
12
  ---
13
 
 
64
 
65
  ---
66
 
67
+ πŸ“Š Performance & Evaluation
68
 
69
+ Lance ASR is currently in its early stages, and performance is being actively tested. Initial evaluations focus on:
70
+ πŸ”Ή **WER (Word Error Rate)** – Measures transcription accuracy
71
+ πŸ”Ή **CER (Character Error Rate)** – Measures character-level precision
72
+ πŸ”Ή **Inference Latency** – Optimized for real-time local processing
73
+
74
+ βœ… Planned Enhancements
75
+
76
+ πŸ”Ή Larger training datasets (e.g., Common Voice, LibriSpeech)
77
+ πŸ”Ή Advanced noise-robustness for real-world environments
78
+ πŸ”Ή Multilingual ASR support for global accessibility
79
 
80
  ---
81
 
82
+ πŸš€ Future Roadmap
83
 
84
+ Lance ASR is just getting started! The goal is to transform it into the core auditory component of an advanced AI assistant.
85
+ πŸ“… Planned Features:
86
 
87
+ πŸ”œ Real-time live transcription & streaming support
88
+
89
+ πŸ”œ Multi-speaker identification (Diarization)
90
+
91
+ πŸ”œ Integrated Voice Activity Detection (VAD)
92
+
93
+ πŸ”œ High-efficiency deployment for mobile and edge devices
94
 
95
  ---
96
 
97
+ πŸ— Development & Contributions
98
+
99
+ Lance ASR is being developed by **NeuraCraft**. Contributions, suggestions, and testing feedback are welcome!
100
+
101
+ πŸ“¬ Contact & Updates:
102
+
103
+ Developer: NeuraCraft
104
+
105
+ Project Status: 🚧 In Development
106
 
107
+ Follow for updates: Coming soon
108