NeuraCraft/Lance-ASR

Automatic Speech Recognition

text2text-generation


NeuraCraft committed 10 days ago

Commit 4514b3d · 1 Parent(s): 35d23e6

Update README.md

Files changed (1)

README.md +31 -17

@@ -8,7 +8,6 @@ tags:
 - asr
 - pytorch
 - transformer
-- lance-ai
 license: apache-2.0
 ---
@@ -65,30 +64,45 @@ print(f"Transcription: {transcription}")
 ---
-## 📊 Model Architecture
-Lance ASR is built on a robust Transformer backbone:
-- **Audio Front-end**: Dual `Conv1d` layers with GELU activation and stride-2 subsampling.
-- **Encoder**: 4-layer `TransformerEncoder` with 12 attention heads.
-- **Decoder**: 4-layer `TransformerDecoder` with cross-attention to encoder states.
-- **Hidden Size**: 768
-- **Vocab Size**: ~100k (Tiktoken)
 ---
-## 🚀 Training
-The model is trained using the `PolyAI/minds14` dataset (or custom datasets) using the Hugging Face `Trainer` API. The training script (`main.py`) supports `bf16` and automatic uploading to the Hugging Face Hub.
-```bash
-python main.py
-```
 ---
-## 🏗 Development & Contributions
-Lance ASR is developed by **NeuraCraft**. We welcome contributions to improve the efficiency and accuracy of the model!
-**Project Status**: 🚧 In Active Development
-**Developer**: NeuraCraft

 ---
+📊 Performance & Evaluation
+Lance ASR is currently in its early stages, and performance is being actively tested. Initial evaluations focus on:
+🔹 **WER (Word Error Rate)** – Measures transcription accuracy
+🔹 **CER (Character Error Rate)** – Measures character-level precision
+🔹 **Inference Latency** – Optimized for real-time local processing
+✅ Planned Enhancements
+🔹 Larger training datasets (e.g., Common Voice, LibriSpeech)
+🔹 Improved noise robustness for real-world environments
+🔹 Multilingual ASR support for global accessibility
 ---
+🚀 Future Roadmap
+Lance ASR is just getting started! The goal is to transform it into the core auditory component of an advanced AI assistant.
+📅 Planned Features:
+🔜 Real-time live transcription & streaming support
+🔜 Multi-speaker diarization (who spoke when)
+🔜 Integrated Voice Activity Detection (VAD)
+🔜 High-efficiency deployment for mobile and edge devices
 ---
+🏗 Development & Contributions
+Lance ASR is being developed by **NeuraCraft**. Contributions, suggestions, and testing feedback are welcome!
+📬 Contact & Updates:
+Developer: NeuraCraft
+Project Status: 🚧 In Development
+Follow for updates: Coming soon
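
The added Performance & Evaluation section names WER and CER without showing how they are computed. The sketch below is one common way to compute both, using a Levenshtein edit distance over words (WER) or characters (CER), divided by the reference length. It is not taken from the Lance ASR repository: the function names `edit_distance`, `wer`, and `cer` are hypothetical, and the sketch assumes plain whitespace tokenization with no text normalization (casing, punctuation), which a real evaluation would likely add.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion from the reference
                curr[j - 1] + 1,     # insertion into the reference
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    reference = "turn on the lights"
    hypothesis = "turn off the light"
    print(f"WER: {wer(reference, hypothesis):.3f}")
    print(f"CER: {cer(reference, hypothesis):.3f}")
```

Both metrics can exceed 1.0 when the hypothesis contains many insertions, so they are error rates rather than bounded accuracy scores.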