Neurlang Project

neurlang

https://blog.neurlang.online

AI & ML interests

hashtrons/weightless (non-neural) networks

Organizations

None yet

Recent Activity

updated a model about 4 hours ago

neurlang/piper-onnx-slovakspeech-female-slovak-0.8.0

Updated about 4 hours ago

published a model about 4 hours ago

neurlang/piper-onnx-slovakspeech-female-slovak-0.8.0

Updated about 4 hours ago

liked a model 6 days ago

owensong/Inflect-Nano-v1

Text-to-Speech • Updated 1 day ago • 198

New activity in neurlang/low-quality-multilingual-sentences 26 days ago

[bot] Conversion to Parquet

#1 opened 3 months ago by parquet-converter

reacted to kavyamanohar's post with 🔥 about 1 month ago

Releasing Vividh-ASR — an open benchmark and models for Hindi and Malayalam ASR.

Vividh-ASR is built from public data, stratified by complexity:
→ Clean recordings
→ Noisy and accented speech
→ Spontaneous, conversational audio

Alongside the benchmark, we release:
→ Open models for Hindi and Malayalam
→ A training recipe with two counterintuitive choices that moved the needle
→ What failed, not just what worked

The stratified evaluation methodology transfers directly to any low-resource language setup — beyond Hindi and Malayalam.

Built at @adalatai, where we build speech tech for Indian courts. This is our first open contribution back to the community. @janaab @Kush0610 @orgh0

Link: https://huggingface.co/blog/adalat-ai/vividh-benchmark

liked a dataset about 1 month ago

neurlang/slovakspeech_male_dataset

Updated May 17 • 7 • 2

updated a dataset about 1 month ago

neurlang/slovakspeech_male_dataset

Updated May 17 • 7 • 2

reacted to cesear64's post with 🔥 about 1 month ago

Just published: how we built production Sango (Central African Republic) translation without fine-tuning, parallel corpus, or training compute.

The method — vocabulary-augmented prompting with a 581-entry native-speaker-verified lexicon — generalizes to any of the ~2,000 African languages at the same data-poverty level. Recipe, dataset, and code template all included.

📄 Blog: https://huggingface.co/blog/MEYNG/sangoai
📦 Dataset: MEYNG/sango-vocabulary

Would especially value feedback from anyone working on other low-resource African languages — Ewondo, Lingala, Wolof next on our roadmap.

liked a model about 2 months ago

Godelaune/Kokoro-82M-ONNX-German-Martin

Text-to-Speech • Updated May 22 • 16

reacted to unmodeled-tyler's post with 😎 about 2 months ago

Hey Hugging Face!

Repo: https://github.com/unmodeled-tyler/vessel-browser

I wanted to share a cool feature from my open source AI native web browser, Vessel: Persistent highlights!

You can highlight anything on the page and the context is provided to the agent. It's kind of a fun way to learn about new stuff, synthesize info, or just deepen your comprehension/understanding.

Since highlights are persistent, you can close the page, come back later - and your highlights will be exactly where you left them. I've found this particularly useful when reviewing technical blogs, model cards, etc.

Check it out!