Building on HF

Mahimai Raja J

mahimairaja

https://mahimairaja.in

AI & ML interests

Azure AI Engineer | LangChain + LangGraph | FastAPI | RAG | Voice AI

Recent Activity

updated a model 15 days ago

mahimairaja/whisper-small-french-finetuned

Updated 15 days ago

published a model 15 days ago

mahimairaja/whisper-small-french-finetuned

Updated 15 days ago

updated a model 15 days ago

mahimairaja/whisper-small-fr

Automatic Speech Recognition • 0.2B • Updated 15 days ago • 13

published a model 15 days ago

mahimairaja/whisper-small-fr

Automatic Speech Recognition • 0.2B • Updated 15 days ago • 13

New activity in zai-org/GLM-OCR about 1 month ago

When will there be better support for vLLM?

#6 opened about 1 month ago by

Xiakj

reacted to marksverdhei's post with 🤗 about 2 months ago

Post

2667

Dear Hugging Face team, can we please have a way to archive HF repositories and Spaces? I have a bunch of Spaces that used to work but no longer do because the HF Spaces implementation has changed, and I think it would be good if I could archive them, as on GitHub.

React to this post if you want to see this feature! 💡

reacted to Javedalam's post with 🔥 about 2 months ago

Post

2987

KittenTTS Nano — Tiny, Expressive, Practical

KittenTTS Nano is a lightweight, CPU-only text-to-speech model designed to prove that natural, expressive voices don’t require massive cloud stacks or GPUs. At roughly 15M parameters, it runs fast on modest hardware, supports multiple expressive voices, and exposes simple controls for pacing and tone. This makes it ideal for edge devices, demos, and anyone who wants full control over TTS without latency, lock-in, or infrastructure overhead.

Try it here

Javedalam/KittenTTS

The model page

KittenML/kitten-tts-nano-0.2

2 replies

posted an update about 2 months ago

Post

1186

🔥 Qwen is dominating the SLM space right now.

We all know 2026 is the year of small models, but the Alibaba team seems to have taken that quite seriously!

Qwen3-TTS — 3-sec voice cloning, 10 languages, beats ElevenLabs
Qwen3-ASR — Just dropped TODAY! 52 languages, <8% WER, SOTA open-source ASR
Qwen-Image — #1 open-source image model on AI Arena

All Apache 2.0. The most complete open-source AI stack, period.

So, what do you think the next release could be? A language model?
Comment below.

1 reply

replied to consome2's post about 2 months ago

Awesome!

reacted to consome2's post with ❤️ about 2 months ago

Post

5245

We’ve released two conversational speech datasets from oto on Hugging Face 🤗
Both are based on real, casual, full-duplex conversations, but with slightly different focuses.

Dataset 1: Processed / curated subset
otoearth/otoSpeech-full-duplex-processed-141h
* Full-duplex, spontaneous multi-speaker conversations
* Participants filtered for high audio quality
* PII removal and audio enhancement applied
* Designed for training and benchmarking S2S or dialogue models

Dataset 2: Larger raw(er) release
otoearth/otoSpeech-full-duplex-280h
* Same collection pipeline, with broader coverage
* More diversity in speakers, accents, and conversation styles
* Useful for analysis, filtering, or custom preprocessing experiments

We intentionally split the release to support different research workflows:
clean and ready-to-use vs. more exploratory and research-oriented use.

The datasets are currently private, but we’re happy to approve access requests — feel free to request access if you’re interested.

If you’re working on speech-to-speech (S2S) models or are curious about full-duplex conversational data, we’d love to discuss and exchange ideas together.

Feedback and ideas are very welcome!

2 replies

reacted to Reubencf's post with 🤗 about 2 months ago

Post

1897

Now Live: The Reubencf/Nano_Banana_Editor now includes 10 free requests/day! 🍌 I'm personally sponsoring these credits to help make open AI accessible to all.
(Note: Limits are subject to change based on funding).

Enjoy!