mlx-community/Qwen3.5-4B-OptiQ-4bit
Any plans on releasing flashhead for qwen3.5 models?
```bash
pip install flash-head
vllm serve embedl/Qwen3-1.7B-FlashHead-W4A16
```

FlashHead is picked up through the `vllm.general_plugins` entry point. No source patches, no custom imports.

```bash
vllm bench latency --model embedl/Qwen3-1.7B-FlashHead-W4A16 --batch-size 1

# Baseline comparison
FLASHHEAD_ENABLED=0 vllm bench latency --model embedl/Qwen3-1.7B-FlashHead-W4A16 --batch-size 1
```