Matricardi Fabio

FM-1976

https://medium.com/@fabio.matricardi

AI & ML interests

Control system engineering, AI, LLMs with Python. ThePoorGPUguy on Substack.

Recent Activity

reacted to constannnt's post with ❤️ 1 day ago

We are excited to announce Sipp.sh: a high-performance library for running AI inference locally and in the cloud through a unified API.

We began to realize that an LLM isn't just a chat interface for information retrieval. It can be integrated directly into web, games, or productivity apps to handle continuous monitoring and decision-making. It can act as a sort of "second brain," the silent hand that guides and helps a user without them even realizing it. We see this as the next frontier of UX design, but this is only possible if developers have access to low-cost, zero-latency compute and absolute data privacy.

That's why we created Sipp. It's an opinionated library that lets developers integrate local AI into any application, giving them the superpowers to completely rethink user experiences across the web, games, and desktop.

To achieve this, we built an entirely new stack in Rust and C++, working alongside the llama.cpp project. Through our work, we were able to contribute back to that community to help upgrade the GGML WebGPU backend. This deep optimization is what enables our fast, responsive decode speeds directly in the browser. Sipp ships as a zero-dependency library for desktop and web, achieving 3x to 5x speedup in token decode compared to popular alternatives.

We are already seeing some incredible use cases emerge from this, from continuous monitoring using local vision to the dynamic generation of game elements in a real-time wizard vs. wizard game.

The best part? It's fully open-source! We see this as the start of a dialogue about what the future of user interaction is going to look like, and we built Sipp to lay the foundation for that exciting future. Check out the live demos on our site, run your own benchmarks, or come hang out with us in our Discord.

Website: https://www.sipp.sh/
GitHub: https://github.com/noumena-labs/Sipp

liked a model 11 days ago

mradermacher/Tool-Star-Qwen-3B-GGUF

liked a model 11 days ago

mradermacher/Tool-Star-Qwen-1.5B-GGUF

Organizations

None yet

liked 4 models 11 days ago

liked a model 14 days ago

bartowski/nex-agi_Nex-N2-mini-GGUF

Image-Text-to-Text • 35B • Updated 17 days ago • 40.6k • 24

liked a model 16 days ago

mradermacher/Salience-1-9B-GGUF

8B • Updated 16 days ago • 753 • 4

liked 4 models 18 days ago

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated 12 days ago • 28.6k • 497

Arki05/North-Mini-Code-1.0-GGUF

Text Generation • 30B • Updated 8 days ago • 5.31k • 8

ai-sage/GigaChat3.1-10B-A1.8B-GGUF

Text Generation • 11B • Updated Mar 25 • 3.5k • 75

silx-ai/Quasar-Preview

Text Generation • 17B • Updated 8 days ago • 1.61k • 92

liked 2 models 23 days ago

ideogram-ai/ideogram-4-nf4

Text-to-Image • Updated 23 days ago • 12k • 393

Green-Sky/bonsai-image-binary-4B-GGUF

Text-to-Image • 4B • Updated 26 days ago • 3.83k • 13

liked 2 models 25 days ago

mradermacher/LMT-60-0.6B-GGUF

0.8B • Updated 26 days ago • 540 • 6

JetBrains/Mellum2-12B-A2.5B-Thinking

Text Generation • 12B • Updated 15 days ago • 27.3k • 307

liked a model 26 days ago

huihui-ai/Huihui4-8B-A4B

Image-Text-to-Text • 9B • Updated Apr 27 • 52 • 17

liked a model 29 days ago

Jackrong/Qwopus3.5-4B-Coder

Text Generation • 5B • Updated about 1 month ago • 8.51k • 15

liked 4 models about 1 month ago

openbmb/MiniCPM5-1B

Text Generation • 1B • Updated May 26 • 332k • 824

nvidia/Nemotron-Cascade-8B-Thinking

Text Generation • 8B • Updated Jan 1 • 820 • 41

nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF

Text Generation • 4B • Updated Mar 16 • 21.9k • 166

nvidia/Nemotron-Labs-Diffusion-3B

Text Generation • 4B • Updated 24 days ago • 60.1k • 32
