Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
0.7
TFLOPS
16
42
500
Matricardi Fabio
FM-1976
Follow
hegderavin's profile picture
Akim92's profile picture
Fishtiks's profile picture
24 followers
·
130 following
https://medium.com/@fabio.matricardi
ThePoorGpuGuy
fabiomatricardi
AI & ML interests
control system engineering, AI, LLM with python. ThePoorGPUguy on substack
Recent Activity
reacted
to
constannnt
's
post
with ❤️
1 day ago
We are excited to announce Sipp.sh: a high-performance library for running AI inference locally and in the cloud through a unified API. We began to realize that an LLM isn't just a chat interface for information retrieval. It can be integrated directly into web, games, or productivity apps to handle continuous monitoring and decision-making. It can act as a sort of "second brain,” the silent hand that guides and helps a user without them even realizing it. We see this as the next frontier of UX design, but this is only possible if developers have access to low-cost, zero-latency compute and absolute data privacy. That's why we created Sipp. It’s an opinionated library that lets developers integrate local AI into any application, giving them the superpowers to completely rethink user experiences across the web, games, and desktop. To achieve this, we built an entirely new stack in Rust and C++, working alongside the llama.cpp project. Through our work, we were able to contribute back to that community to help upgrade the GGML WebGPU backend. This deep optimization is what enables our fast, responsive decode speeds directly in the browser. Sipp ships as a zero-dependency library for desktop and web, achieving 3x to 5x speedup in token decode compared to popular alternatives. We are already seeing some incredible use cases emerge from this, from continuous monitoring using local vision to the dynamic generation of game elements in a real-time wizard vs. wizard game. The best part? It's fully open-source! We see this as the start of a dialogue about what the future of user interaction is going to look like, and we built Sipp to lay the foundation for that exciting future. Check out the live demos on our site, run your own benchmarks, or come hang out with us in our Discord. Website: https://www.sipp.sh/ Github: https://github.com/noumena-labs/Sipp
liked
a model
11 days ago
mradermacher/Tool-Star-Qwen-3B-GGUF
liked
a model
11 days ago
mradermacher/Tool-Star-Qwen-1.5B-GGUF
View all activity
Organizations
None yet
FM-1976
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
4 models
11 days ago
mradermacher/Tool-Star-Qwen-3B-GGUF
3B
•
Updated
May 25, 2025
•
98
•
4
mradermacher/Tool-Star-Qwen-1.5B-GGUF
2B
•
Updated
Jun 7, 2025
•
54
•
3
mradermacher/DeepSeek-R1-Distill-Llama-3B-tools-GGUF
3B
•
Updated
Jul 11, 2025
•
188
•
2
ibm-granite/granite-switch-4.1-3b-preview
Text Generation
•
4B
•
Updated
May 24
•
6.24k
•
33
liked
a model
14 days ago
bartowski/nex-agi_Nex-N2-mini-GGUF
Image-Text-to-Text
•
35B
•
Updated
16 days ago
•
40.6k
•
24
liked
a model
16 days ago
mradermacher/Salience-1-9B-GGUF
8B
•
Updated
16 days ago
•
753
•
4
liked
4 models
18 days ago
CohereLabs/North-Mini-Code-1.0
Text Generation
•
30B
•
Updated
12 days ago
•
28.6k
•
495
Arki05/North-Mini-Code-1.0-GGUF
Text Generation
•
30B
•
Updated
8 days ago
•
5.31k
•
8
ai-sage/GigaChat3.1-10B-A1.8B-GGUF
Text Generation
•
11B
•
Updated
Mar 25
•
3.5k
•
75
silx-ai/Quasar-Preview
Text Generation
•
17B
•
Updated
8 days ago
•
1.61k
•
92
liked
2 models
23 days ago
ideogram-ai/ideogram-4-nf4
Text-to-Image
•
Updated
23 days ago
•
12k
•
390
Green-Sky/bonsai-image-binary-4B-GGUF
Text-to-Image
•
4B
•
Updated
26 days ago
•
3.83k
•
13
liked
2 models
25 days ago
mradermacher/LMT-60-0.6B-GGUF
0.8B
•
Updated
25 days ago
•
540
•
6
JetBrains/Mellum2-12B-A2.5B-Thinking
Text Generation
•
12B
•
Updated
15 days ago
•
27.3k
•
307
liked
a model
26 days ago
huihui-ai/Huihui4-8B-A4B
Image-Text-to-Text
•
9B
•
Updated
Apr 27
•
52
•
17
liked
a model
29 days ago
Jackrong/Qwopus3.5-4B-Coder
Text Generation
•
5B
•
Updated
about 1 month ago
•
8.51k
•
15
liked
4 models
about 1 month ago
openbmb/MiniCPM5-1B
Text Generation
•
1B
•
Updated
May 26
•
332k
•
824
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
•
8B
•
Updated
Jan 1
•
820
•
•
41
nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF
Text Generation
•
4B
•
Updated
Mar 16
•
21.9k
•
166
nvidia/Nemotron-Labs-Diffusion-3B
Text Generation
•
4B
•
Updated
24 days ago
•
60.1k
•
31
Load more