Matt Cool
mbcool
AI & ML interests
Open Source, local and offline.
Recent Activity
liked a model about 5 hours ago
owensong/Inflect-Nano-v1 liked a Space 6 months ago
nvidia/nemotron-speech-streaming-en-0.6b commentedon an article about 1 year ago
KV Caching Explained: Optimizing Transformer Inference Efficiency