GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 11 days ago • 36
view article Article Hugging Face and VirusTotal collaborate to strengthen AI security Oct 22, 2025 • 43
ZeroShot Medical & Clinical NER Collection OpenMed ZeroShot NER Models • 93 items • Updated Sep 15, 2025 • 24
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published Jun 3, 2025 • 53
view article Article Blazingly fast whisper transcriptions with Inference Endpoints +4 May 13, 2025 • 81
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published Apr 16, 2025 • 35
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors Paper • 2504.11427 • Published Apr 15, 2025 • 19
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated Dec 23, 2025 • 21
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205
distil-large-v3.5 Collection This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated Mar 25, 2025 • 9