Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 29 days ago • 60
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published May 7 • 55
waxal-benchmarking/whisper-tiny-sna-candace Automatic Speech Recognition • 37.8M • Updated Apr 16 • 2
waxal-benchmarking/whisper-small-sna-candace Automatic Speech Recognition • 0.2B • Updated Apr 16 • 2