Daniel Serrano's picture

Daniel Serrano

dnlserrano

·

https://dnlserrano.dev

AI & ML interests

computer vision, biometrics, face, facial recognition, deepfakes, pad, mad, age, bias

Recent Activity

liked a model 6 days ago

CohereLabs/North-Mini-Code-1.0

liked a model 4 months ago

microsoft/Phi-4-reasoning-vision-15B

liked a Space 4 months ago

baohuynhbk14/Qwen3-VL-Multimodal-Search-DEMO

View all activity

Organizations

None yet

upvoted 2 collections 9 months ago

FastVLM

Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 114

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 64

upvoted a collection about 1 year ago

Nomic Embed Vision

Vision Encoders aligned to Nomic Embed Text making Nomic Embed multimodal! • 2 items • Updated Jun 5, 2024 • 11

upvoted an article over 1 year ago

Article

Introduction to ggml

+1

ngxson, ggerganov, slaren

•

Aug 13, 2024

• 294

upvoted a paper over 1 year ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 148