bobby dilson's picture

bobby dilson

ybxh

·

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

google/diffusiongemma-26B-A4B-it

liked a model 11 days ago

google/gemma-4-26B-A4B-it

liked a model 11 days ago

google/gemma-4-12B-it

View all activity

Organizations

upvoted an article 8 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 315

upvoted a paper 9 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 517

upvoted 2 collections 9 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 247

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 204

upvoted a paper 12 months ago

ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing

Paper • 2506.21448 • Published Jun 26, 2025 • 9

upvoted 3 papers about 1 year ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 109

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

Paper • 2505.18079 • Published May 23, 2025 • 5

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published Apr 3, 2025 • 52

upvoted a paper over 1 year ago

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Paper • 2503.16418 • Published Mar 20, 2025 • 36

upvoted 2 collections over 1 year ago

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 213

Cosmos-Preidct1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 14 items • Updated 16 days ago • 304