Marcos V. Conde PRO

marcosv

https://mv-lab.github.io/

AI & ML interests

Artificial Intelligence, Neural Networks, Deep Learning , Computer Vision, Generative AI, Language Models, Inverse Problems, Computational Photography, Super-Resolution, Neural Fields

Organizations

upvoted 3 articles about 1 year ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene

•

Jun 3, 2025

• 355

Article

SmolVLM2: Bringing Video Understanding to Every Device

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 343

Article

AI Watermarking 101: Tools and Techniques

sasha, yjernite, derek-thomas, EmilyWitko, Ezi, JJoe206, reach-vb, BrigitteTousi, meg

•

Feb 26, 2024

• 27

upvoted a collection over 1 year ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

upvoted 2 articles over 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 419

Article

SmolLM - blazingly fast and remarkably powerful

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a collection over 1 year ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 566

upvoted an article over 1 year ago

Article

Mastering Long Contexts in LLMs with KVPress

nvidia

•

Jan 23, 2025

• 76

upvoted a collection over 1 year ago

Gradio WebRTC Cookbook ⚡️

Collection

Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated Dec 10, 2024 • 19

upvoted 2 papers over 1 year ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8, 2025 • 96

DarkIR: Robust Low-Light Image Restoration

Paper • 2412.13443 • Published Dec 18, 2024 • 4

upvoted a paper almost 2 years ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 135

upvoted 5 papers over 2 years ago

upvoted 3 papers almost 3 years ago

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

Paper • 2306.16928 • Published Jun 29, 2023 • 41

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Paper • 2306.17843 • Published Jun 30, 2023 • 44

NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement

Paper • 2306.11920 • Published Jun 20, 2023 • 3

Marcos V. Conde PRO

AI & ML interests

Organizations

marcosv's activity

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

SmolVLM2: Bringing Video Understanding to Every Device

AI Watermarking 101: Tools and Techniques

SmolVLM - small yet mighty Vision Language Model

SmolLM - blazingly fast and remarkably powerful

Mastering Long Contexts in LLMs with KVPress