Santiago Garcia

santyzenith

1 47 339

AI & ML interests

Large language models, Natural Language Processing, Computer Vision, Spanish Large language models.

Recent Activity

liked a model 2 days ago

google/tabfm-1.0.0-pytorch

liked a model 18 days ago

meta-llama/Prompt-Guard-86M

liked a model 19 days ago

google/diffusiongemma-26B-A4B-it

View all activity

Organizations

upvoted a collection about 1 month ago

Cosmos3

Collection

Omnimodal World Models for Physical AI • 18 items • Updated 1 day ago • 138

upvoted a collection 2 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 22 days ago • 176

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted 2 articles 9 months ago

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 148

Article

Finally, a Replacement for BERT: Introducing ModernBERT

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo

•

Dec 19, 2024

• 748

upvoted an article 10 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

abidlabs, znation, nouamanetazi, sasha, qgallouedec

•

Jul 29, 2025

• 225

upvoted 2 articles about 1 year ago

Article

A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality

johndang-cohere, shivalikasingh, dsouzadaniel, ArashAhmadian

•

Oct 24, 2024

• 64

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted 3 collections over 1 year ago

upvoted a collection almost 2 years ago

LLM2Vec

Collection

21 items • Updated Dec 2, 2025 • 52

upvoted 3 articles almost 2 years ago

Article

Train a Llama model from scratch

nroggendorff

•

Jul 29, 2024

• 57

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 538

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif

•

Mar 9, 2023

• 72

upvoted 2 papers almost 2 years ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles almost 2 years ago

Article

Introduction to Graph Machine Learning

clefourrier

•

Jan 3, 2023

• 55

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq

•

Jul 23, 2024

• 241

Article

Welcome Gemma 2 - Google’s new open LLM

philschmid, osanseviero, pcuenq, lewtun, tomaarsen, reach-vb

•

Jun 27, 2024

• 132

Santiago Garcia

AI & ML interests

Recent Activity

Organizations

santyzenith's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

mmBERT: ModernBERT goes Multilingual

Finally, a Replacement for BERT: Introducing ModernBERT

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality

Vision Language Models (Better, faster, stronger)

Train a Llama model from scratch

Vision Language Models Explained

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Introduction to Graph Machine Learning

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Welcome Gemma 2 - Google’s new open LLM