2 3

Santiago Galiano Segura

sgs29

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

gplsi/Aitana-2B-S-Instruct-Aligned

updated a model about 2 months ago

gplsi/Aitana-2B-S-Instruct

liked a Space 2 months ago

HuggingFaceTB/trl-distillation-trainer

View all activity

Organizations

updated a model about 1 month ago

gplsi/Aitana-2B-S-Instruct-Aligned

Text Generation • 2B • Updated 18 days ago • 80

updated a model about 2 months ago

gplsi/Aitana-2B-S-Instruct

Text Generation • 2B • Updated 18 days ago • 72

liked a Space 2 months ago

Distilling 100B+ Models 40x Faster with TRL

📝

TRL distillation for 100B+ teachers, 40x faster

updated 2 models 3 months ago

gplsi/Aitana-7B-S-base

Text Generation • 8B • Updated about 1 month ago • 85

gplsi/Aitana-7B-S-Instruct

Text Generation • 8B • Updated 18 days ago • 490

upvoted an article 4 months ago

Article

Everything You Need to Know about Knowledge Distillation

Kseniase

•

Mar 6, 2025

• 82

updated a model 7 months ago

gplsi/Aitana-2B-S-base

Text Generation • 2B • Updated about 1 month ago • 116

upvoted a collection 7 months ago

Salamandra 🦎

Collection

13 items • Updated Mar 2 • 62

updated a model 8 months ago

gplsi/Aitana-2B-S-LF

Text Generation • 2B • Updated 18 days ago • 17

published a model 8 months ago

gplsi/Aitana-2B-S-LF

Text Generation • 2B • Updated 18 days ago • 17

updated a model 8 months ago

sgs29/Taxi-v3

Reinforcement Learning • Updated Oct 31, 2025

published a model 8 months ago

sgs29/Taxi-v3

Reinforcement Learning • Updated Oct 31, 2025

updated a model 8 months ago

sgs29/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Oct 31, 2025

published a model 8 months ago

sgs29/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Oct 31, 2025

updated a model 8 months ago

sgs29/ppo-LunarLander-v3

Reinforcement Learning • Updated Oct 31, 2025 • 2

published a model 8 months ago

sgs29/ppo-LunarLander-v3

Reinforcement Learning • Updated Oct 31, 2025 • 2

liked 2 Spaces 9 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.9k

The ultimate guide to training LLM on large GPU Clusters

updated a dataset 9 months ago

gplsi/truthfulqa_va

Viewer • Updated Nov 5, 2025 • 817 • 76

Santiago Galiano Segura

AI & ML interests

Recent Activity

Organizations

sgs29's activity

Distilling 100B+ Models 40x Faster with TRL

Everything You Need to Know about Knowledge Distillation

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook