Albert Catalan-Tatjer

aldakata

https://aldakata.github.io/

AI & ML interests

Efficiency

Recent Activity

upvoted a collection about 17 hours ago

Qwen-AgentWorld

Collection

3 items • Updated 1 day ago • 33

liked a model 2 days ago

unsloth/GLM-5.2-GGUF

Text Generation • 754B • Updated 1 day ago • 77k • 359

liked a model 3 days ago

fixed-point-reasoners/fprm

Updated 3 days ago • 1

liked 2 models about 1 month ago

talkie-lm/talkie-1930-13b-it

Updated Apr 23 • 282

JonasGeiping/stream-qwen3.5-27b

Text Generation • 27B • Updated May 13 • 293 • 22

New activity in JonasGeiping/stream-qwen3-8b about 1 month ago

typo

#1 opened about 1 month ago by aldakata

liked a model about 1 month ago

JonasGeiping/stream-qwen3-8b

Text Generation • 8B • Updated May 13 • 241 • 6

New activity in allenai/OLMo-2-0425-1B 3 months ago

Main revision

#5 opened 9 months ago by aldakata

liked 2 datasets 5 months ago

ricdomolm/MATH-500

Viewer • Updated Feb 6, 2025 • 12.5k • 264 • 4

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 622 • 39

upvoted a paper 5 months ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 36

upvoted a collection 5 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 171

liked a model 7 months ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated Nov 27, 2025 • 698 • 699

liked a model 8 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 9.84k • 1.46k

liked 2 Spaces 8 months ago

The Smol Training Playbook

📚 The secrets to building world-class LLMs • 3.22k

The Ultra-Scale Playbook

🌌 The ultimate guide to training LLM on large GPU Clusters • 3.9k

upvoted an article 8 months ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

Article • not-lain • Jan 30, 2025 • 351

liked a dataset 8 months ago

bigcode/starcoderdata

Viewer • Updated May 16, 2023 • 207M • 24.5k • 523

authored a paper 8 months ago

Training Dynamics Impact Post-Training Quantization Robustness

Paper • 2510.06213 • Published Oct 7, 2025 • 3

upvoted a paper 8 months ago

Training Dynamics Impact Post-Training Quantization Robustness

Paper • 2510.06213 • Published Oct 7, 2025 • 3
