🤝 Open to Collab

2 12 43

Piergiorgio Di Pasquale

pierjoe

AI & ML interests

language models and text analysis

Recent Activity

liked a model 6 days ago

unsloth/GLM-5.2-GGUF

liked a model 6 days ago

zai-org/GLM-5.2

liked a model 9 days ago

inclusionAI/UI-Venus-1.5-2B

View all activity

Organizations

liked 2 models 6 days ago

unsloth/GLM-5.2-GGUF

Text Generation • 754B • Updated 2 days ago • 88.9k • 365

zai-org/GLM-5.2

Text Generation • 753B • Updated 2 days ago • 67.1k • • 2.41k

liked a model 9 days ago

inclusionAI/UI-Venus-1.5-2B

Image-Text-to-Text • 2B • Updated Feb 11 • 1.26k • 39

liked a model 10 days ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

Text Generation • 12B • Updated 6 days ago • 496k • 2.32k

updated a dataset 15 days ago

pierjoe/function-calling-synthetic-2000

Viewer • Updated 15 days ago • 1.58k • 108

published a dataset 15 days ago

pierjoe/function-calling-synthetic-2000

Viewer • Updated 15 days ago • 1.58k • 108

liked 2 datasets about 1 month ago

wikimedia/structured-wikipedia

Viewer • Updated May 19 • 10.5M • 18k • 384

HuggingFaceTB/smollm3-configs

Updated Aug 4, 2025 • 69 • 7

upvoted a collection about 1 month ago

Olmo 3 Pre-training

Collection

All artifacts related to Olmo 3 pre-training • 10 items • Updated Dec 23, 2025 • 36

liked a dataset about 1 month ago

allenai/dolma3_mix-6T-1025-7B

Updated Jan 15 • 70.7k • 53

upvoted a collection about 1 month ago

Nemotron v3 Pre-Training

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 13 days ago • 17

liked a model about 1 month ago

domyn/Domyn-Small-v1.0

Text Generation • 10B • Updated May 19 • 1.65k • 23

liked a dataset about 1 month ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 97k • 827

liked a model about 1 month ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 5.5M • • 2.23k

upvoted a collection about 2 months ago

SmolLM3 pretraining datasets

Collection

datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12, 2025 • 52

liked a model about 2 months ago

Qwen/Qwen3-30B-A3B-FP8

Text Generation • 31B • Updated Jul 26, 2025 • 504k • 84

liked 2 datasets about 2 months ago

nvidia/OpenCodeReasoning-2

Viewer • Updated May 17, 2025 • 2.16M • 2.08k • 57

bigcode/the-stack-v2-train-smol-ids

Viewer • Updated Apr 23, 2024 • 40.1M • 2.4k • 53

upvoted a paper about 2 months ago

Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset

Paper • 2508.15096 • Published Aug 20, 2025 • 11

upvoted an article about 2 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

Piergiorgio Di Pasquale

AI & ML interests

Recent Activity

Organizations

pierjoe's activity

SmolLM3: smol, multilingual, long-context reasoner