Krzysztof Sopyla

ksopyla

13 96

https://ksopyla.com

AI & ML interests

NLP, knowledge extraction, knowledge graphs, semantic similarity, model factfulness

Recent Activity

liked a dataset 9 days ago

OptimalScale/ClimbLab

liked a dataset about 1 month ago

gvlassis/ClimbMix

liked a dataset about 1 month ago

OptimalScale/ClimbMix

View all activity

Organizations

liked a dataset 9 days ago

OptimalScale/ClimbLab

Viewer • Updated May 4, 2025 • 1.24B • 3.32k • 14

liked 2 datasets about 1 month ago

gvlassis/ClimbMix

Viewer • Updated May 11, 2025 • 553M • 2.45k • 7

OptimalScale/ClimbMix

Viewer • Updated May 4, 2025 • 395M • 4.26k • 34

liked 4 datasets 6 months ago

liked a dataset 7 months ago

JeanKaddour/minipile

Viewer • Updated Jun 20, 2023 • 1.01M • 4.28k • 149

liked 3 Spaces 11 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

Predict Memory

🧮

110

Estimate model memory usage and see detailed plots

The Ultra-Scale Playbook

🌌

3.91k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

krasserm/perceiver-io-mlm

Fill-Mask • 0.2B • Updated Aug 13, 2025 • 12 • 1

liked a dataset about 1 year ago

allenai/olmo-mix-1124

Viewer • Updated Aug 19, 2025 • 621M • 55.7k • 88

liked 4 datasets over 1 year ago

nampdn-ai/tiny-orca-textbooks

Viewer • Updated Sep 28, 2023 • 147k • 44 • 43

nampdn-ai/tiny-textbooks

Viewer • Updated Jul 3, 2024 • 420k • 515 • 179

allenai/dolmino-mix-1124

Viewer • Updated Oct 29, 2025 • 170M • 11.1k • 97

bookcorpus/bookcorpus

Updated May 3, 2024 • 9.6k • 357

liked a model over 1 year ago

nomic-ai/nomic-bert-2048

Fill-Mask • 0.1B • Updated Apr 29, 2025 • 2.01k • 54

liked a dataset over 1 year ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 175k • 1.26k