Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
15
4
9
Pietro Lesci
pietrolesci
Follow
makoya's profile picture
regisss's profile picture
GloriaMK's profile picture
18 followers
·
34 following
https://pietrolesci.github.io/
pietro_lesci
pietrolesci
pietrolesci
pietrolesci.bsky.social
AI & ML interests
I like developing and applying causal methods to study the effect of training choices on models’ behaviour, including memorisation, shortcut learning, and tokenisation.
Organizations
pietrolesci
's datasets
56
Sort: Recently updated
pietrolesci/unimixlm
Viewer
•
Updated
Jul 25, 2025
•
81.9M
•
85
pietrolesci/me-minipile-evals
Viewer
•
Updated
Jun 3, 2025
•
1.22M
•
20
pietrolesci/pile-deduped
Viewer
•
Updated
May 5, 2025
•
748M
•
8
pietrolesci/pythia-deduped-memorisation-profiles
Viewer
•
Updated
Apr 9, 2025
•
2.13M
•
13
pietrolesci/pile-validation
Viewer
•
Updated
Apr 9, 2025
•
429k
•
59
pietrolesci/pile-deduped-subset
Viewer
•
Updated
Apr 9, 2025
•
16.3k
•
15
pietrolesci/pythia-deduped-stats
Viewer
•
Updated
Apr 9, 2025
•
16.3M
•
83
pietrolesci/pythia-deduped-stats-raw
Viewer
•
Updated
Apr 9, 2025
•
14.9M
•
79.6k
pietrolesci/agnews
Viewer
•
Updated
Apr 9, 2025
•
510k
•
34
pietrolesci/amazoncat-13k
Viewer
•
Updated
Apr 9, 2025
•
5.99M
•
187
•
1
pietrolesci/wikitoxic
Viewer
•
Updated
Apr 9, 2025
•
894k
•
619
•
1
pietrolesci/multiwoz_all_versions
Viewer
•
Updated
Apr 9, 2025
•
82k
•
24
•
1
pietrolesci/anchoral-paper-artefacts
Viewer
•
Updated
Apr 9, 2025
•
2.78M
•
75
pietrolesci/pile-deduped-pythia-preshuffled
Viewer
•
Updated
Mar 25, 2025
•
244M
•
41
pietrolesci/pile-deduped-pythia-tokfreq
Viewer
•
Updated
Mar 17, 2025
•
50.1k
•
4
pietrolesci/finewebedu-20B
Viewer
•
Updated
Mar 16, 2025
•
40.4M
•
93
pietrolesci/minipile
Viewer
•
Updated
Feb 27, 2025
•
6.06M
•
22
pietrolesci/opus-5langs-1M
Viewer
•
Updated
Dec 10, 2024
•
5M
•
6
pietrolesci/opus-raw
Viewer
•
Updated
Nov 27, 2024
•
4.06B
•
158
pietrolesci/pythia-pile-stats
Viewer
•
Updated
Sep 23, 2024
•
113M
•
2
pietrolesci/slim-pajama-eval
Viewer
•
Updated
Sep 16, 2024
•
1.84M
•
1
•
1
pietrolesci/pile-subset
Updated
Sep 13, 2024
•
12
pietrolesci/cmnist
Viewer
•
Updated
Jul 29, 2024
•
308k
•
5
pietrolesci/celeba-wilds
Viewer
•
Updated
Jul 2, 2024
•
203k
•
2
•
1
pietrolesci/civilcomments-wilds
Viewer
•
Updated
Jul 2, 2024
•
893k
•
36
•
2
pietrolesci/mnli-stats
Viewer
•
Updated
May 13, 2024
•
785k
•
3
pietrolesci/mnli-embeddings
Viewer
•
Updated
Mar 22, 2024
•
785k
•
4
pietrolesci/_mnli-stats
Viewer
•
Updated
Mar 20, 2024
•
15.7M
•
38
pietrolesci/wikitext-103-raw-v1_gpt2-20k
Viewer
•
Updated
Nov 16, 2023
•
8.01M
•
50
pietrolesci/yahoo_answers_topics
Viewer
•
Updated
Sep 25, 2023
•
2.92M
•
13
Previous
1
2
Next