xiulinyang's Collections
PoSH-Bench
Updated Nov 10, 2025
This collection contains the models I trained for the PoSH-Bench paper.
xiulinyang/gpt2_small_baby_100M_32768_42 • Updated Oct 19, 2025 • 3
xiulinyang/gpt2_small_wiki_100M_32768_42 • Updated Oct 18, 2025 • 6
xiulinyang/gpt2_small_wiki_100M_32768_53 • Updated Nov 3, 2025 • 2
xiulinyang/gpt2_small_baby_100M_32768_76 • Updated Nov 4, 2025 • 3
xiulinyang/gpt2_mini_baby_100M_32768_42 • Updated Oct 20, 2025 • 5
xiulinyang/gpt2_small_baby_100M_32768_42_c • Updated Oct 18, 2025 • 3
xiulinyang/gpt2_small_baby_50M_32768_42 • Updated Oct 19, 2025 • 7
xiulinyang/gpt2_small_wiki_50M_32768_42 • Updated Oct 18, 2025 • 7
xiulinyang/gpt2_small_baby_30Mf_32768_42 • Updated Oct 19, 2025 • 5
xiulinyang/gpt2_xxs_baby_10Mf_32768_42_300k • Updated Oct 25, 2025 • 3
xiulinyang/gpt2_small_baby_30M_32768_42 • Updated Oct 19, 2025 • 6
xiulinyang/pretraining-10Mf-10k-42 • Text Generation • 0.2B • Updated Oct 26, 2025 • 2
xiulinyang/gpt2_small_baby_50Mf_32768_42 • Updated Oct 19, 2025 • 7
xiulinyang/gpt2_mini_baby_10Mf_32768_42 • Updated Oct 18, 2025 • 6
xiulinyang/gpt2_small_baby_100M_32768_53 • Updated Nov 3, 2025 • 3
xiulinyang/pre-pretraining-10Mf-10k-42 • Text Generation • 0.2B • Updated Oct 26, 2025 • 3
xiulinyang/gpt2_small_wiki_100M_32768_76 • Updated Nov 4, 2025 • 3
xiulinyang/gpt2_small_baby_50M_32768_42f • Updated Oct 17, 2025 • 4
xiulinyang/gpt2_small_wiki_50M_32768_76 • Updated Oct 25, 2025 • 3
xiulinyang/gpt2_small_baby_30Mf_32768_76 • Updated Oct 25, 2025 • 4
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_53 • Updated Oct 27, 2025 • 3
xiulinyang/gpt2_small_wiki_30M_32768_42 • Updated Oct 17, 2025 • 4
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_76 • Updated Oct 28, 2025 • 3
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_42 • Updated Oct 28, 2025 • 4
xiulinyang/gpt2_mini_baby_10M_32768_42 • Updated Oct 18, 2025 • 7
xiulinyang/gpt2_small_baby_30M_32768_42f • Updated Oct 17, 2025 • 4
xiulinyang/gpt2_xs_baby_30M_32768_42 • Updated Oct 14, 2025 • 4
xiulinyang/gpt2_xs_baby_50M_32768_42 • Updated Oct 14, 2025 • 7
xiulinyang/pretraining-10Mf-gpt2-small-42 • Text Generation • 29.6M • Updated Oct 27, 2025 • 4
xiulinyang/gpt2_mini_baby_10M_32768_53 • Updated Oct 24, 2025 • 4
xiulinyang/gpt2_small_wiki_50M_32768_53 • Updated Oct 24, 2025 • 3
xiulinyang/gpt2_tiny_baby_50M_32768_42 • Updated Oct 12, 2025 • 7
xiulinyang/gpt2_mini_wiki_10M_32768_42 • Updated Oct 17, 2025 • 6
xiulinyang/gpt2_small_baby_10M_32768_42 • Updated Oct 9, 2025 • 5
xiulinyang/gpt2_mini_wiki_10M_32768_76 • Updated Oct 25, 2025 • 3
xiulinyang/pretraining-gpt2-mini-10Mf • Text Generation • 29.6M • Updated Oct 27, 2025 • 4
xiulinyang/gpt2_mini_baby_10Mf_32768_53 • Updated Oct 23, 2025 • 5
xiulinyang/gpt2_mini_wiki_10M_32768_53 • Updated Oct 23, 2025 • 3
xiulinyang/gpt2_small_baby_10M_32768_42f • Updated Oct 16, 2025 • 3
xiulinyang/gpt2_mini_baby_10Mf_32768_76 • Updated Oct 24, 2025 • 2
xiulinyang/pre-pretraining-10Mf-10k • Text Generation • 0.2B • Updated Oct 26, 2025 • 3
xiulinyang/gpt2_small_baby_30Mf_32768_53 • Updated Oct 24, 2025 • 2
xiulinyang/gpt2_small_baby_30M_32768_53 • Updated Oct 25, 2025 • 4
xiulinyang/pretraining-10Mf-10k-53 • Text Generation • 0.2B • Updated Oct 26, 2025 • 4