Hugging Face
Xiulin Yang
xiulinyang
0 followers · 2 following
AI & ML interests
Language Modeling, Interpretability, (compositional) generalization, tokenization
Organizations
None yet
xiulinyang's models (326)
Sort: Recently updated
xiulinyang/gpt2_mini_wiki_10M_32768_42 • Updated Oct 17, 2025
xiulinyang/gpt2_small_baby_50M_32768_42f • Updated Oct 17, 2025 • 1
xiulinyang/gpt2_small_baby_30M_32768_42f • Updated Oct 17, 2025
xiulinyang/gpt2_small_baby_10M_32768_42f • Updated Oct 16, 2025
xiulinyang/gpt2_xs_baby_30M_32768_42 • Updated Oct 14, 2025
xiulinyang/gpt2_xs_baby_100M_32768_42 • Updated Oct 14, 2025
xiulinyang/gpt2_xs_baby_50M_32768_42 • Updated Oct 14, 2025
xiulinyang/gpt2_tiny_baby_100M_32768_42 • Updated Oct 12, 2025
xiulinyang/gpt2_tiny_baby_50M_32768_42 • Updated Oct 12, 2025
xiulinyang/gpt2_mini_baby_30M_32768_42 • Updated Oct 10, 2025
xiulinyang/gpt2_small_baby_50M_32768_42_b • Updated Oct 10, 2025
xiulinyang/gpt2_mini_baby_50M_32768_42 • Updated Oct 10, 2025
xiulinyang/gpt2_xs_baby_10M_32768_42 • Updated Oct 10, 2025
xiulinyang/gpt2_small_baby_100M_32768_42_b • Updated Oct 10, 2025
xiulinyang/gpt2_tiny_baby_30M_32768_42 • Updated Oct 9, 2025
xiulinyang/gpt2_small_baby_10M_32768_42 • Updated Oct 9, 2025
xiulinyang/gpt2_mini_baby_10M_32768_42_b • Updated Oct 9, 2025
xiulinyang/gpt2_small_baby_30M_32768_42_b • Updated Oct 9, 2025
xiulinyang/gpt2_tiny_baby_10M_32768_42 • Updated Oct 9, 2025
xiulinyang/gpt2_tiny_EN_unigram_350_42 • Updated Oct 6, 2025
xiulinyang/gpt2_tiny_EN_bpe_350_42 • Updated Oct 6, 2025
xiulinyang/gpt2_tiny_EN_superbpe_350_42 • Updated Oct 6, 2025
xiulinyang/gpt2_tiny_EN_superbpe_300_42 • Updated Oct 6, 2025
xiulinyang/gpt2_tiny_EN_bpe_300_42 • Updated Oct 5, 2025
xiulinyang/gpt2_mini_EN_bpe_500_42 • Updated Oct 5, 2025
xiulinyang/gpt2_mini_EN_bpe_1000_42 • 13.4M • Updated Sep 29, 2025
xiulinyang/fox_no_rope • Text Generation • 27.4M • Updated Aug 12, 2025 • 1
xiulinyang/alibi • Text Generation • 27.4M • Updated Aug 10, 2025 • 1
xiulinyang/transformer • Text Generation • 27.4M • Updated Aug 10, 2025 • 4
xiulinyang/forgetting_transformer • Text Generation • 27.4M • Updated Aug 10, 2025