Hugging Face
Xiulin Yang
xiulinyang
AI & ML interests
Language Modeling, Interpretability, (compositional) generalization, tokenization
Recent Activity
Updated a model 26 days ago: xiulinyang/GPT2_bigram_function_within_boundary_53
Published a model 26 days ago: xiulinyang/GPT2_bigram_function_within_boundary_53
Updated a model 26 days ago: xiulinyang/GPT2_five_function_within_boundary_53
Organizations
None yet
xiulinyang's models (329), sorted by most recently updated
xiulinyang/output_dynamic_10Mf_20 • 29.9M • Updated Nov 14, 2025 • 2
xiulinyang/gpt2_small_wiki_100M_32768_76 • Updated Nov 4, 2025 • 1
xiulinyang/gpt2_small_baby_100M_32768_76 • Updated Nov 4, 2025 • 1
xiulinyang/gpt2_small_baby_100M_32768_53 • Updated Nov 3, 2025 • 2
xiulinyang/gpt2_small_wiki_100M_32768_53 • Updated Nov 3, 2025 • 2
xiulinyang/dynamic_att20 • 5.52M • Updated Oct 28, 2025 • 1
xiulinyang/linear_att20 • 5.52M • Updated Oct 28, 2025 • 1
xiulinyang/linear_att • 5.52M • Updated Oct 28, 2025 • 1
xiulinyang/dynamic_att • 5.52M • Updated Oct 28, 2025 • 2
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_42 • Updated Oct 28, 2025 • 2
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_76 • Updated Oct 28, 2025 • 2
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_53 • Updated Oct 27, 2025 • 3
xiulinyang/pretraining-10Mf-gpt2-small-42 • Text Generation • 29.6M • Updated Oct 27, 2025 • 1
xiulinyang/pre-pretraining-10Mf-gpt2-small-42 • Text Generation • 29.6M • Updated Oct 27, 2025 • 2
xiulinyang/pretraining-gpt2-mini-10Mf • Text Generation • 29.6M • Updated Oct 27, 2025 • 2
xiulinyang/pre-pretraining-gpt2-mini-10Mf • Text Generation • 29.6M • Updated Oct 27, 2025 • 3
xiulinyang/replicate_dyn_linear • 5.52M • Updated Oct 26, 2025 • 3
xiulinyang/replicate_linear_attention • 5.52M • Updated Oct 26, 2025 • 2
xiulinyang/pre-pretraining-10Mf-40k • Updated Oct 26, 2025
xiulinyang/replicate_dynamic_attention • 5.52M • Updated Oct 26, 2025 • 2
xiulinyang/pre-pretraining-10Mf-10k-53 • Text Generation • 0.2B • Updated Oct 26, 2025 • 2
xiulinyang/pretraining-10Mf-10k-53 • Text Generation • 0.2B • Updated Oct 26, 2025 • 2
xiulinyang/pretraining-10Mf-10k-42 • Text Generation • 0.2B • Updated Oct 26, 2025 • 2
xiulinyang/pre-pretraining-10Mf-10k-42 • Text Generation • 0.2B • Updated Oct 26, 2025 • 3
xiulinyang/pre-pretraining-10Mf-10k • Text Generation • 0.2B • Updated Oct 26, 2025 • 1
xiulinyang/pretraining-10Mf-10k • Text Generation • 0.2B • Updated Oct 26, 2025 • 2
xiulinyang/linear_attention_10Mf • 11.8M • Updated Oct 25, 2025 • 2
xiulinyang/linear_attention_10M • 11.8M • Updated Oct 25, 2025
xiulinyang/dynamic_attention_10M • 11.8M • Updated Oct 25, 2025 • 1
xiulinyang/gpt2_small_wiki_50M_32768_76 • Updated Oct 25, 2025 • 2