Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
Xiulin Yang
xiulinyang
Follow
0 followers
·
2 following
xiulinyang
AI & ML interests
Language Modeling, Interpretability, (compositional) generalization, tokenization
Organizations
None yet
xiulinyang
's models
326
Sort: Recently updated
xiulinyang/gpt2_small_baby_100M_32768_53
Updated
Nov 3, 2025
xiulinyang/gpt2_small_wiki_100M_32768_53
Updated
Nov 3, 2025
xiulinyang/dynamic_att20
5.52M
•
Updated
Oct 28, 2025
xiulinyang/linear_att20
5.52M
•
Updated
Oct 28, 2025
xiulinyang/linear_att
5.52M
•
Updated
Oct 28, 2025
xiulinyang/dynamic_att
5.52M
•
Updated
Oct 28, 2025
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_42
Updated
Oct 28, 2025
•
1
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_76
Updated
Oct 28, 2025
xiulinyang/gpt2_mini_baby-dyck_10Mf_32768_53
Updated
Oct 27, 2025
xiulinyang/pretraining-10Mf-gpt2-small-42
Text Generation
•
29.6M
•
Updated
Oct 27, 2025
•
1
xiulinyang/pre-pretraining-10Mf-gpt2-small-42
Text Generation
•
29.6M
•
Updated
Oct 27, 2025
•
2
xiulinyang/pretraining-gpt2-mini-10Mf
Text Generation
•
29.6M
•
Updated
Oct 27, 2025
xiulinyang/pre-pretraining-gpt2-mini-10Mf
Text Generation
•
29.6M
•
Updated
Oct 27, 2025
•
1
xiulinyang/replicate_dyn_linear
5.52M
•
Updated
Oct 26, 2025
xiulinyang/replicate_linear_attention
5.52M
•
Updated
Oct 26, 2025
xiulinyang/pre-pretraining-10Mf-40k
Updated
Oct 26, 2025
xiulinyang/replicate_dynamic_attention
5.52M
•
Updated
Oct 26, 2025
xiulinyang/pre-pretraining-10Mf-10k-53
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
xiulinyang/pretraining-10Mf-10k-53
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
xiulinyang/pretraining-10Mf-10k-42
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
xiulinyang/pre-pretraining-10Mf-10k-42
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
xiulinyang/pre-pretraining-10Mf-10k
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
•
1
xiulinyang/pretraining-10Mf-10k
Text Generation
•
0.2B
•
Updated
Oct 26, 2025
xiulinyang/linear_attention_10Mf
11.8M
•
Updated
Oct 25, 2025
xiulinyang/linear_attention_10M
11.8M
•
Updated
Oct 25, 2025
xiulinyang/dynamic_attention_10M
11.8M
•
Updated
Oct 25, 2025
xiulinyang/gpt2_small_wiki_50M_32768_76
Updated
Oct 25, 2025
xiulinyang/dynamtic_attention_10Mf_512
11.8M
•
Updated
Oct 25, 2025
xiulinyang/gpt2_xxs_baby_10Mf_32768_42_300k
Updated
Oct 25, 2025
xiulinyang/dynamic_attention_10Mf
11.8M
•
Updated
Oct 25, 2025
Previous
1
2
3
4
5
...
11
Next