Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
tanaymehta
's Collections
Thesis Models
Thesis Models
updated
Jul 27, 2024
All the GPT2 variants I have trained for my Masters Thesis
Upvote
-
tanaymehta/gpt2_12H_12L_75M_tok_20_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_12H_10L_75M_tok_20_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_12H_8L_75M_tok_20_eps
Text Generation
•
96.1M
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_12H_6L_75M_tok_20_eps
Text Generation
•
81.9M
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_12H_4L_75M_tok_20_eps
Text Generation
•
67.7M
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_12H_2L_75M_tok_20_eps
Text Generation
•
53.6M
•
Updated
Jul 29, 2024
•
5
tanaymehta/gpt2_12H_12L_15M_tok_100_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_12H_12L_10M_tok_100_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_12H_12L_5M_tok_100_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_8H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_6H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_4H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_2H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_2M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_1_9M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_8M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_7M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_6M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_5M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_4M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
1
tanaymehta/gpt2_1_3M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_1_2M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_1_1M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_1M_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_900K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_800K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_700K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_600K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
4
tanaymehta/gpt2_500K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_400K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
5
tanaymehta/gpt2_300K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
3
tanaymehta/gpt2_200K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
2
tanaymehta/gpt2_100K_100eps
Text Generation
•
0.1B
•
Updated
Jul 29, 2024
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections