Hugging Face
🤝 Open to Collab
178.3 TFLOPS
Carlos García
cgarciams
0 followers · 6 following
CarlosJGarcia
AI & ML interests
Building a GPT-2 medium-sized model (approx. 400M parameters) from scratch, using PyTorch, the OpenWebText dataset, tiktoken, the AdamW optimizer, and FlashAttention. Just for fun.
cgarciams's activity
updated a model 7 days ago: cgarciams/gsp2_355m_sft (Text Generation • Updated 7 days ago)
published a model 7 days ago: cgarciams/gsp2_355m_sft (Text Generation • Updated 7 days ago)
updated a model 24 days ago: cgarciams/gpt_124m (Text Generation • Updated 24 days ago)
published a model 25 days ago: cgarciams/gpt_124m (Text Generation • Updated 24 days ago)
published a dataset about 1 month ago: cgarciams/hle-text (Updated May 22 • 80)