Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Geonwoo Hong
Geonwoohong
Follow
AI & ML interests
Natural Language Processing, Multimodal Learning
Organizations
None yet
Geonwoohong
's datasets
9
Sort: Recently updated
Geonwoohong/pile-uncopyrighted-test-tokenized-gpt2
Viewer
•
Updated
Nov 14, 2025
•
180k
•
6
Geonwoohong/pile-uncopyrighted-train-tokenized-gpt2
Viewer
•
Updated
Nov 14, 2025
•
288M
•
4
Geonwoohong/wmt21-train-tokenized-sentencepiece
Viewer
•
Updated
Nov 8, 2025
•
8M
Geonwoohong/openwebtext-test-tokenized-gpt2
Viewer
•
Updated
Nov 3, 2025
•
977
•
8
•
1
Geonwoohong/cc100-en-test-tokenized-gpt2
Viewer
•
Updated
Nov 3, 2025
•
1.03k
•
6
Geonwoohong/wikitext-103-raw-v1-test-tokenized-gpt2
Viewer
•
Updated
Nov 3, 2025
•
280
•
2
Geonwoohong/modu-morph-train-encoded-ko
Updated
Oct 23, 2025
•
3
Geonwoohong/aihub-webcorpus-morph-train-tokenized-ko
Viewer
•
Updated
Oct 21, 2025
•
4.33M
•
3
Geonwoohong/lambada-openai-train-tokenized-gpt2
Viewer
•
Updated
Oct 7, 2025
•
5.15k
•
7