Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
21
1
12
Pedro Ortiz Suarez
pjox
Follow
21world's profile picture
naturelizer's profile picture
Fishtiks's profile picture
18 followers
·
21 following
https://portizs.eu/
pjox13
pjox
pjox
pjox.bsky.social
AI & ML interests
Language modeling, parsing, sequence tagging, NER, historical languages.
Recent Activity
published
a dataset
about 1 month ago
commoncrawl/CommonLID
updated
a dataset
about 1 month ago
commoncrawl/CommonLID
authored
a paper
about 1 month ago
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
View all activity
Organizations
pjox
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
10 months ago
BramVanroy/CommonCrawl-CreativeCommons
Viewer
•
Updated
Aug 28, 2025
•
739M
•
961
•
34
liked
a dataset
over 1 year ago
oscar-corpus/community-oscar
Preview
•
Updated
Nov 15, 2024
•
35
•
28
liked
a dataset
almost 2 years ago
cis-lmu/GlotCC-V1
Viewer
•
Updated
Nov 1, 2024
•
1.28B
•
666
•
57
liked
a dataset
about 2 years ago
allenai/MADLAD-400
Updated
Sep 9, 2024
•
54.9k
•
157
liked
a dataset
over 2 years ago
wikimedia/wikipedia
Viewer
•
Updated
Jan 9, 2024
•
61.6M
•
74.7k
•
1.15k
liked
a Space
almost 3 years ago
Running
6
Grobid CRF image
😻
6
Extract bibliographic data from PDFs
liked
a dataset
almost 3 years ago
leeminxji/doguri
Viewer
•
Updated
Feb 14, 2023
•
32
•
13
•
1
liked
a model
almost 3 years ago
DFKI-SLT/eurogpt2
Updated
Mar 29, 2023
•
6
liked
a Space
about 3 years ago
Running
57
Grobid
🌍
57
Extract bibliographic data from PDFs
liked
a dataset
almost 4 years ago
oscar-corpus/OSCAR-2201
Updated
Aug 6, 2025
•
1.81k
•
125
liked
2 datasets
over 4 years ago
oscar-corpus/OSCAR-2109
Updated
Aug 6, 2025
•
85
•
42
oscar-corpus/oscar
Updated
Sep 4, 2025
•
715
•
204