A Multilingual Dataset and Model for Information Extraction from News Web Pages
ISPRAS Crawlers
community
AI & ML interests
Web scraping, web data extraction, information extraction
Recent Activity
Organization Card
Research group at the Institute for System Programming of the Russian Academy of Sciences focused on web data collection.
models
5
ispras-crawlers/newsxlm-domlm-ae
Token Classification
•
0.3B
•
Updated
•
58
ispras-crawlers/newsxlm-markuplm-en-ae
Token Classification
•
0.1B
•
Updated
•
24
ispras-crawlers/newsxlm-xlmroberta-ae
Token Classification
•
0.3B
•
Updated
•
23
ispras-crawlers/newsxlm-markuplm-ae
Token Classification
•
0.1B
•
Updated
•
14
ispras-crawlers/newsxlm-domlm-pretrained
0.3B
•
Updated
•
34