AksaraPixelLM-Models Collection Pixel-based LMs trained on Aksara dataset • 2 items • Updated 3 days ago
AksaraPixelLM-Models Collection Pixel-based LMs trained on Aksara dataset • 2 items • Updated 3 days ago
Anthropogenic Regional Adaptation in Multimodal Vision-Language Model Paper • 2604.11490 • Published Apr 13 • 16
A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information Paper • 2503.00417 • Published Mar 1, 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts Paper • 2502.18148 • Published Feb 25, 2025
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Paper • 2406.19349 • Published Jun 27, 2024
Replicable Benchmarking of Neural Machine Translation (NMT) on Low-Resource Local Languages in Indonesia Paper • 2311.00998 • Published Nov 2, 2023
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 32
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Paper • 2406.19349 • Published Jun 27, 2024
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences Paper • 2410.02381 • Published Oct 3, 2024 • 1
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16, 2024 • 32