Incorporating Domain Knowledge into Materials Tokenization Paper • 2506.11115 • Published Jun 9, 2025 • 3
KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters Paper • 2604.23948 • Published Apr 27
Polishing Every Facet of the GEM: Testing Linguistic Competence of LLMs and Humans in Korean Paper • 2506.01237 • Published Jun 2, 2025
SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models Paper • 2604.12377 • Published Apr 14