Turkish Benchmarking Sets Collection A collection of benchmarking sets for Turkish NLP. • 5 items • Updated 15 days ago • 2
Turkish Subwords Research Collection A collection of models, tokenizers, and test sets for the research work "Optimal Turkish Subword Strategies at Scale". The models are experimental. • 35 items • Updated about 1 month ago • 2
Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay Paper • 2602.06942 • Published Feb 6 • 3