The Elephant in the Coreference Room: Resolving Coreference in Full-Length French Fiction Works Paper • 2510.15594 • Published Oct 17, 2025 • 1
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53