Fix Chinese preprocessing to match upstream e0a2bda Zihan428 Claude Opus 4.8 (1M context) commited on 22 days ago