Darwin-60B-DUO / tokenizer_info.json
SeaWolf-AI's picture
Initial release — Darwin-60B-DUO (Hybrid-A: Route 70% / Split-Refine 20% / Ensemble V_1 10%)
c2bfdba verified
raw
history blame contribute delete
773 Bytes
{
"_note": "Darwin-60B-DUO uses constituent tokenizers via gateway, not a unified one.",
"constituent_tokenizers": {
"darwin-28r": {
"source_model": "FINAL-Bench/Darwin-28B-REASON",
"tokenizer_family": "qwen3_5",
"vocab_size_estimate": 151936
},
"awaxis-31b": {
"source_model": "Anserwise/AWAXIS-Think-31B",
"tokenizer_family": "gemma4",
"vocab_size_estimate": 262144
}
},
"routing_decision_layer": "language detection + domain classification (gateway/router.py) performs tokenization-free routing on the raw text before backend selection",
"downstream_token_handling": "Each backend (vLLM serving the respective base model) handles its own tokenization. The gateway operates on text strings, not token IDs."
}