I would add that in the era of AI Agents, deploying a new hyperspecialized agent with a very narrow domain and less prompts/context, would lead to less hallucinations and less token comsuption. It's cheap and convenient, so is the most probable path in my opinion.