Optimize inference: cap ORT threads + parallel decode (~2x speedup) 8607c7c KevinAHM Claude Opus 4.6 commited on Feb 6