Add strategy=joint|isolated flag + empirical findings in docstring (joint default, 10x faster, same allocation as isolated per A vs B comparison on OLMo-2-13B) 002163f verified mxguru1 commited on 8 days ago