Update to continuous batching model (batch_size=4, neuronxcc 2.21) 9452995 verified kunhunjon commited on Nov 20, 2025
Upload Neuron-traced ChessLM model for vLLM inference on AWS Trainium 4e0e9cf verified kunhunjon commited on Nov 20, 2025