Commit History

Update to continuous batching model (batch_size=4, neuronxcc 2.21)
9452995
verified

kunhunjon committed on

Upload Neuron-traced ChessLM model for vLLM inference on AWS Trainium
4e0e9cf
verified

kunhunjon committed on