Commit History

Fix: Correct base model from Qwen3-2B to Qwen3-8B
b7d3975
verified

kunhunjon commited on

Update to continuous batching model (batch_size=4, neuronxcc 2.21)
9452995
verified

kunhunjon commited on

Add pipeline_tag metadata and model documentation
6223ae4
verified

kunhunjon commited on

Upload Neuron-traced ChessLM model for vLLM inference on AWS Trainium
4e0e9cf
verified

kunhunjon commited on

initial commit
3005ff3
verified

kunhunjon commited on