Fix: Correct base model from Qwen3-2B to Qwen3-8B b7d3975 verified kunhunjon committed on Nov 24, 2025
Update to continuous batching model (batch_size=4, neuronxcc 2.21) 9452995 verified kunhunjon committed on Nov 20, 2025
Add pipeline_tag metadata and model documentation 6223ae4 verified kunhunjon committed on Nov 20, 2025
Upload Neuron-traced ChessLM model for vLLM inference on AWS Trainium 4e0e9cf verified kunhunjon committed on Nov 20, 2025