Enlarging max_position_embeddings to support long-context generation for the specific inference engine.
d806df7 (verified) · QipengGuo committed on Aug 28, 2025