fastapi
uvicorn[standard]
torch
transformers
accelerate
sentencepiece
exllamav2
gptqmodel
#torch
#sentencepiece
#protobuf
#transformers>=4.37.0
#bitsandbytes>=0.41.3
#accelerate>=0.27.0
#optimum
#gguf>=0.10.0
#einops
#transformers_stream_generator
#autoawq
#safetensors
#intel_extension_for_pytorch