transformers sentence_transformers faiss-gpu peft trl accelerate bitsandbytes datasets fastapi uvicorn[standard] llama-cpp-python