feat: Irminsul — Llama 3.1 8B QLoRA RAG serving stack with Pinecone, FastAPI, Azure ef5f450 MukulRay commited on Mar 19