fix: rename README, update demo links, add live demo URL
Browse files- Readme.MD +8 -4
- docs/demo.html +1 -1
- pyvenv.cfg +5 -0
Readme.MD
CHANGED
|
@@ -1,9 +1,11 @@
|
|
| 1 |
-
#
|
| 2 |
|
| 3 |
> Fine-tuned Llama 3.1 8B · QLoRA · Pinecone RAG · FastAPI · Azure Container Apps
|
| 4 |
|
| 5 |
A full end-to-end LLMOps serving stack — from a QLoRA fine-tuned model running in 4-bit NF4 on consumer hardware, through a retrieval-augmented generation pipeline, to a containerized API deployed on Azure. Built to be production-shaped, not just a demo.
|
| 6 |
|
|
|
|
|
|
|
| 7 |
---
|
| 8 |
|
| 9 |
## What this is
|
|
@@ -64,7 +66,7 @@ Grounded answer + source attribution
|
|
| 64 |
```bash
|
| 65 |
# 1. Clone and set up environment
|
| 66 |
git clone https://github.com/MukulRay1603/Irminsul.git
|
| 67 |
-
cd
|
| 68 |
python -m venv venv && source venv/bin/activate # Windows: venv\Scripts\activate
|
| 69 |
pip install -r requirements.txt
|
| 70 |
|
|
@@ -144,7 +146,7 @@ The script provisions a resource group, builds and pushes the image via ACR Task
|
|
| 144 |
## Project structure
|
| 145 |
|
| 146 |
```
|
| 147 |
-
|
| 148 |
├── main.py # FastAPI app — endpoints, lifespan, CORS
|
| 149 |
├── rag.py # Model loading, 4-bit config, LangChain RAG chain
|
| 150 |
├── embedder.py # sentence-transformers singleton wrapper
|
|
@@ -153,7 +155,9 @@ llmops-serve/
|
|
| 153 |
├── Dockerfile
|
| 154 |
├── deploy_azure.sh # One-shot Azure Container Apps deploy
|
| 155 |
├── requirements.txt
|
| 156 |
-
|
|
|
|
|
|
|
| 157 |
```
|
| 158 |
|
| 159 |
---
|
|
|
|
| 1 |
+
# Irminsul
|
| 2 |
|
| 3 |
> Fine-tuned Llama 3.1 8B · QLoRA · Pinecone RAG · FastAPI · Azure Container Apps
|
| 4 |
|
| 5 |
A full end-to-end LLMOps serving stack — from a QLoRA fine-tuned model running in 4-bit NF4 on consumer hardware, through a retrieval-augmented generation pipeline, to a containerized API deployed on Azure. Built to be production-shaped, not just a demo.
|
| 6 |
|
| 7 |
+
**[→ Live Demo](https://mukulray1603.github.io/Irminsul/demo.html)**
|
| 8 |
+
|
| 9 |
---
|
| 10 |
|
| 11 |
## What this is
|
|
|
|
| 66 |
```bash
|
| 67 |
# 1. Clone and set up environment
|
| 68 |
git clone https://github.com/MukulRay1603/Irminsul.git
|
| 69 |
+
cd Irminsul
|
| 70 |
python -m venv venv && source venv/bin/activate # Windows: venv\Scripts\activate
|
| 71 |
pip install -r requirements.txt
|
| 72 |
|
|
|
|
| 146 |
## Project structure
|
| 147 |
|
| 148 |
```
|
| 149 |
+
Irminsul/
|
| 150 |
├── main.py # FastAPI app — endpoints, lifespan, CORS
|
| 151 |
├── rag.py # Model loading, 4-bit config, LangChain RAG chain
|
| 152 |
├── embedder.py # sentence-transformers singleton wrapper
|
|
|
|
| 155 |
├── Dockerfile
|
| 156 |
├── deploy_azure.sh # One-shot Azure Container Apps deploy
|
| 157 |
├── requirements.txt
|
| 158 |
+
├── .env.example
|
| 159 |
+
└── docs/ # Corpus + GitHub Pages demo
|
| 160 |
+
└── demo.html
|
| 161 |
```
|
| 162 |
|
| 163 |
---
|
docs/demo.html
CHANGED
|
@@ -177,7 +177,7 @@
|
|
| 177 |
</main>
|
| 178 |
|
| 179 |
<footer>
|
| 180 |
-
static demo · <a href="https://github.com/
|
| 181 |
</footer>
|
| 182 |
|
| 183 |
<script>
|
|
|
|
| 177 |
</main>
|
| 178 |
|
| 179 |
<footer>
|
| 180 |
+
static demo · <a href="https://github.com/MukulRay1603/Irminsul" target="_blank">github</a> · llmops-serve · Llama 3.1 8B QLoRA
|
| 181 |
</footer>
|
| 182 |
|
| 183 |
<script>
|
pyvenv.cfg
ADDED
|
@@ -0,0 +1,5 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
home = C:\Users\mukul\AppData\Local\Programs\Python\Python312
|
| 2 |
+
include-system-site-packages = false
|
| 3 |
+
version = 3.12.9
|
| 4 |
+
executable = C:\Users\mukul\AppData\Local\Programs\Python\Python312\python.exe
|
| 5 |
+
command = C:\Users\mukul\AppData\Local\Programs\Python\Python312\python.exe -m venv E:\Projects\llmops-serve\venv
|