MukulRay commited on
Commit
3774975
·
1 Parent(s): 69b8694

fix: rename README, update demo links, add live demo URL

Browse files
Files changed (3) hide show
  1. Readme.MD +8 -4
  2. docs/demo.html +1 -1
  3. pyvenv.cfg +5 -0
Readme.MD CHANGED
@@ -1,9 +1,11 @@
1
- # llmops-serve
2
 
3
  > Fine-tuned Llama 3.1 8B · QLoRA · Pinecone RAG · FastAPI · Azure Container Apps
4
 
5
  A full end-to-end LLMOps serving stack — from a QLoRA fine-tuned model running in 4-bit NF4 on consumer hardware, through a retrieval-augmented generation pipeline, to a containerized API deployed on Azure. Built to be production-shaped, not just a demo.
6
 
 
 
7
  ---
8
 
9
  ## What this is
@@ -64,7 +66,7 @@ Grounded answer + source attribution
64
  ```bash
65
  # 1. Clone and set up environment
66
  git clone https://github.com/MukulRay1603/Irminsul.git
67
- cd llmops-serve
68
  python -m venv venv && source venv/bin/activate # Windows: venv\Scripts\activate
69
  pip install -r requirements.txt
70
 
@@ -144,7 +146,7 @@ The script provisions a resource group, builds and pushes the image via ACR Task
144
  ## Project structure
145
 
146
  ```
147
- llmops-serve/
148
  ├── main.py # FastAPI app — endpoints, lifespan, CORS
149
  ├── rag.py # Model loading, 4-bit config, LangChain RAG chain
150
  ├── embedder.py # sentence-transformers singleton wrapper
@@ -153,7 +155,9 @@ llmops-serve/
153
  ├── Dockerfile
154
  ├── deploy_azure.sh # One-shot Azure Container Apps deploy
155
  ├── requirements.txt
156
- ── .env.example
 
 
157
  ```
158
 
159
  ---
 
1
+ # Irminsul
2
 
3
  > Fine-tuned Llama 3.1 8B · QLoRA · Pinecone RAG · FastAPI · Azure Container Apps
4
 
5
  A full end-to-end LLMOps serving stack — from a QLoRA fine-tuned model running in 4-bit NF4 on consumer hardware, through a retrieval-augmented generation pipeline, to a containerized API deployed on Azure. Built to be production-shaped, not just a demo.
6
 
7
+ **[→ Live Demo](https://mukulray1603.github.io/Irminsul/demo.html)**
8
+
9
  ---
10
 
11
  ## What this is
 
66
  ```bash
67
  # 1. Clone and set up environment
68
  git clone https://github.com/MukulRay1603/Irminsul.git
69
+ cd Irminsul
70
  python -m venv venv && source venv/bin/activate # Windows: venv\Scripts\activate
71
  pip install -r requirements.txt
72
 
 
146
  ## Project structure
147
 
148
  ```
149
+ Irminsul/
150
  ├── main.py # FastAPI app — endpoints, lifespan, CORS
151
  ├── rag.py # Model loading, 4-bit config, LangChain RAG chain
152
  ├── embedder.py # sentence-transformers singleton wrapper
 
155
  ├── Dockerfile
156
  ├── deploy_azure.sh # One-shot Azure Container Apps deploy
157
  ├── requirements.txt
158
+ ── .env.example
159
+ └── docs/ # Corpus + GitHub Pages demo
160
+ └── demo.html
161
  ```
162
 
163
  ---
docs/demo.html CHANGED
@@ -177,7 +177,7 @@
177
  </main>
178
 
179
  <footer>
180
- static demo · <a href="https://github.com/YOUR_USERNAME/llmops-serve" target="_blank">github</a> · llmops-serve · Llama 3.1 8B QLoRA
181
  </footer>
182
 
183
  <script>
 
177
  </main>
178
 
179
  <footer>
180
+ static demo · <a href="https://github.com/MukulRay1603/Irminsul" target="_blank">github</a> · llmops-serve · Llama 3.1 8B QLoRA
181
  </footer>
182
 
183
  <script>
pyvenv.cfg ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ home = C:\Users\mukul\AppData\Local\Programs\Python\Python312
2
+ include-system-site-packages = false
3
+ version = 3.12.9
4
+ executable = C:\Users\mukul\AppData\Local\Programs\Python\Python312\python.exe
5
+ command = C:\Users\mukul\AppData\Local\Programs\Python\Python312\python.exe -m venv E:\Projects\llmops-serve\venv