sitsope commited on
Commit
6933c9e
·
verified ·
1 Parent(s): ffbe643

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. .gitignore +1 -1
  2. README.md +0 -29
.gitignore CHANGED
@@ -4,7 +4,7 @@ __pycache__/
4
  *.pyc
5
 
6
  # Local models / data
7
- models/
8
  data/index.faiss
9
  data/docstore.json
10
 
 
4
  *.pyc
5
 
6
  # Local models / data
7
+
8
  data/index.faiss
9
  data/docstore.json
10
 
README.md CHANGED
@@ -1,32 +1,3 @@
1
- ---
2
- language: en
3
- license: mit
4
- tags:
5
- - gguf
6
- - quantized
7
- - phi-3
8
- - llama-cpp
9
- library_name: gguf
10
- pipeline_tag: text-generation
11
- ---
12
-
13
- # Phi-3 Mini 4K Instruct – 4-bit GGUF
14
-
15
- 4-bit quantized GGUF for efficient inference (low-cost servers).
16
-
17
- ## Files
18
- - `Phi-3-mini-4k-instruct-q4.gguf`
19
-
20
- ## Usage (llama.cpp)
21
- ```bash
22
- ./main -m Phi-3-mini-4k-instruct-q4.gguf -p "Hello"
23
- ```
24
-
25
- ## License
26
- MIT (project code). Base model license applies to the original Phi-3 weights.
27
-
28
-
29
-
30
  # Quantized LLM + RAG (FastAPI + FAISS + Phi‑3)
31
 
32
  ## Goal
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # Quantized LLM + RAG (FastAPI + FAISS + Phi‑3)
2
 
3
  ## Goal