Spaces:

Deva1211
/

chatbot

Running

Deva1211 commited on Aug 15, 2025

Commit

8e85724

verified ·

1 Parent(s): d0ab12d

trying to use TheBloke/alpaca-lora-65B-GPTQ

Files changed (1) hide show

app.py CHANGED Viewed

@@ -10,7 +10,7 @@ print("Loading optimized Mistral model...")
 try:
     # First try: AWQ quantized model (best performance)
     print("🔄 Attempting to load AWQ model...")
-    tokenizer = AutoTokenizer.from_pretrained("TheBloke/Mistral-7B-Instruct-v0.2-AWQ")
     model = AutoModelForCausalLM.from_pretrained(
         "TheBloke/Mistral-7B-Instruct-v0.2-AWQ",
         device_map="auto",

 try:
     # First try: AWQ quantized model (best performance)
     print("🔄 Attempting to load AWQ model...")
+    tokenizer = AutoTokenizer.from_pretrained("TheBloke/alpaca-lora-65B-GPTQ")
     model = AutoModelForCausalLM.from_pretrained(
         "TheBloke/Mistral-7B-Instruct-v0.2-AWQ",
         device_map="auto",