Spaces:

Alovestocode
/

ZeroGPU-LLM-Inference

Sleeping

Alikestocode commited on Nov 10, 2025

Commit

7e31310

1 Parent(s): 4001f22

Fix oneshot() API: use correct parameter names

- Change 'model' to 'model_id'
- Change 'modifiers' to 'recipe_modifiers'
- Change 'calibration_data' to 'recipe_dataset'
- Change 'token' to 'hf_token'
- Fixes HfArgumentParser unrecognized keys error

Files changed (1) hide show

quantize_to_awq_colab.ipynb +12 -5

quantize_to_awq_colab.ipynb CHANGED Viewed

@@ -346,14 +346,21 @@
     "        print(f\"  ✅ AWQModifier created successfully\")\n",
     "        \n",
     "        # Call oneshot with the modifier\n",
     "        print(f\"  → Starting quantization process...\")\n",
     "        oneshot(\n",
-    "            model=repo_id,\n",
     "            output_dir=temp_output_dir,\n",
-    "            modifiers=modifiers,\n",
-    "            token=os.environ.get(\"HF_TOKEN\"),\n",
-    "            # Calibration data: list of strings\n",
-    "            calibration_data=calibration_texts[:min(calibration_dataset_size, 128)]\n",
     "        )\n",
     "        \n",
     "        print(f\"✅ Model quantized to AWQ successfully\")\n",

     "        print(f\"  ✅ AWQModifier created successfully\")\n",
     "        \n",
     "        # Call oneshot with the modifier\n",
+    "        # Note: oneshot() uses HfArgumentParser and expects specific parameter names\n",
+    "        # Use 'model_id' instead of 'model', and pass modifiers/calibration via recipe or separate args\n",
     "        print(f\"  → Starting quantization process...\")\n",
+    "        \n",
+    "        # Prepare calibration dataset (limit to reasonable size)\n",
+    "        calibration_dataset = calibration_texts[:min(calibration_dataset_size, 128)]\n",
+    "        \n",
+    "        # oneshot() API: model_id, output_dir, and recipe parameters\n",
+    "        # Pass modifiers and dataset via the recipe structure\n",
     "        oneshot(\n",
+    "            model_id=repo_id,\n",
     "            output_dir=temp_output_dir,\n",
+    "            recipe_modifiers=modifiers,\n",
+    "            recipe_dataset=calibration_dataset,\n",
+    "            hf_token=os.environ.get(\"HF_TOKEN\")\n",
     "        )\n",
     "        \n",
     "        print(f\"✅ Model quantized to AWQ successfully\")\n",