Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,3 +1,37 @@
 # Qwen2.5-0.5B-Instruct AWQ + FP8_DYNAMIC
 This is a quantized version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) using AWQ + FP8_DYNAMIC quantization scheme.

+---
+language:
+- en
+license: apache-2.0
+library_name: transformers
+tags:
+- quantization
+- awq
+- fp8
+- llm-compressor
+- vllm
+- model-compression
+- qwen2.5
+base_model: Qwen/Qwen2.5-0.5B-Instruct
+datasets:
+- gsm8k
+model-index:
+- name: Qwen2.5-0.5B-AWQ-FP8-Dynamic
+  results:
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: GSM8K
+      type: gsm8k
+    metrics:
+    - type: exact_match
+      value: 22.67
+      name: Strict Match
+    - type: flexible_extract
+      value: 30.78
+      name: Flexible Extract
+---
 # Qwen2.5-0.5B-Instruct AWQ + FP8_DYNAMIC
 This is a quantized version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) using AWQ + FP8_DYNAMIC quantization scheme.