robgreenberg3 commited on
Commit
340e252
·
verified ·
1 Parent(s): 3ba651e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -5
README.md CHANGED
@@ -1,7 +1,4 @@
1
  ---
2
- tags:
3
- - int4
4
- - vllm
5
  language:
6
  - en
7
  - de
@@ -11,9 +8,32 @@ language:
11
  - hi
12
  - es
13
  - th
 
 
14
  pipeline_tag: text-generation
15
- license: llama3.1
16
- base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  ---
18
  <h1 style="display: flex; align-items: center; gap: 10px; margin: 0;">
19
  Meta-Llama-3.1-8B-Instruct-quantized.w4a16
 
1
  ---
 
 
 
2
  language:
3
  - en
4
  - de
 
8
  - hi
9
  - es
10
  - th
11
+ base_model:
12
+ - meta-llama/Llama-3.1-8B-Instruct
13
  pipeline_tag: text-generation
14
+ tags:
15
+ - llama
16
+ - facebook
17
+ - meta
18
+ - llama-3
19
+ - int4
20
+ - vllm
21
+ - chat
22
+ - neuralmagic
23
+ - llmcompressor
24
+ - conversational
25
+ - 4-bit precision
26
+ - gptq
27
+ - compressed-tensors
28
+ license: other
29
+ license_name: llama3.1
30
+ name: RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
31
+ description: This model was obtained by quantizing the weights of Meta-Llama-3.1-8B-Instruct to INT4 data type.
32
+ readme: https://huggingface.co/RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16/main/README.md
33
+ tasks:
34
+ - text-to-text
35
+ provider: Meta
36
+ license_link: https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE
37
  ---
38
  <h1 style="display: flex; align-items: center; gap: 10px; margin: 0;">
39
  Meta-Llama-3.1-8B-Instruct-quantized.w4a16