Update README.md
One-command demo to chat with quantized NVILA models via [llm-awq](https://github.com/mit-han-lab/llm-awq/tree/main/tinychat#:~:text=TinyChat%20support%20NVILA) (NVILA-8B as an example):

```bash
cd llm-awq/tinychat
python nvila_demo.py --model-path PATH/TO/NVILA \
    --quant_path NVILA-8B-w4-g128-awq-v2.pt \
    --media PATH/TO/ANY/IMAGES/VIDEOS \
    --act_scale_path NVILA-8B-VT-smooth-scale.pt \
    --all --chunk --model_type nvila
```

This command will download the quantized NVILA model and run a chat demo. If you’ve already downloaded the files, simply set the path to your local copies.
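For instance, if the model checkpoint, AWQ weights, and activation scales are already on disk, the same flags can simply point at local paths (the `checkpoints/` and `samples/` directories below are a hypothetical layout, not one the repo prescribes), and the demo should then skip downloading:

```shell
cd llm-awq/tinychat
# Same invocation as above, but every path now refers to a local copy.
python nvila_demo.py --model-path checkpoints/NVILA-8B \
    --quant_path checkpoints/NVILA-8B-w4-g128-awq-v2.pt \
    --media samples/demo.jpg \
    --act_scale_path checkpoints/NVILA-8B-VT-smooth-scale.pt \
    --all --chunk --model_type nvila
```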