---
license: apache-2.0
---

The models in this repository can be loaded with the [InferLLM](https://github.com/MegEngine/InferLLM) project.

The Chinese Alpaca model is from https://github.com/ymcui/Chinese-LLaMA-Alpaca

The ggml Alpaca model is from https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml/tree/main

These two models can also be loaded with the [llama.cpp](https://github.com/ggerganov/llama.cpp) project.

InferLLM supports the ChatGLM model; chatglm-q4 is the int4-quantized version of [chatglm-6b](https://huggingface.co/THUDM/chatglm-6b).

InferLLM also supports the Baichuan model; baichuan-q4 is the int4-quantized version of [baichuan-vicuna-7b](https://huggingface.co/fireballoon/baichuan-vicuna-7b).