Instructions to use typeof/miqu-70b-split with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use typeof/miqu-70b-split with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("typeof/miqu-70b-split") model = AutoModelForCausalLM.from_pretrained("typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use typeof/miqu-70b-split with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "typeof/miqu-70b-split" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/typeof/miqu-70b-split
- SGLang
How to use typeof/miqu-70b-split with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use typeof/miqu-70b-split with Docker Model Runner:
docker model run hf.co/typeof/miqu-70b-split
init
Browse files- model-00228-of-00723.safetensors +3 -0
- model-00229-of-00723.safetensors +3 -0
- model-00230-of-00723.safetensors +3 -0
- model-00231-of-00723.safetensors +3 -0
- model-00232-of-00723.safetensors +3 -0
- model-00233-of-00723.safetensors +3 -0
- model-00234-of-00723.safetensors +3 -0
- model-00235-of-00723.safetensors +3 -0
- model-00236-of-00723.safetensors +3 -0
- model-00237-of-00723.safetensors +3 -0
- model-00238-of-00723.safetensors +3 -0
- model-00239-of-00723.safetensors +3 -0
- model-00240-of-00723.safetensors +3 -0
- model-00241-of-00723.safetensors +3 -0
- model-00242-of-00723.safetensors +3 -0
- model-00243-of-00723.safetensors +3 -0
- model-00244-of-00723.safetensors +3 -0
- model-00245-of-00723.safetensors +3 -0
- model-00248-of-00723.safetensors +3 -0
- model-00249-of-00723.safetensors +3 -0
- model-00251-of-00723.safetensors +3 -0
- model-00252-of-00723.safetensors +3 -0
- model-00253-of-00723.safetensors +3 -0
- model-00254-of-00723.safetensors +3 -0
model-00228-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5cf06234d8ef8ba5188ba0d335b33f98ed625260a562c76dbebbb302b05088c6
|
| 3 |
+
size 16528
|
model-00229-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:778626f546bc98d681c78d4d76420da57a8c4224b0677127c2798eb0c42e71a6
|
| 3 |
+
size 469762200
|
model-00230-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e0750df82e022d23e22db8838abca8d34c313d6a4677617bd3f3a3208d12a2f
|
| 3 |
+
size 469762200
|
model-00231-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:475d0638c6579998034d62ad69051a2e9a1ed1447ceba610c5901480961e3793
|
| 3 |
+
size 469762192
|
model-00232-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4d85b60fa0524d3d618b5f3b41f21773b7c491710baa799a8f3982b4ff0f2f56
|
| 3 |
+
size 16536
|
model-00233-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0442962f9221e0fbe896aaf832c63ed777bf0416d7bfc3fe212578d8204b97d4
|
| 3 |
+
size 16777368
|
model-00234-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:45fa63ce5e8b54de8e5be54c0f88bd79d791980436723e7d4cac728af09cd6c9
|
| 3 |
+
size 134217880
|
model-00235-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cc79643bde7eddd43e6c578030278bc47922b6f53b998f337ab9f942aa8dd6b0
|
| 3 |
+
size 134217880
|
model-00236-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e689bfc8f05cf2ab165d19973d578ee7b06bf2fe359f63fa0caebff13889c9d
|
| 3 |
+
size 16777368
|
model-00237-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ccb4b68a73e48e91b9ce63935d5d77ea6bb966bcdd066e3995793ed9f1b12d75
|
| 3 |
+
size 16528
|
model-00238-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ba2634107b5ca99be58eca00e5e3d20b2c2118e0135ef516401d6207ecfcda47
|
| 3 |
+
size 469762200
|
model-00239-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3a44d7923d4b3313b6dff311bb3694e144be3a2fb52d3a7d51e19889710b9997
|
| 3 |
+
size 469762200
|
model-00240-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:514571852c87bbf0e8e44ca71ce330923903397c56721ef36dc75f03866125a6
|
| 3 |
+
size 469762192
|
model-00241-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a1f5716bfe7b3af9af55ff7d0353a1b56466da62594323983a85f590eb0f1875
|
| 3 |
+
size 16536
|
model-00242-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9b8644720deee6703d51f0dc4d47c9500bc1ecf79aeddc900c7e9310ef44a6c1
|
| 3 |
+
size 16777368
|
model-00243-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b16318dd513a7568d44f54ca0abdbc5d6aaf9dff8ed2f862dfa2a8ea992e8ba9
|
| 3 |
+
size 134217880
|
model-00244-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1ae41c6a9765ef0990517903b423afa30eeef7f0f1a0c606edecc7c1dd8c1249
|
| 3 |
+
size 134217880
|
model-00245-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a35f460def75c5b01e8c2032882195005393d47c5be4b338d89086ecc8f5022d
|
| 3 |
+
size 16777368
|
model-00248-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cf82adb3367c77a52ddb197fce7f9ab364d80a2a550b378323cbd81b2665eff6
|
| 3 |
+
size 469762200
|
model-00249-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fe7112b2dcb24c1fea58cc7420ccce28274aa9ee89a7e93a69bd0f4821f25799
|
| 3 |
+
size 469762192
|
model-00251-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2ffbb5e137ef46346016465858c3cbf0d9c9cd317df970f4799f9b24f387424d
|
| 3 |
+
size 16777368
|
model-00252-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ef0f77f1a76729bb754820aac91040118cdf0e2134425b088227e982772fc491
|
| 3 |
+
size 134217880
|
model-00253-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5ca82573ff096de5d8809206cde1500ee0ec955d012c23135a532d1d446e3c59
|
| 3 |
+
size 134217880
|
model-00254-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3cb840a08952d93b62b52938e8c4742211b8831ad0a87deabead5abf83e8581b
|
| 3 |
+
size 16777368
|