Instructions to use typeof/miqu-70b-split with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use typeof/miqu-70b-split with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("typeof/miqu-70b-split") model = AutoModelForCausalLM.from_pretrained("typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use typeof/miqu-70b-split with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "typeof/miqu-70b-split" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/typeof/miqu-70b-split
- SGLang
How to use typeof/miqu-70b-split with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use typeof/miqu-70b-split with Docker Model Runner:
docker model run hf.co/typeof/miqu-70b-split
init
Browse files- model-00453-of-00723.safetensors +3 -0
- model-00454-of-00723.safetensors +3 -0
- model-00455-of-00723.safetensors +3 -0
- model-00456-of-00723.safetensors +3 -0
- model-00457-of-00723.safetensors +3 -0
- model-00459-of-00723.safetensors +3 -0
- model-00462-of-00723.safetensors +3 -0
- model-00463-of-00723.safetensors +3 -0
- model-00464-of-00723.safetensors +3 -0
- model-00465-of-00723.safetensors +3 -0
- model-00466-of-00723.safetensors +3 -0
- model-00467-of-00723.safetensors +3 -0
- model-00468-of-00723.safetensors +3 -0
- model-00469-of-00723.safetensors +3 -0
- model-00470-of-00723.safetensors +3 -0
- model-00471-of-00723.safetensors +3 -0
- model-00472-of-00723.safetensors +3 -0
- model-00473-of-00723.safetensors +3 -0
- model-00474-of-00723.safetensors +3 -0
- model-00475-of-00723.safetensors +3 -0
- model-00476-of-00723.safetensors +3 -0
- model-00477-of-00723.safetensors +3 -0
- model-00478-of-00723.safetensors +3 -0
- model-00479-of-00723.safetensors +3 -0
model-00453-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c38c33b204b47d8c6a292846386391e3471c2ce657ccb0f546026df5c85a2451
|
| 3 |
+
size 16528
|
model-00454-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ea626729051d30d1d62873272b94c41a6476761d80d2a8fe3ca1da388726352e
|
| 3 |
+
size 469762200
|
model-00455-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9e1ebc45086b51c281d1bf8fe3548772ad841b03a3ad720cfd89fb55fc3e742d
|
| 3 |
+
size 469762200
|
model-00456-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:195d39bcfc7a445ffe1d0abd6fe4f2ce8a15eb5a8456ff70e304a15a9473f384
|
| 3 |
+
size 469762192
|
model-00457-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:adda4669d85119be0ebb8ce0c0fb67ecae1471537f075b5563811c6a6e1dbd69
|
| 3 |
+
size 16536
|
model-00459-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:149d9d11134203b67ebb160d45ef176916e84c219fb757ffa8708c974c46a080
|
| 3 |
+
size 134217880
|
model-00462-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:53280792e601457e90b7fb1c0dd4c2d7307ad674498d647110330c14a665e776
|
| 3 |
+
size 16528
|
model-00463-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:67c59b3153ed0da9bfd5b2c34886141b99e863f98910563c8aa9475f748ea03b
|
| 3 |
+
size 469762200
|
model-00464-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3e43a97595a46062018e25d04ccd975049bf99766d63261943d571fa3d054759
|
| 3 |
+
size 469762200
|
model-00465-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a74c26034f7cb8fe4cf98ac210f6e02a8d958f61feda3ac8fcb21e0f9ea4ce8e
|
| 3 |
+
size 469762192
|
model-00466-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7f0915815a37e82ce210809fb275e648cdc2ac1fcd317d549e155a9cab700a20
|
| 3 |
+
size 16536
|
model-00467-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9f6a86cfb03c0c4db23a9a69690ab58c394195b30e39a1d74b524a46fc78e032
|
| 3 |
+
size 16777368
|
model-00468-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1d0115ef23d06633c6d98dca7c317c3ab1f5032beca82b7a07568af9a71e7709
|
| 3 |
+
size 134217880
|
model-00469-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7bcac763a40a5b68ee3d24bc5a6134e04e85e2af854317fd28e81b8ebe35da90
|
| 3 |
+
size 134217880
|
model-00470-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e599177fd087fc0785588617122bd0482d53e3b1e16c3801eff761b0e325a8bd
|
| 3 |
+
size 16777368
|
model-00471-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d13abda3d449e830a4f53ea3a0b8d27649e72a8bfb68040465ddae3cfe284cc0
|
| 3 |
+
size 16528
|
model-00472-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:06a38b16796d06a3703d1b0922064910d24c02fa4958b60e40d59513583168d8
|
| 3 |
+
size 469762200
|
model-00473-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bb8cf2ce7dc00a8f91a8aff31db6daf98ba81509a6c91747a2ad82d0ae5183be
|
| 3 |
+
size 469762200
|
model-00474-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:81a31d3f94f20ae601a55ad4f22d975bcc617233f1a1d7b13e555462ea413867
|
| 3 |
+
size 469762192
|
model-00475-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b756cbf10e6cd4548ab8f7277b488b204926bfca6e5cbe355fa9c9aea4355012
|
| 3 |
+
size 16536
|
model-00476-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3eb255207e78a650ad8dde01c6f2efa9b779db5e60e124f63b88d9400696fb54
|
| 3 |
+
size 16777368
|
model-00477-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0c2a38d21a169ffb88a879ad10eb9fc8d3c316a0490d17412e6e700a7a3d6505
|
| 3 |
+
size 134217880
|
model-00478-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1faf8bbefe12aa315c1055fdd62f25df1d9178b2574f64ea1585e88a0f7abb6c
|
| 3 |
+
size 134217880
|
model-00479-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:461f902e4707cce119380f6ed9458308901efbe1613018177f3b80111218c2b6
|
| 3 |
+
size 16777368
|