Instructions to use typeof/miqu-70b-split with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use typeof/miqu-70b-split with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="typeof/miqu-70b-split")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
```

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("typeof/miqu-70b-split")
model = AutoModelForCausalLM.from_pretrained("typeof/miqu-70b-split")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
# Decode only the newly generated tokens, skipping the prompt
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use typeof/miqu-70b-split with vLLM:
Install from pip and serve model
```shell
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "typeof/miqu-70b-split"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "typeof/miqu-70b-split",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
Use Docker
docker model run hf.co/typeof/miqu-70b-split
- SGLang
How to use typeof/miqu-70b-split with SGLang:
Install from pip and serve model
```shell
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "typeof/miqu-70b-split" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "typeof/miqu-70b-split",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
Use Docker images
```shell
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "typeof/miqu-70b-split" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "typeof/miqu-70b-split",
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'
```
- Docker Model Runner
How to use typeof/miqu-70b-split with Docker Model Runner:
docker model run hf.co/typeof/miqu-70b-split
init
- model-00678-of-00723.safetensors +3 -0
- model-00679-of-00723.safetensors +3 -0
- model-00680-of-00723.safetensors +3 -0
- model-00681-of-00723.safetensors +3 -0
- model-00682-of-00723.safetensors +3 -0
- model-00687-of-00723.safetensors +3 -0
- model-00688-of-00723.safetensors +3 -0
- model-00689-of-00723.safetensors +3 -0
- model-00690-of-00723.safetensors +3 -0
- model-00691-of-00723.safetensors +3 -0
- model-00692-of-00723.safetensors +3 -0
- model-00693-of-00723.safetensors +3 -0
- model-00694-of-00723.safetensors +3 -0
- model-00695-of-00723.safetensors +3 -0
- model-00696-of-00723.safetensors +3 -0
- model-00697-of-00723.safetensors +3 -0
- model-00698-of-00723.safetensors +3 -0
- model-00699-of-00723.safetensors +3 -0
- model-00700-of-00723.safetensors +3 -0
- model-00701-of-00723.safetensors +3 -0
- model-00702-of-00723.safetensors +3 -0
- model-00703-of-00723.safetensors +3 -0
- model-00704-of-00723.safetensors +3 -0
- model-00710-of-00723.safetensors +3 -0
- model-00712-of-00723.safetensors +3 -0
- model-00713-of-00723.safetensors +3 -0
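The shard indices in this commit are not contiguous (for example, 682 is followed by 687). A minimal sketch, assuming the `model-NNNNN-of-NNNNN.safetensors` naming seen above, of how one might find which indices are absent from a list of shard names. The helper names `shard_index` and `missing_between` are hypothetical and not part of any library; a gap in this commit does not by itself mean the shard is missing from the repository, since other commits may add it.

```python
import re

# Pattern assumed from the file names listed in this commit.
SHARD_RE = re.compile(r"model-(\d{5})-of-(\d{5})\.safetensors")


def shard_index(name: str) -> tuple[int, int]:
    """Return (index, total) parsed from a shard file name."""
    match = SHARD_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"not a shard file name: {name!r}")
    return int(match.group(1)), int(match.group(2))


def missing_between(names: list[str]) -> list[int]:
    """Return indices absent between the lowest and highest listed shard."""
    indices = sorted(shard_index(n)[0] for n in names)
    present = set(indices)
    return [i for i in range(indices[0], indices[-1] + 1) if i not in present]


if __name__ == "__main__":
    listed = [
        "model-00682-of-00723.safetensors",
        "model-00687-of-00723.safetensors",
    ]
    print(missing_between(listed))
```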
model-00678-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:50561dcb685d15827a7a17a9ede406fa2ee2031ecca128a146e59023c975b68f
+size 16528
model-00679-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7dbe4e2feea5abfe9b2b4d42a94e22d0b925b7c2318ccf665689643888da1fe8
+size 469762200
model-00680-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6abc0bbd2b9e0b5ad495d6de866ef64c8057c4a730cbc90ca79fa1b0bd38d587
+size 469762200
model-00681-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e586e653b6baa4b7f8495a07a06d8b85b993288ca572c962dfad9e6e2c13daea
+size 469762192
model-00682-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bebe38147917fa6ee2c038cbc1c1caa913ebd550d00adca2758348978464532b
+size 16536
model-00687-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3de5b0fc5c6745ed25159b208f72c7f1956a9b2f7f71394d550dee791ce598c4
+size 16528
model-00688-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b07454d7f5e35be2ae6a3b388f63b2a3c409caea9d706ad7527440a7e3df9c77
+size 469762200
model-00689-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:339110313f921e11f16e01a5fe17425f96160e22115cc972a5efe564b7aa4e4e
+size 469762200
model-00690-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8125098542b2d980795bf9bc15d8899eb8315d05323694abcd7c9b0722b7f3b9
+size 469762192
model-00691-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d20465be93c3a5018f3e87141f7d24b2b28ed4c09004926b99084fc692094288
+size 16536
model-00692-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f455c7b229f180fe620aa186432c255d51e3c8d6bfa917bbd16f1659667a03b0
+size 16777368
model-00693-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8d86020fd23735e7268b2b2361551bcb855ad90b8d204cc91781ab3938ad0025
+size 134217880
model-00694-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:049ecb720d3135c6ba9004c06f0f3f49aa729064166721d12117c5854da3e8a1
+size 134217880
model-00695-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:701ceaea47f9583fc9a52f4c52bffbd092d7e1b428147902393753c5909fccb5
+size 16777368
model-00696-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ed8491712af1d64652db8c9d0da16c12f2ea9347b6000d2bb0858977e0f89985
+size 16528
model-00697-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:db28d7d37720dc38f79b5e966a1ccfccfb3f8a980ac5ad6969bfd1b2bc7551fe
+size 469762200
model-00698-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6911170cdad815323f00155f18edec055933ce4557b374ed3a01ad5e6d8483d5
+size 469762200
model-00699-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5e9c200d9a049f02c6aeca9fbd85a2b376ef3fd39e44abfac372660a91cdd4aa
+size 469762192
model-00700-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d7f20b78f3f7eb4c2ec9e543d09a4a63aa3763528a2bce3abaf24e742ed60724
+size 16536
model-00701-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:03596da03479bd0cf94a9c1213bc800b8e9f95ff47af7bcb9d4a2e177616febf
+size 16777368
model-00702-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e493b88a35f98a55ac48ab5d58d111de17caf8ac875d381a5478b08d7711cf04
+size 134217880
model-00703-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:82e94069fca4972c5a38f240d3d793d254f995a95a64e2aabdec30283a095c2e
+size 134217880
model-00704-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9ecbb1153389a01eb5c2ad59a05f9ee688b9d2e618784618a9490b825129ca86
+size 16777368
model-00710-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:18f03e499ece55f0fa71f1dc31010080f1f4cd5595a5326d29f6711cbc699f74
+size 16777368
model-00712-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:df8712ed4dce6fc9adb3d82550ca50b7232fd3f685382041b9fcf99cc4b396f7
+size 134217880
model-00713-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:48a27ad335b0b1e47406bfed1ca54fbf8bd026ced16ca935cca15d36ba2d1dec
+size 16777368
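Each file added in this commit is a three-line Git LFS pointer (`version`, `oid`, `size`) rather than the tensor data itself. As a rough illustration, the sketch below parses such a pointer into its fields. The function name `parse_lfs_pointer` and the returned dictionary keys are hypothetical choices for this example, and the parser assumes a well-formed pointer containing those three keys.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file.

    Assumes each non-empty line has the form "<key> <value>" and that the
    keys "version", "oid", and "size" are all present.
    """
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object, in bytes
    }


if __name__ == "__main__":
    # Pointer contents copied from model-00713-of-00723.safetensors above.
    pointer = (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:48a27ad335b0b1e47406bfed1ca54fbf8bd026ced16ca935cca15d36ba2d1dec\n"
        "size 16777368\n"
    )
    print(parse_lfs_pointer(pointer))
```

The `size` field is the byte length of the object stored in LFS, so summing it across pointers gives the download size without fetching any shard.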