Instructions to use typeof/miqu-70b-split with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use typeof/miqu-70b-split with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("typeof/miqu-70b-split") model = AutoModelForCausalLM.from_pretrained("typeof/miqu-70b-split") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use typeof/miqu-70b-split with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "typeof/miqu-70b-split" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/typeof/miqu-70b-split
- SGLang
How to use typeof/miqu-70b-split with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "typeof/miqu-70b-split" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "typeof/miqu-70b-split", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use typeof/miqu-70b-split with Docker Model Runner:
docker model run hf.co/typeof/miqu-70b-split
init
Browse files- model-00399-of-00723.safetensors +3 -0
- model-00400-of-00723.safetensors +3 -0
- model-00402-of-00723.safetensors +3 -0
- model-00403-of-00723.safetensors +3 -0
- model-00408-of-00723.safetensors +3 -0
- model-00409-of-00723.safetensors +3 -0
- model-00410-of-00723.safetensors +3 -0
- model-00411-of-00723.safetensors +3 -0
- model-00412-of-00723.safetensors +3 -0
- model-00413-of-00723.safetensors +3 -0
- model-00414-of-00723.safetensors +3 -0
- model-00415-of-00723.safetensors +3 -0
- model-00416-of-00723.safetensors +3 -0
- model-00417-of-00723.safetensors +3 -0
- model-00418-of-00723.safetensors +3 -0
- model-00419-of-00723.safetensors +3 -0
- model-00420-of-00723.safetensors +3 -0
- model-00421-of-00723.safetensors +3 -0
- model-00422-of-00723.safetensors +3 -0
- model-00423-of-00723.safetensors +3 -0
- model-00424-of-00723.safetensors +3 -0
- model-00425-of-00723.safetensors +3 -0
- model-00431-of-00723.safetensors +3 -0
- model-00432-of-00723.safetensors +3 -0
- model-00433-of-00723.safetensors +3 -0
- model-00434-of-00723.safetensors +3 -0
model-00399-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e0ee6d5e273b98482b110d7258481038b24b42d7d9e91e366400587e27af911f
|
| 3 |
+
size 16528
|
model-00400-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c699018987ea4cbd650beb91a175b7742ab5a9e8b858dd6b10829cf571e8328a
|
| 3 |
+
size 469762200
|
model-00402-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4b08bce457e206994c992f9d727996eab12c19a7c9ff0818d80cba6315a1bd2c
|
| 3 |
+
size 469762192
|
model-00403-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c0c593475ca142dee113e66747bd6dfc5a3f3efbe1b47179618bb3849bea0778
|
| 3 |
+
size 16536
|
model-00408-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e71b13c60925f5fbe6dad7ab374c422b7ebb2e4185604afc04f57e8eddc23951
|
| 3 |
+
size 16528
|
model-00409-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:097ac3cd9b8611dc7551bbdb77b7402e7faf20468cc28815c658f4a1a5c767b1
|
| 3 |
+
size 469762200
|
model-00410-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:646ce2d771eeaff82cfd8057821be24cdbf780b425b04ebe466874981444b866
|
| 3 |
+
size 469762200
|
model-00411-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cfb71eb7e8dbebe1e13c90ebe915fb8f8098094f02938c868a470c5299a0c65d
|
| 3 |
+
size 469762192
|
model-00412-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:16477e11c4b6f43327e02e3f4c6265de850d8e02403723835d65d07411276043
|
| 3 |
+
size 16536
|
model-00413-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b716afcb8f3be26be3099849aaf8fa4afff58dc049d17fedcb56838cfb4b5585
|
| 3 |
+
size 16777368
|
model-00414-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:467f3d94914cd321580c4892780a2c0d679c599575de41476d59487910ab6569
|
| 3 |
+
size 134217880
|
model-00415-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f04491546c11395a2c5652baeea5128d3aa86444a93e2a1103b59cc44ff812ca
|
| 3 |
+
size 134217880
|
model-00416-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4f67100eb6bd45c07ed565df04656938e0a586919e66a30248ed2d386aa40c62
|
| 3 |
+
size 16777368
|
model-00417-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2cd599966b5260d03565970ca189cbd39e8c00b18266dfe462fe1df13171c8a5
|
| 3 |
+
size 16528
|
model-00418-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ec71046aa6ce675e4b74a8f885917566d587ece92f497cd0b36f401d1451dfbb
|
| 3 |
+
size 469762200
|
model-00419-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:99eaa14d8f35eb9fe9e6998b4130931e3b86890af1d65397858a72928e88e44b
|
| 3 |
+
size 469762200
|
model-00420-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d70674bb21c6defc7b985ab420ade5b2c775a7c5bda7fd1addfd64799ea3e65e
|
| 3 |
+
size 469762192
|
model-00421-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f4289f0ca553e7f950bd582edc5db6daba5b61fa7333b3d996889e2909649076
|
| 3 |
+
size 16536
|
model-00422-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8286d2afbafc3e670d32c466fad9d747393ae487adba4b4edb3ca213bec766c6
|
| 3 |
+
size 16777368
|
model-00423-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:51fd28b5abba78bbeeca0f4a9188b7d58bd177c119d5d4b0e35da853698a3b1c
|
| 3 |
+
size 134217880
|
model-00424-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a475028c35db149209b627b9e89291fa3f7db1955a5c1d50b6c0054fdcefcada
|
| 3 |
+
size 134217880
|
model-00425-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ae57b44d6fdc5ae5c261153ea77fd1e9d2ee2f88c04126cb94f17590c5936c03
|
| 3 |
+
size 16777368
|
model-00431-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e74434a9911337047126e9139e057bb9202d7e3a34ade8348ef45abb32fe0460
|
| 3 |
+
size 16777368
|
model-00432-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f6eb81471ea151112dac34dabdd6892dd240168fb56fca4b8bc40f52e9ecd8d1
|
| 3 |
+
size 134217880
|
model-00433-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9bd6fa9dca4350c1c855333adc90ae15ff848d70785872a1814573589764acbf
|
| 3 |
+
size 134217880
|
model-00434-of-00723.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6296af246f71179dbec2e5a96421a59e115fe36d8c8b5a97d518f3d0bd81bbae
|
| 3 |
+
size 16777368
|