init
- model-00327-of-00723.safetensors +3 -0
- model-00328-of-00723.safetensors +3 -0
- model-00329-of-00723.safetensors +3 -0
- model-00330-of-00723.safetensors +3 -0
- model-00331-of-00723.safetensors +3 -0
- model-00333-of-00723.safetensors +3 -0
- model-00336-of-00723.safetensors +3 -0
- model-00337-of-00723.safetensors +3 -0
- model-00338-of-00723.safetensors +3 -0
- model-00339-of-00723.safetensors +3 -0
- model-00340-of-00723.safetensors +3 -0
- model-00341-of-00723.safetensors +3 -0
- model-00342-of-00723.safetensors +3 -0
- model-00343-of-00723.safetensors +3 -0
- model-00344-of-00723.safetensors +3 -0
- model-00345-of-00723.safetensors +3 -0
- model-00346-of-00723.safetensors +3 -0
- model-00347-of-00723.safetensors +3 -0
- model-00348-of-00723.safetensors +3 -0
- model-00349-of-00723.safetensors +3 -0
- model-00350-of-00723.safetensors +3 -0
- model-00351-of-00723.safetensors +3 -0
- model-00352-of-00723.safetensors +3 -0
- model-00353-of-00723.safetensors +3 -0
model-00327-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3d99a2dc0ad80c235debb43c85d8c18f005f79b3a2e3be56380fffe6327623d5
+size 16528
model-00328-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dca2cdbad0c60fc48671843042ac50d4e2ff250de06f2e6a694e7e1055a17f13
+size 469762200
model-00329-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9c8afe14384550c2cd461f7d0f0ec54f964a38687102ee98770dae12d6c7b249
+size 469762200
model-00330-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:81f36482342336c93131dbd2b8129682c47e131cf38b62a68370fd162a7ebca0
+size 469762192
model-00331-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9ff6d5071215f8e9d5127231a8ce026652817b12f8c080eb825c7c564ec13fac
+size 16536
model-00333-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9c5cdc8b7dde3d95ae63bc0beb301beff4f4e74dbb6f1d8ee205cfec035ebaf7
+size 134217880
model-00336-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3f98829a1feca0ca8e17807f7042ce7dc40efc4eaae7b4f05938499292520f4a
+size 16528
model-00337-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a4b5f75447102a303d7ec0bb82942cbc7d2491743199310313fd78a64a2ebb2c
+size 469762200
model-00338-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8837802db88b98fcfc020b53f3932671e87520d9424323a0b26538bfb5788653
+size 469762200
model-00339-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a80b2ce74424f67f6d399d84c6229189ec1bf783542d6b3645e7f97a93374154
+size 469762192
model-00340-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a5ee6c60fbddb8c19278e526dc50b7a1ed7ca71429d1fff4d775123cbee01b83
+size 16536
model-00341-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6dad621c07a27c897c672356070ec02d74c761c116eb61bf37b58622c26c749c
+size 16777368
model-00342-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:78fd0fd4e041303d0ea85b5c94d6e099f0df9c75e8d72b9c6fa8ca21d57ff8a0
+size 134217880
model-00343-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d392c839599a8751b5655a337b9646efdc0f5735befbbffc6215dcffa7a317c6
+size 134217880
model-00344-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d483035ec2343a0cc4de50895656c3e98f8bec6ac9d666ed61b06e7285a05030
+size 16777368
model-00345-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:00f500a6ff0aca50f8b2a32d2235d659082577d54c1fb4ba30f4cf55335ef485
+size 16528
model-00346-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:38c12d309d83f342486af484b9bc3051a0d3e6eb079ad4afcff5f2fa0f72a137
+size 469762200
model-00347-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9f8c25d98fa8e3fb698eee4c412e3cd63ac70fd4ceae59752e44a12ee6e2faa1
+size 469762200
model-00348-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8252dc59ceec317920d6bd640fa6eda5c8cbc5b45b9181b1bb12392b42086ba1
+size 469762192
model-00349-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:78899bb6b51d150fcb230eba13638c7e56159c9a2ee0f76871ba3e54fcce2b7a
+size 16536
model-00350-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:14070cdcc7f3b77c2fa3d026f56c35150727e7a19e5523bba6b5372d86819553
+size 16777368
model-00351-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b947d0ec1935d864432cd9c4514b7e11007d658a4587638ae11cebb8f12e8368
+size 134217880
model-00352-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c8b87cae9f87ec3be165067fdddd4430b1b77192d888e3f5846f12b1cf58f5d0
+size 134217880
model-00353-of-00723.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4bb4bcfe54ef8fa22ba564785fec7387a61e84a32f2c16f4de4e0836b524d311
+size 16777368
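Each file added in this commit is a three-line Git LFS pointer (spec version, `oid`, `size`) rather than the tensor data itself. As a minimal sketch, assuming a shard has already been downloaded to a local path, a pointer could be parsed and checked against the file roughly as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library; the pointer text is the one recorded for model-00327-of-00723.safetensors.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the "key value" lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        # Read in chunks so large shards are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer for model-00327-of-00723.safetensors, copied from this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3d99a2dc0ad80c235debb43c85d8c18f005f79b3a2e3be56380fffe6327623d5
size 16528
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Checking the size first is a cheap way to reject a truncated download before hashing; `matches_pointer(Path("model-00327-of-00723.safetensors"), pointer)` would then confirm the digest, assuming the file sits in the working directory.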