majchrow committed on
Commit de8a6b6 · verified · 1 Parent(s): 03c14e7

Add SGLang to model card

Files changed (1)
  1. README.md +10 -1
README.md CHANGED
@@ -11,6 +11,8 @@ tags:
  - vllm
  - vllm-omni
  - diffusers
+ - sglang
+ - sglang-diffusion
  - text, image, video, audio, and action generation
  - omnimodel
  countDownloads:
@@ -192,6 +194,7 @@ Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated sys
  - [PyTorch](https://github.com/nvidia/cosmos3)
  - [vLLM-Omni](https://github.com/vllm-project/vllm-omni)
  - [Hugging Face Diffusers](https://huggingface.co/docs/diffusers/en/index)
+ - [SGLang](https://sgl-project.github.io/)

  **Supported Hardware Microarchitecture Compatibility:**

@@ -941,6 +944,12 @@ Example output:

  <video controls width="1280" height="720" src="https://huggingface.co/nvidia/Cosmos3-Nano/resolve/main/assets/example_t2v_diffusers_output.mp4"></video>

+ ### SGLang
+
+ [SGLang Diffusion](https://sgl-project.github.io/diffusion) can serve `nvidia/Cosmos3-Nano` through OpenAI-compatible image and video generation endpoints.
+
+ For complete serving instructions and request examples, see the [Cosmos3 SGLang cookbook](https://lmsysorg.mintlify.app/cookbook/diffusion/Cosmos/Cosmos3).
+
  ## Limitations

  Cosmos3 may produce imperfect outputs in challenging scenarios. Generation artifacts include temporal inconsistency, unstable camera or object motion, imprecise physical interactions, inaccurate audio-video synchronization, and action-state drift — especially in long-horizon or high-resolution outputs. Reasoning may also be incorrect: object states, causal relationships, spatial geometry, temporal ordering, agent intent, and future outcomes can be misinferred, and complex or long-context inputs may yield hallucinated entities, inconsistent interpretations, or implausible predictions. Because the model lacks an explicit physics simulator, 3D geometry, 4D space-time evolution, object permanence, contact dynamics, and physical laws are only approximated — producing artifacts such as disappearing or morphing objects, unrealistic collisions, and physically implausible motions. Quality further degrades in out-of-distribution environments, safety-critical edge cases, and domains underrepresented in training.
@@ -949,7 +958,7 @@ Cosmos3 outputs should not be treated as physically accurate simulation, reliabl

  ## Inference

- **Acceleration Engine:** [PyTorch](https://pytorch.org/), [vLLM](https://github.com/vllm-project/vllm), [vLLM-Omni](https://github.com/vllm-project/vllm-omni), [Hugging Face Diffusers](https://github.com/huggingface/diffusers)
+ **Acceleration Engine:** [PyTorch](https://pytorch.org/), [vLLM](https://github.com/vllm-project/vllm), [vLLM-Omni](https://github.com/vllm-project/vllm-omni), [Hugging Face Diffusers](https://github.com/huggingface/diffusers), [SGLang](https://sgl-project.github.io/), [SGLang Diffusion](https://sgl-project.github.io/diffusion)

  **Test Hardware:** GB200 and H100
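
The `### SGLang` section added in this commit says SGLang Diffusion can serve `nvidia/Cosmos3-Nano` through OpenAI-compatible image and video generation endpoints. As a rough illustration only, the sketch below builds (without sending) an image generation request in the OpenAI request shape. The host, port, `/v1/images/generations` path, prompt, and `size` value are assumptions made for this example and are not taken from the model card; the linked cookbook is the place to confirm the actual serving command and supported parameters.

```python
import json
import urllib.request

# Assumption: a locally running SGLang Diffusion server. The host and port
# below are illustrative and do not come from the model card.
BASE_URL = "http://localhost:30000"
MODEL_ID = "nvidia/Cosmos3-Nano"  # model ID as named in the README


def build_image_request(prompt: str, size: str = "1280x720") -> urllib.request.Request:
    """Build, but do not send, an OpenAI-style image generation request.

    The path and payload fields follow the OpenAI images API shape; whether
    the server accepts every field (for example `size`) is an assumption.
    """
    payload = {"model": MODEL_ID, "prompt": prompt, "size": size, "n": 1}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/images/generations",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical prompt, used only to show the request shape.
req = build_image_request("A robot arm stacking two red blocks on a table")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing `req` to `urllib.request.urlopen` would send it once a server is actually listening at `BASE_URL`; the request is only constructed here so the example runs without a GPU or a running service.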