majchrow commited on
Commit
5358703
·
verified ·
1 Parent(s): 0b900a0

Add SGLang to model card

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -11,6 +11,8 @@ tags:
11
  - vllm
12
  - vllm-omni
13
  - diffusers
 
 
14
  - text, image, video, audio, and action generation
15
  - omnimodel
16
  countDownloads:
@@ -212,6 +214,7 @@ Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated sys
212
  - [PyTorch](https://github.com/nvidia/cosmos3)
213
  - [vLLM-Omni](https://github.com/vllm-project/vllm-omni)
214
  - [Hugging Face Diffusers](https://huggingface.co/docs/diffusers/en/index)
 
215
 
216
  **Supported Hardware Microarchitecture Compatibility:**
217
 
@@ -966,6 +969,12 @@ Example output:
966
 
967
  <video controls width="1280" height="720" src="https://huggingface.co/nvidia/Cosmos3-Super/resolve/main/assets/example_t2v_diffusers_output.mp4"></video>
968
 
 
 
 
 
 
 
969
  ## Limitations
970
 
971
  Cosmos3 may produce imperfect outputs in challenging scenarios. Generation artifacts include temporal inconsistency, unstable camera or object motion, imprecise physical interactions, inaccurate audio-video synchronization, and action-state drift — especially in long-horizon or high-resolution outputs. Reasoning may also be incorrect: object states, causal relationships, spatial geometry, temporal ordering, agent intent, and future outcomes can be misinferred, and complex or long-context inputs may yield hallucinated entities, inconsistent interpretations, or implausible predictions. Because the model lacks an explicit physics simulator, 3D geometry, 4D space-time evolution, object permanence, contact dynamics, and physical laws are only approximated — producing artifacts such as disappearing or morphing objects, unrealistic collisions, and physically implausible motions. Quality further degrades in out-of-distribution environments, safety-critical edge cases, and domains underrepresented in training.
@@ -974,7 +983,7 @@ Cosmos3 outputs should not be treated as physically accurate simulation, reliabl
974
 
975
  ## Inference
976
 
977
- **Acceleration Engine:** [PyTorch](https://pytorch.org/), [vLLM](https://github.com/vllm-project/vllm), [vLLM-Omni](https://github.com/vllm-project/vllm-omni), [Hugging Face Diffusers](https://github.com/huggingface/diffusers)
978
 
979
  **Test Hardware:** GB200 and H100
980
 
 
11
  - vllm
12
  - vllm-omni
13
  - diffusers
14
+ - sglang
15
+ - sglang-diffusion
16
  - text, image, video, audio, and action generation
17
  - omnimodel
18
  countDownloads:
 
214
  - [PyTorch](https://github.com/nvidia/cosmos3)
215
  - [vLLM-Omni](https://github.com/vllm-project/vllm-omni)
216
  - [Hugging Face Diffusers](https://huggingface.co/docs/diffusers/en/index)
217
+ - [SGLang](https://sgl-project.github.io/)
218
 
219
  **Supported Hardware Microarchitecture Compatibility:**
220
 
 
969
 
970
  <video controls width="1280" height="720" src="https://huggingface.co/nvidia/Cosmos3-Super/resolve/main/assets/example_t2v_diffusers_output.mp4"></video>
971
 
972
+ ### SGLang
973
+
974
+ [SGLang Diffusion](https://sgl-project.github.io/diffusion) can serve `nvidia/Cosmos3-Super` through OpenAI-compatible image and video generation endpoints.
975
+
976
+ For complete serving instructions and request examples, see the [Cosmos3 SGLang cookbook](https://lmsysorg.mintlify.app/cookbook/diffusion/Cosmos/Cosmos3).
977
+
978
  ## Limitations
979
 
980
  Cosmos3 may produce imperfect outputs in challenging scenarios. Generation artifacts include temporal inconsistency, unstable camera or object motion, imprecise physical interactions, inaccurate audio-video synchronization, and action-state drift — especially in long-horizon or high-resolution outputs. Reasoning may also be incorrect: object states, causal relationships, spatial geometry, temporal ordering, agent intent, and future outcomes can be misinferred, and complex or long-context inputs may yield hallucinated entities, inconsistent interpretations, or implausible predictions. Because the model lacks an explicit physics simulator, 3D geometry, 4D space-time evolution, object permanence, contact dynamics, and physical laws are only approximated — producing artifacts such as disappearing or morphing objects, unrealistic collisions, and physically implausible motions. Quality further degrades in out-of-distribution environments, safety-critical edge cases, and domains underrepresented in training.
 
983
 
984
  ## Inference
985
 
986
+ **Acceleration Engine:** [PyTorch](https://pytorch.org/), [vLLM](https://github.com/vllm-project/vllm), [vLLM-Omni](https://github.com/vllm-project/vllm-omni), [Hugging Face Diffusers](https://github.com/huggingface/diffusers), [SGLang](https://sgl-project.github.io/), [SGLang Diffusion](https://sgl-project.github.io/diffusion)
987
 
988
  **Test Hardware:** GB200 and H100
989