Text Generation
Transformers
Safetensors
PyTorch
English
language-model
diffusion
latent-diffusion
flow-matching
text-vae
research
Instructions to use ByteDance-Seed/Cola-DLM with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ByteDance-Seed/Cola-DLM with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ByteDance-Seed/Cola-DLM")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ByteDance-Seed/Cola-DLM", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use ByteDance-Seed/Cola-DLM with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ByteDance-Seed/Cola-DLM" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ByteDance-Seed/Cola-DLM", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/ByteDance-Seed/Cola-DLM
- SGLang
How to use ByteDance-Seed/Cola-DLM with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ByteDance-Seed/Cola-DLM" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ByteDance-Seed/Cola-DLM", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ByteDance-Seed/Cola-DLM" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ByteDance-Seed/Cola-DLM", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use ByteDance-Seed/Cola-DLM with Docker Model Runner:
docker model run hf.co/ByteDance-Seed/Cola-DLM
statement
Browse files
README.md
CHANGED
|
@@ -200,6 +200,24 @@ This checkpoint is **not instruction-tuned** and has not gone through RLHF. It s
|
|
| 200 |
- Generation quality can be sensitive to prompt format and prompt length. QA-style prompts such as `"Question: ... Answer:"` are recommended for quick evaluation.
|
| 201 |
- The model uses mutable KV caches during generation; service implementations should serialize generation inside one process unless cache handling is explicitly isolated.
|
| 202 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 203 |
## Citation
|
| 204 |
|
| 205 |
If you use Cola DLM in your work, please cite:
|
|
|
|
| 200 |
- Generation quality can be sensitive to prompt format and prompt length. QA-style prompts such as `"Question: ... Answer:"` are recommended for quick evaluation.
|
| 201 |
- The model uses mutable KV caches during generation; service implementations should serialize generation inside one process unless cache handling is explicitly isolated.
|
| 202 |
|
| 203 |
+
## Safety Statement and Use Restrictions
|
| 204 |
+
|
| 205 |
+
Cola DLM is a research-oriented checkpoint for continuous latent diffusion language modeling. The released model is relatively small and has **not been instruction-tuned, RLHF-aligned, or systematically safety-aligned**. Therefore, it does not provide reliable refusal behavior, content moderation, or risk detection. Its outputs may contain inaccurate, offensive, biased, unlawful, inappropriate, or misleading content.
|
| 206 |
+
|
| 207 |
+
This model is intended only for academic research and technical experimentation. We do not encourage, support, or authorize the use of Cola DLM to generate, distribute, or assist with the following types of content:
|
| 208 |
+
|
| 209 |
+
- Pornographic, sexually explicit, exploitative, or otherwise inappropriate content;
|
| 210 |
+
- Gambling-related content, including gambling promotion, betting advice, or illegal gambling services;
|
| 211 |
+
- Content related to illegal drugs or controlled substances, including instructions for manufacturing, purchasing, selling, using, or evading regulation;
|
| 212 |
+
- Hate, harassment, discrimination, threats of violence, extremist, or inflammatory content;
|
| 213 |
+
- Political manipulation, targeted political persuasion, political misinformation, incitement of international or intergroup conflict, or sensitive political content that may escalate social, national, or geopolitical tensions;
|
| 214 |
+
- Illegal activities, regulatory evasion, cyber abuse, privacy violations, or other content that may cause real-world harm;
|
| 215 |
+
- Automated advice or decision-making in high-stakes domains such as medical, legal, financial, safety-critical, or security-sensitive settings.
|
| 216 |
+
|
| 217 |
+
Users who download, deploy, fine-tune, redistribute, or build applications based on this model are responsible for implementing appropriate safety and compliance measures. Such measures may include, but are not limited to, input and output moderation, access control, logging and auditing, human review, red-teaming, and compliance checks under applicable laws and regulations.
|
| 218 |
+
|
| 219 |
+
Cola DLM should not be treated as a production-ready chatbot or a safety-reliable general-purpose assistant. Any content generated by this model does not represent the views, positions, or endorsements of the authors, affiliated institutions, or contributors.
|
| 220 |
+
|
| 221 |
## Citation
|
| 222 |
|
| 223 |
If you use Cola DLM in your work, please cite:
|