Instructions to use fredzzp/open-dcoder-0.5B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use fredzzp/open-dcoder-0.5B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="fredzzp/open-dcoder-0.5B") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("fredzzp/open-dcoder-0.5B") model = AutoModelForCausalLM.from_pretrained("fredzzp/open-dcoder-0.5B") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use fredzzp/open-dcoder-0.5B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "fredzzp/open-dcoder-0.5B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "fredzzp/open-dcoder-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/fredzzp/open-dcoder-0.5B
- SGLang
How to use fredzzp/open-dcoder-0.5B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "fredzzp/open-dcoder-0.5B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "fredzzp/open-dcoder-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "fredzzp/open-dcoder-0.5B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "fredzzp/open-dcoder-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use fredzzp/open-dcoder-0.5B with Docker Model Runner:
docker model run hf.co/fredzzp/open-dcoder-0.5B
Add paper link and improve model card metadata
Browse filesHi, I'm Niels from the Hugging Face community science team. I'm opening this PR to improve your model card with relevant metadata and links to the associated research.
Specifically, I have:
- Added `pipeline_tag: text-generation` to the YAML metadata to improve discoverability on the Hub.
- Added a link to the paper: "Don't Retrain, Align: Adapting Autoregressive LMs to Diffusion LMs via Representation Alignment".
- Added a link to the official GitHub repository for easier access to training and evaluation scripts.
- Included the citation for the project.
Please feel free to review and merge if this looks good to you!
|
@@ -1,8 +1,9 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
language:
|
| 4 |
- code
|
| 5 |
library_name: transformers
|
|
|
|
|
|
|
| 6 |
tags:
|
| 7 |
- masked-diffusion
|
| 8 |
- code-generation
|
|
@@ -11,7 +12,12 @@ tags:
|
|
| 11 |
|
| 12 |
## Open Diffusion Large Language Models for Code Generation
|
| 13 |
|
| 14 |
-
This repository contains the weights and custom code for the **fredzzp/open-dcoder-0.5B** model, a masked diffusion model for code generation based on the Qwen2 architecture.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
|
| 16 |
This model uses bidirectional attention and must be used with the custom `diffusion_generate` method.
|
| 17 |
|
|
@@ -23,8 +29,8 @@ First, make sure you have the latest `transformers` library installed.
|
|
| 23 |
pip install transformers torch huggingface_hub
|
| 24 |
```
|
| 25 |
|
|
|
|
| 26 |
|
| 27 |
-
You can then use the model for generation. Note: You must pass trust_remote_code=True to load the custom model architecture.
|
| 28 |
```python
|
| 29 |
import torch
|
| 30 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
@@ -59,3 +65,16 @@ generated_text = tokenizer.decode(outputs.sequences[0][prompt_len:], skip_specia
|
|
| 59 |
print("--- Generated Code ---")
|
| 60 |
print(generated_text)
|
| 61 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
language:
|
| 3 |
- code
|
| 4 |
library_name: transformers
|
| 5 |
+
license: apache-2.0
|
| 6 |
+
pipeline_tag: text-generation
|
| 7 |
tags:
|
| 8 |
- masked-diffusion
|
| 9 |
- code-generation
|
|
|
|
| 12 |
|
| 13 |
## Open Diffusion Large Language Models for Code Generation
|
| 14 |
|
| 15 |
+
This repository contains the weights and custom code for the **fredzzp/open-dcoder-0.5B** model, a masked diffusion model for code generation based on the Qwen2 architecture.
|
| 16 |
+
|
| 17 |
+
The model was introduced in the paper [Don't Retrain, Align: Adapting Autoregressive LMs to Diffusion LMs via Representation Alignment](https://huggingface.co/papers/2605.06885).
|
| 18 |
+
|
| 19 |
+
- **Code:** [pengzhangzhi/Open-dLLM](https://github.com/pengzhangzhi/Open-dLLM)
|
| 20 |
+
- **Blog:** [Notion Blog](https://oval-shell-31c.notion.site/Open-Diffusion-Large-Language-Model-25e03bf6136480b7a4ebe3d53be9f68a?pvs=74)
|
| 21 |
|
| 22 |
This model uses bidirectional attention and must be used with the custom `diffusion_generate` method.
|
| 23 |
|
|
|
|
| 29 |
pip install transformers torch huggingface_hub
|
| 30 |
```
|
| 31 |
|
| 32 |
+
You can then use the model for generation. Note: You must pass `trust_remote_code=True` to load the custom model architecture.
|
| 33 |
|
|
|
|
| 34 |
```python
|
| 35 |
import torch
|
| 36 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
|
| 65 |
print("--- Generated Code ---")
|
| 66 |
print(generated_text)
|
| 67 |
```
|
| 68 |
+
|
| 69 |
+
## Citation
|
| 70 |
+
|
| 71 |
+
```bibtex
|
| 72 |
+
@misc{opendllm2025,
|
| 73 |
+
title = {Open-dLLM: Open Diffusion Large Language Models},
|
| 74 |
+
author = {Fred Zhangzhi Peng, Shuibai Zhang, Alex Tong, and contributors},
|
| 75 |
+
year = {2025},
|
| 76 |
+
howpublished = {\url{https://github.com/pengzhangzhi/Open-dLLM}},
|
| 77 |
+
note = {Blog: \url{https://oval-shell-31c.notion.site/Open-Diffusion-Large-Language-Model-25e03bf6136480b7a4ebe3d53be9f68a?pvs=74},
|
| 78 |
+
Model: \url{https://huggingface.co/fredzzp/open-dcoder-0.5B}}
|
| 79 |
+
}
|
| 80 |
+
```
|