hlnchen committed (verified)
Commit 3986381 · 1 Parent(s): 7891534

Update README.md

Files changed (1): README.md (+30 −5)
README.md CHANGED
---
# CoDA-v0-Base

## Overview 🎯
CoDA is Salesforce AI Research's open diffusion language model.

[Technical Report](https://github.com/SalesforceAIResearch/CoDA/blob/main/technical_report.pdf)

[Code](https://github.com/SalesforceAIResearch/CoDA/)

The code repo contains a unified training pipeline from pre-training to post-training, evaluation harnesses, and a simple FastAPI-based serving backend.

## Requirements 📦
```
torch==2.8.0
transformers>=4.47.1
flash-attn==2.8.3
```
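Assuming a CUDA-enabled environment, the pinned versions can be installed roughly as below; note that `flash-attn` expects `torch` to already be present when it builds, and `--no-build-isolation` is the commonly needed flag (exact flags are environment-dependent):

```shell
# Install torch and transformers first; flash-attn builds against
# the already-installed torch, so it goes in a second step.
pip install torch==2.8.0 "transformers>=4.47.1"
pip install flash-attn==2.8.3 --no-build-isolation
```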

## Quickstart 🚀
Here is a code snippet for loading the model and tokenizer and running unmasking on partially finished code.
```python
import torch
...
```

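The full snippet is elided in this diff view. As a rough, self-contained illustration of what "unmasking" means for a diffusion LM — with a toy stand-in for the model forward pass and a hypothetical `MASK_ID`, not the actual CoDA API — a confidence-ordered denoising loop can be sketched as:

```python
import torch

MASK_ID = 0          # hypothetical mask-token id for this toy example
VOCAB = 16           # toy vocabulary size

def toy_logits(ids: torch.Tensor) -> torch.Tensor:
    # Stand-in for a diffusion LM forward pass: random but deterministic
    # logits over the toy vocabulary for every position.
    torch.manual_seed(0)
    return torch.randn(ids.shape[0], ids.shape[1], VOCAB)

def unmask(ids: torch.Tensor, steps: int = 8) -> torch.Tensor:
    # Iteratively fill masked positions, committing the highest-confidence
    # predictions first -- the usual diffusion-LM decoding loop.
    ids = ids.clone()
    for _ in range(steps):
        n_masked = int((ids == MASK_ID).sum())
        if n_masked == 0:
            break
        logits = toy_logits(ids)
        logits[..., MASK_ID] = float("-inf")   # never predict the mask itself
        conf, pred = logits.softmax(dim=-1).max(dim=-1)
        # Only masked positions compete; commit about half of them per step.
        conf = torch.where(ids == MASK_ID, conf, torch.full_like(conf, -1.0))
        k = max(1, n_masked // 2)
        top = conf.flatten().topk(k).indices
        ids.view(-1)[top] = pred.view(-1)[top]
    return ids

prompt = torch.tensor([[5, 9, MASK_ID, MASK_ID, 3, MASK_ID]])
out = unmask(prompt)
```

In the real model the forward pass replaces `toy_logits`, and the mask-token id comes from the tokenizer.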

## Benchmark 📊
Comparison of code-generation performance across standard and plus-enhanced benchmarks. EvalPlus is computed as the mean pass@1 on the plus-enhanced variants. Bold marks results where CoDA achieves the strongest diffusion-model performance.

| Model | HumanEval Instruct | HumanEval Plus | MBPP Instruct | MBPP Plus | EvalPlus |
| --- | --- | --- | --- | --- | --- |
| CoDA-Base | 29.3 | 23.8 | 35.2 | 46.0 | 34.9 |
| CoDA-Instruct | 54.3 | 47.6 | 47.2 | **63.2** | **55.4** |
| Dream-Base | 56.7 | 50.0 | 68.7 | 57.4 | 53.7 |
| Dream-7B-Instruct | 57.9 | 53.7 | 68.3 | 56.1 | 54.9 |
| LLaDA-8B-Instruct | 35.4 | 31.7 | 31.5 | 28.6 | 30.2 |
| Qwen3-1.7B | 66.5 | 61.6 | 46.2 | 65.9 | 63.8 |
| Qwen2.5-Coder-1.5B | 43.9 | 36.6 | 69.2 | 58.6 | 47.6 |
| Qwen2.5-Coder-1.5B-Instruct | 70.7 | 66.5 | 69.2 | 59.4 | 62.3 |
| Gemma-3-1B-it | 39.6 | 35.4 | 39.4 | 63.5 | 49.5 |
| LLaMA-3.2-1B-Instruct | 35.4 | 31.1 | 24.4 | 53.7 | 42.4 |

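As a quick sanity check of the EvalPlus column (the mean of the two Plus scores), using two rows from the table:

```python
# EvalPlus = mean pass@1 over the plus-enhanced variants
# (HumanEval Plus, MBPP Plus), shown here for two table rows.
scores = {
    "CoDA-Instruct": (47.6, 63.2),
    "Dream-Base": (50.0, 57.4),
}
evalplus = {m: round(sum(v) / len(v), 1) for m, v in scores.items()}
print(evalplus)
```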

## Deployment 🛠️
Check out our [Deployment Guide](https://github.com/SalesforceAIResearch/CoDA?tab=readme-ov-file#deployment-guide-%EF%B8%8F)!

## Citation 📚
```
coming soon
```