Update README.md
Browse files
README.md
CHANGED
|
@@ -33,7 +33,7 @@ base_model:
|
|
| 33 |
- **Soft Parallel Decoding**: Uses interpolation between mask and token embeddings to propagate confidence priors from previous steps.
|
| 34 |
|
| 35 |
<div align="center">
|
| 36 |
-
<img src="assets/tradeoff.png" width="
|
| 37 |
<br>
|
| 38 |
<em>Superior Parallelism-Accuracy Trade-off, Increased TPF with Maintained Accuracy.</em>
|
| 39 |
</div>
|
|
@@ -43,13 +43,13 @@ base_model:
|
|
| 43 |
|
| 44 |
| Model | Description | Source Model | Link |
|
| 45 |
| --- | --- | --- | --- |
|
| 46 |
-
| π€ DMax-Math-16B | Highly parallel dLLM for math and reasoning. | LLaDA-2.0-mini | [
|
| 47 |
-
| π€ DMax-Coder-16B | Highly parallel dLLM for code generation. | LLaDA-2.0-mini | [
|
| 48 |
|
| 49 |
| Dataset | Description | Link |
|
| 50 |
| --- | --- | --- |
|
| 51 |
-
| π DMax-Math-Training-Data | Trajectories on math problems generated by LLaDA-2.0-mini | [
|
| 52 |
-
| π DMax-Code-Training-Data | Trajectories on code problems generated by LLaDA-2.0-mini | [
|
| 53 |
|
| 54 |
|
| 55 |
## π Quick Start
|
|
|
|
| 33 |
- **Soft Parallel Decoding**: Uses interpolation between mask and token embeddings to propagate confidence priors from previous steps.
|
| 34 |
|
| 35 |
<div align="center">
|
| 36 |
+
<img src="assets/tradeoff.png" width="100%" />
|
| 37 |
<br>
|
| 38 |
<em>Superior Parallelism-Accuracy Trade-off, Increased TPF with Maintained Accuracy.</em>
|
| 39 |
</div>
|
|
|
|
| 43 |
|
| 44 |
| Model | Description | Source Model | Link |
|
| 45 |
| --- | --- | --- | --- |
|
| 46 |
+
| π€ DMax-Math-16B | Highly parallel dLLM for math and reasoning. | LLaDA-2.0-mini | [HF](https://huggingface.co/Zigeng/DMax-Math-16B) |
|
| 47 |
+
| π€ DMax-Coder-16B | Highly parallel dLLM for code generation. | LLaDA-2.0-mini | [HF](https://huggingface.co/Zigeng/DMax-Coder-16B) |
|
| 48 |
|
| 49 |
| Dataset | Description | Link |
|
| 50 |
| --- | --- | --- |
|
| 51 |
+
| π DMax-Math-Training-Data | Trajectories on math problems generated by LLaDA-2.0-mini | [HF](https://huggingface.co/datasets/Zigeng/DMax-LLaDA-2.0-Mini-Math-Trajectories) |
|
| 52 |
+
| π DMax-Code-Training-Data | Trajectories on code problems generated by LLaDA-2.0-mini | [HF](https://huggingface.co/datasets/Zigeng/DMax-LLaDA-2.0-Mini-Code-Trajectories) |
|
| 53 |
|
| 54 |
|
| 55 |
## π Quick Start
|