Diffusion Language Models
Collection
Experimental diffusion-style MLM built on top of ModernBERT. Inspired by https://nathan.rs/posts/roberta-diffusion/ • 6 items • Updated
How to use philipp-zettl/modernbert-diffusion-refactor with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("fill-mask", model="philipp-zettl/modernbert-diffusion-refactor") # Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("philipp-zettl/modernbert-diffusion-refactor")
model = AutoModelForMaskedLM.from_pretrained("philipp-zettl/modernbert-diffusion-refactor")A diffusion-style masked language model fine-tuned from philipp-zettl/modernbert-diffusion-universal on the custom dataset.
Intended for tasks related to the custom training data.
Example
from refinebert.diffusion_engine import MaskedDiffusionEngine
engine = MaskedDiffusionEngine("./refinebert-refactor")
prompt = "N/A (See generation logs)"
output = engine.generate(prompt, num_new_tokens=N/A, steps=N/A, guidance_scale=N/A)
print(output)
Single-dataset fine-tuning.
| Custom Files | 100% | code_refactoring.txt |
Fine-tuned on user-provided local text files.
| Metric | Value |
|---|---|
| Training Loss | 2.0958 |
| Epochs | 3 |
| Global Step | 1731 |
Base model
answerdotai/ModernBERT-base