philipp-zettl/modernbert-diffusion-refactor

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

./refinebert-refactor

Model Summary

A diffusion-style masked language model fine-tuned from philipp-zettl/modernbert-diffusion-universal on a custom local dataset (code_refactoring.txt).

Model Details

Model ID: ./refinebert-refactor
Base model: philipp-zettl/modernbert-diffusion-universal
Training mode: Fine-tuning
Task type: Masked token denoising / diffusion-style infilling
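
The card does not describe how training inputs are corrupted, so the following is only a minimal sketch of the general idea behind masked token denoising: a random fraction of tokens is replaced with a mask token, and the model is trained to recover them. The `corrupt` helper, the `<mask>` string, the whitespace tokenization and the mask ratios are all illustrative assumptions, not this model's actual tokenizer or noise schedule.

```python
import random

MASK = "<mask>"  # placeholder mask string (assumption, not the real tokenizer's token)


def corrupt(tokens, mask_ratio, rng):
    """Replace a random `mask_ratio` fraction of tokens with MASK.

    Returns the corrupted sequence and the sorted positions that were masked,
    i.e. the positions a denoising model would be asked to recover.
    """
    n_mask = max(1, round(mask_ratio * len(tokens)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    corrupted = list(tokens)
    for i in positions:
        corrupted[i] = MASK
    return corrupted, positions


rng = random.Random(0)
tokens = "x = [ i * 2 for i in items ]".split()

# Higher ratios correspond to "noisier" inputs; a ratio of 1.0 masks everything.
for ratio in (0.25, 0.5, 1.0):
    corrupted, positions = corrupt(tokens, ratio, rng)
    print(f"{ratio:.2f}", " ".join(corrupted))
```

Training on a range of mask ratios, rather than one fixed ratio, is what lets a masked language model be used as an iterative denoiser at generation time.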

Intended Use

Intended for masked-token infilling on text similar to the custom training data (code_refactoring.txt).

Example

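from refinebert.diffusion_engine import MaskedDiffusionEngine

# Load the fine-tuned checkpoint from the local path used during training.
engine = MaskedDiffusionEngine("./refinebert-refactor")

# The card records no prompt or generation settings ("N/A (See generation logs)").
# The prompt and numeric values below are illustrative placeholders only,
# not recommended or tested settings.
prompt = "def add(a, b): return a+b"
output = engine.generate(prompt, num_new_tokens=32, steps=8, guidance_scale=1.0)
print(output)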

Training Data

Single-dataset fine-tuning.

Dataset Mix

| Source | Share | Files |
|---|---|---|
| Custom Files | 100% | code_refactoring.txt |

Fine-tuned on user-provided local text files.

Training Procedure

Steps: 1731
Batch size: 16
Sequence length: 256
Learning rate: 5e-05
CFG dropout probability: N/A
Samples loaded into RAM: N/A

Training Time & Hardware

Duration: 0h 10m 25s
Hardware: NVIDIA GeForce RTX 4070 Laptop GPU x1 (CUDA available)

Metrics (Training)

Metric	Value
Training Loss	2.0958
Epochs	3
Global Step	1731
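
As a quick sanity check, the reported figures can be related to each other with some simple arithmetic. The sketch below assumes one optimizer step per batch (no gradient accumulation) and epochs of equal length; neither is stated in the card, so the sample and token counts are upper bounds rather than exact values.

```python
# Figures copied from the card.
steps = 1731
epochs = 3
batch_size = 16
seq_len = 256
duration_s = 0 * 3600 + 10 * 60 + 25  # "0h 10m 25s"

# Assumption: one optimizer step per batch, epochs of equal length.
steps_per_epoch = steps // epochs

# Upper bounds: the last batch of an epoch may be partial,
# and sequences shorter than seq_len would be padded.
max_samples_per_epoch = steps_per_epoch * batch_size
max_tokens_per_step = batch_size * seq_len

steps_per_second = steps / duration_s

print(f"steps per epoch:         {steps_per_epoch}")        # 577
print(f"samples per epoch (max): {max_samples_per_epoch}")  # 9232
print(f"tokens per step (max):   {max_tokens_per_step}")    # 4096
print(f"steps per second:        {steps_per_second:.2f}")   # 2.77
```

Under these assumptions the training file yields at most about 9,200 sequences of 256 tokens per epoch, which is consistent with the short training time on a single laptop GPU.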

Limitations & Considerations

The model is trained with a masked-token diffusion objective and may not behave like an autoregressive LM.
Data sources may have licensing or content constraints; review the provenance of the source files before deployment.
Performance can vary substantially with the training mode (here, fine-tuning) and with prompt structure.
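
To illustrate why a masked-token diffusion model may not behave like an autoregressive LM, here is a toy sketch of iterative unmasking: the output starts fully masked, every masked position is predicted at each step, and only the most confident predictions are committed. This is not the `MaskedDiffusionEngine` implementation, which the card does not show. `toy_predict` is a hypothetical stand-in for the model that simply knows the target and is more confident next to already-known tokens.

```python
MASK = "<mask>"  # placeholder mask string (assumption)


def toy_predict(tokens, target):
    """Hypothetical stand-in for a masked LM.

    Returns {position: (token, confidence)} for every masked position.
    Confidence is higher for positions close to an already-known token.
    """
    known = [i for i, t in enumerate(tokens) if t != MASK]
    preds = {}
    for i, t in enumerate(tokens):
        if t != MASK:
            continue
        dist = min((abs(i - k) for k in known), default=len(tokens))
        preds[i] = (target[i], 1.0 / (1.0 + dist))
    return preds


def iterative_unmask(prompt, target, steps):
    """Fill masked positions over at most `steps` rounds, most confident first."""
    tokens = list(prompt) + [MASK] * (len(target) - len(prompt))
    history = [list(tokens)]
    for step in range(steps):
        preds = toy_predict(tokens, target)
        if not preds:
            break
        # Commit an equal share of the remaining masks each round.
        remaining_steps = steps - step
        k = -(-len(preds) // remaining_steps)  # ceiling division
        best = sorted(preds, key=lambda i: preds[i][1], reverse=True)[:k]
        for i in best:
            tokens[i] = preds[i][0]
        history.append(list(tokens))
    return tokens, history


target = "def add ( a , b ) : return a + b".split()
final, history = iterative_unmask(target[:2], target, steps=4)
for snapshot in history:
    print(" ".join(snapshot))
```

Several tokens are committed per step rather than one at a time. In this toy the fill order happens to run left to right because confidence is tied to distance from the prompt; with a real model the order follows its confidence scores and need not be sequential.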

Safetensors

Model size: 0.1B params
Tensor type: F32

Model tree for philipp-zettl/modernbert-diffusion-refactor

Base model: answerdotai/ModernBERT-base
Finetuned: philipp-zettl/modernbert-diffusion-universal
Finetuned (3): this model

Collection including philipp-zettl/modernbert-diffusion-refactor

Diffusion Language Models: Experimental diffusion-style MLM built on top of ModernBERT. Inspired by https://nathan.rs/posts/roberta-diffusion/ (6 items, updated Apr 1)