You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Quick Start Guide

To use this model, follow the snippet below:

from transformers import AutoModelForCausalLM

# Optional: collect config parameters to override at load time.
# model_config_overrides = {}

model = AutoModelForCausalLM.from_pretrained(
    "kuleshov-group/proseco-llada-sft",
    trust_remote_code=True,  # the repository ships custom model code
    # **model_config_overrides,
)
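
Because access to the files requires accepting the repository's conditions, loading from a script usually needs an access token as well. The following is a minimal sketch, not part of the official instructions: `build_load_kwargs` and `load_model` are hypothetical helper names, and reading the token from the `HF_TOKEN` environment variable is an assumption about how credentials are stored on your machine.

```python
import os


def build_load_kwargs(overrides=None, token=None):
    """Assemble keyword arguments for from_pretrained (hypothetical helper)."""
    kwargs = {"trust_remote_code": True}  # the repository ships custom model code
    if token:
        kwargs["token"] = token  # access token for the gated repository
    kwargs.update(overrides or {})  # optional config overrides, if any
    return kwargs


def load_model(repo_id="kuleshov-group/proseco-llada-sft", overrides=None):
    # Imported lazily so build_load_kwargs can be used without transformers installed.
    from transformers import AutoModelForCausalLM

    # Assumption: the access token, if needed, is stored in HF_TOKEN.
    token = os.environ.get("HF_TOKEN")
    return AutoModelForCausalLM.from_pretrained(
        repo_id, **build_load_kwargs(overrides, token)
    )


# Example call (downloads the checkpoint, so it is left commented out):
# model = load_model()
```

Keeping the keyword arguments in a separate function makes it easy to inspect exactly what is passed to `from_pretrained` before starting a large download.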

Model details

See the paper for more details.

Citation

  @article{schiff2026learn,
    title={Learn from Your Mistakes: Self-Correcting Masked Diffusion Models},
    author={Schiff, Yair and Belhasin, Omer and Uziel, Roy and Wang, Guanghan and Arriola, Marianne and Turok, Gilad and Elad, Michael and Kuleshov, Volodymyr},
    journal={arXiv preprint arXiv:2602.11590},
    year={2026}
  }

Model size: 8B params. Tensor type: BF16. Weights format: Safetensors.