Add model card and metadata for dUltra

by nielsr HF Staff - opened Feb 9

Hi! I'm Niels from the Hugging Face community science team.

This PR adds a model card for dUltra, an on-policy reinforcement learning framework for masked diffusion language models.

The PR includes:

Please feel free to merge if this looks good to you!

Ready to merge

This branch is ready to get merged automatically.
