ELF-B-owt / README.md
nielsr's picture
nielsr HF Staff
Add model card for ELF
3d5b5f1 verified
|
raw
history blame
1.84 kB
---
license: mit
pipeline_tag: text-generation
---
# ELF: Embedded Language Flows
ELF is a class of continuous diffusion language models based on continuous-time Flow Matching, presented in the paper [ELF: Embedded Language Flows](https://huggingface.co/papers/2605.10938).
Unlike existing diffusion language models (DLMs), ELF predominantly stays within the continuous embedding space until the final time step, where it maps to discrete tokens using a shared-weight network. This formulation makes it straightforward to adapt established techniques from image-domain diffusion models, such as classifier-free guidance (CFG).
- **Paper:** [ELF: Embedded Language Flows](https://huggingface.co/papers/2605.10938)
- **Repository:** [https://github.com/lillian039/ELF](https://github.com/lillian039/ELF)
## Inference and Evaluation
This checkpoint can be used with the official JAX implementation. To verify the setup or run evaluation, follow these steps:
### Initialization
```bash
git clone https://github.com/lillian039/ELF
cd ELF
pip install -r requirements.txt
```
### Running Evaluation
To evaluate the model on unconditional generation (OpenWebText):
```bash
cd src/
# For ELF-B (105M)
python eval.py \
--config configs/training_configs/train_owt_ELF-B.yml \
--checkpoint_path embedded-language-flows/ELF-B-owt
```
To evaluate on conditional tasks like summarization (XSum):
```bash
cd src/
python eval.py \
--config configs/training_configs/train_xsum_ELF-B.yml \
--checkpoint_path embedded-language-flows/ELF-B-xsum
```
## Citation
```bibtex
@article{elf2026,
title={ELF: Embedded Language Flows},
author={Hu, Keya and Qiu, Linlu and Lu, Yiyang and Zhao, Hanhong and Li, Tianhong and Kim, Yoon and Andreas, Jacob and He, Kaiming},
journal={arXiv preprint arXiv:2605.10938},
year={2026}
}
```