---
license: apache-2.0
datasets:
- timdettmers/openassistant-guanaco
language:
- en
base_model:
- meta-llama/Llama-2-7b-chat-hf
---
## 🐋 Humpback-reproduce
This is a backward model _Myx_ for reproducing [Self-Alignment with Instruction Backtranslation](https://arxiv.org/pdf/2308.06259.pdf).
This model (Llama 2 7B) is trained on the seed data ([openassistant-guanaco](https://huggingface.co/datasets/timdettmers/openassistant-guanaco), __ENGLISH DATA ONLY__) in reversed order, i.e. on (output, instruction) pairs {(yi, xi)}.
In other words, the model is trained to predict the instruction from the output.
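
The reversed ordering can be illustrated with a minimal sketch. It assumes each seed record is a single string using `### Human:` / `### Assistant:` turn markers and keeps only the first turn. The helper names (`first_turn`, `to_backward_example`) and the `Response:` / `Instruction:` prompt template are hypothetical and chosen for illustration; they are not necessarily what was used to train this model.

```python
from typing import Dict, Optional, Tuple

# Assumed turn markers in the seed data's text field.
HUMAN_TAG = "### Human:"
ASSISTANT_TAG = "### Assistant:"


def first_turn(text: str) -> Optional[Tuple[str, str]]:
    """Return (instruction, output) for the first turn, or None if malformed."""
    if not text.startswith(HUMAN_TAG) or ASSISTANT_TAG not in text:
        return None
    human_part, _, rest = text[len(HUMAN_TAG):].partition(ASSISTANT_TAG)
    # Drop any later turns so only the first exchange is kept.
    output = rest.split(HUMAN_TAG, 1)[0].strip()
    instruction = human_part.strip()
    if not instruction or not output:
        return None
    return instruction, output


def to_backward_example(text: str) -> Optional[Dict[str, str]]:
    """Build a reversed (output -> instruction) training example.

    The prompt template below is a hypothetical placeholder.
    """
    pair = first_turn(text)
    if pair is None:
        return None
    instruction, output = pair
    return {
        "prompt": f"Response:\n{output}\n\nInstruction:\n",  # model input: y_i
        "completion": instruction,                            # target: x_i
    }


sample = "### Human: What is 2+2?### Assistant: 4.### Human: Thanks"
example = to_backward_example(sample)
```

Here the output `4.` becomes the model input and the instruction `What is 2+2?` becomes the target, which is the (yi, xi) ordering described above.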
## 📜 Reference
```bibtex
@misc{li2023selfalignment,
title={Self-Alignment with Instruction Backtranslation},
author={Xian Li and Ping Yu and Chunting Zhou and Timo Schick and Luke Zettlemoyer and Omer Levy and Jason Weston and Mike Lewis},
year={2023},
eprint={2308.06259},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```