---
license: apache-2.0
datasets:
- timdettmers/openassistant-guanaco
language:
- en
base_model:
- meta-llama/Llama-2-7b-chat-hf
---

## 🐋 Humpback-reproduce

This is a backward model _Myx_ trained as part of a reproduction of [Self-Alignment with Instruction Backtranslation](https://arxiv.org/pdf/2308.06259.pdf).

This model (Llama 2 7B) is trained on the seed data ([openassistant-guanaco](https://huggingface.co/datasets/timdettmers/openassistant-guanaco), __English data only__) in reversed order, i.e. on (output, instruction) pairs {(yi, xi)}.

In other words, the model is trained to predict the instruction from the output.
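
The reversal can be illustrated with a minimal sketch. It is not the preprocessing used to train this model: it assumes each seed record is a single string with `### Human:` and `### Assistant:` turn markers, keeps only the first turn, and uses hypothetical field names `input` and `target` for the reversed pair.

```python
from typing import Optional, Tuple

# Assumed turn markers in the seed text; adjust if the data uses a different format.
HUMAN = "### Human:"
ASSISTANT = "### Assistant:"


def first_turn(text: str) -> Optional[Tuple[str, str]]:
    """Return (instruction, output) for the first turn, or None if it cannot be parsed."""
    if HUMAN not in text:
        return None
    after_human = text.split(HUMAN, 1)[1]
    if ASSISTANT not in after_human:
        return None
    instruction, rest = after_human.split(ASSISTANT, 1)
    # Cut the response at the next human turn, if there is one.
    output = rest.split(HUMAN, 1)[0]
    instruction, output = instruction.strip(), output.strip()
    if not instruction or not output:
        return None
    return instruction, output


def to_backward_example(text: str) -> Optional[dict]:
    """Build a reversed (y_i, x_i) example: the output is the input, the instruction is the target."""
    pair = first_turn(text)
    if pair is None:
        return None
    instruction, output = pair
    # "input" and "target" are hypothetical field names chosen for this sketch.
    return {"input": output, "target": instruction}


sample = (
    "### Human: What is a humpback?"
    "### Assistant: A species of baleen whale."
    "### Human: Thanks!"
)
example = to_backward_example(sample)
print(example)
```

Here the assistant response becomes the model input and the human instruction becomes the prediction target, which is the (yi, xi) ordering described above.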

## 📜 Reference

```bibtex
@misc{li2023selfalignment,
    title={Self-Alignment with Instruction Backtranslation},
    author={Xian Li and Ping Yu and Chunting Zhou and Timo Schick and Luke Zettlemoyer and Omer Levy and Jason Weston and Mike Lewis},
    year={2023},
    eprint={2308.06259},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
```