| license: apache-2.0 | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. | |
| train tinyllama1b-instruct for 20k DPO. |