恭喜突破GPT-4, 这是开源的胜利

by 11011Free - opened Sep 21, 2023

Discussion

11011Free

Sep 21, 2023

恭喜突破GPT-4, 这是开源的胜利, 一个新的起点

robotzheng

Sep 22, 2023

如何微调的？进化SFT+强化

mt-99

Sep 22, 2023

•

edited Sep 22, 2023

就是现在没有比较权威的评价标准，出来的都说自己很牛

Yhyu13

Sep 22, 2023

如何微调的？进化SFT+强化

RLHF, 根据title,

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Step up your LLM alignment with Xwin-LM!
Xwin-LM aims to develop and open-source alignment technologies for large language models, including supervised fine-tuning (SFT), reward models (RM), reject sampling, reinforcement learning from human feedback (RLHF), etc. Our first release, built-upon on the Llama2 base models, ranked TOP-1 on AlpacaEval. Notably, it's the first to surpass GPT-4 on this benchmark. The project will be continuously updated.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment