argilla
/

notus-7b-v1

Text Generation

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

dvilasuero HF Staff commited on Nov 30, 2023

Commit

f891dd2

·

1 Parent(s): 3455ea2

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -52,7 +52,9 @@ with the original Zephyr dDPO model and other 7B models.
 ### Chat benchmarks
 Table adapted from Zephyr-7b-β and Starling's original tables for [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench) and [AlpacaEval](https://tatsu-lab.github.io/alpaca_eval/) benchmarks. Results are shown sorted by AlpacaEval win rates and ommit some >7B for brevity.
 Notus stays on par with Zephyr on MT-Bench, while surpassing Zephyr, Claude 2, and Cohere Command on AlpacaEval. Making Notus the most-competitive 7B commercial model on AlpacaEval.
 <table>
     <tr>
         <th>Model</th>

 ### Chat benchmarks
 Table adapted from Zephyr-7b-β and Starling's original tables for [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench) and [AlpacaEval](https://tatsu-lab.github.io/alpaca_eval/) benchmarks. Results are shown sorted by AlpacaEval win rates and ommit some >7B for brevity.
 Notus stays on par with Zephyr on MT-Bench, while surpassing Zephyr, Claude 2, and Cohere Command on AlpacaEval. Making Notus the most-competitive 7B commercial model on AlpacaEval.
 <table>
     <tr>
         <th>Model</th>