amazon
/

MistralLite

@@ -26,9 +26,10 @@ Although the performance of the models on long context was fairly competitive on
 there were some limitations on its performance on longer context. Motivated by improving its performance on longer context, we finetuned the Mistral 7B model, and produced `Mistrallite`. The model managed to `significantly boost the performance of long context handling` over Mistral-7B-Instruct-v0.1. The detailed `long context evalutaion results` are as below:
 1. [Topic Retrieval](https://lmsys.org/blog/2023-06-29-longchat/)
 |Model Name|Input length| Input length | Input length| Input length| Input length|
 |----------|-------------:|-------------:|------------:|-----------:|-----------:|
-|          | 2851| 5568 |8313 | 11044 | 13780
 |   Mistral-7B-Instruct-v0.1  | 100%        | 50%       | 2%      | 0%     | 0% |
 |   MistralLite   | **100%**        | **100%**       | **100%**      | **100%**     | **98%** |

 there were some limitations on its performance on longer context. Motivated by improving its performance on longer context, we finetuned the Mistral 7B model, and produced `Mistrallite`. The model managed to `significantly boost the performance of long context handling` over Mistral-7B-Instruct-v0.1. The detailed `long context evalutaion results` are as below:
 1. [Topic Retrieval](https://lmsys.org/blog/2023-06-29-longchat/)
 |Model Name|Input length| Input length | Input length| Input length| Input length|
 |----------|-------------:|-------------:|------------:|-----------:|-----------:|
+|          | 2851| 5568 |8313 | 11044 | 13780 |
 |   Mistral-7B-Instruct-v0.1  | 100%        | 50%       | 2%      | 0%     | 0% |
 |   MistralLite   | **100%**        | **100%**       | **100%**      | **100%**     | **98%** |