Update README.md
README.md (changed):
```diff
@@ -109,13 +109,10 @@ We use GPT-4 as an evaluator to rate the comparison between our models versus ChatGPT
 
 Compared with [PolyLM-13b-chat](https://arxiv.org/pdf/2307.06018.pdf), a recent multilingual model, our model significantly outperforms across all languages and categories.
 
+
 <div class="row" style="display: flex; clear: both;">
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_polylm_by_lang.png" alt="Snow" style="width:100%">
-  </div>
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_polylm_by_cat_sea.png" alt="Forest" style="width:100%">
-  </div>
+  <img src="seallm_vs_polylm_by_lang.png" alt="Snow" style="float: left; width: 48%">
+  <img src="seallm_vs_polylm_by_cat_sea.png" alt="Forest" style="float: left; width: 48%">
 </div>
 
 Compared with Llama-2-13b-chat, our SeaLLM-13b performs significantly better in all SEA languages,
@@ -123,13 +120,10 @@ despite the fact that Llama-2 was already trained on a decent data amount of Vi,
 In english, our model is 46% as good as Llama-2-13b-chat, even though it did not undergo complex human-labor intensive RLHF.
 
 
+
 <div class="row" style="display: flex; clear: both;">
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_llama2_by_lang.png" alt="Snow" style="width:100%">
-  </div>
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_llama2_by_cat_sea.png" alt="Forest" style="width:100%">
-  </div>
+  <img src="seallm_vs_llama2_by_lang.png" alt="Snow" style="float: left; width: 48%">
+  <img src="seallm_vs_llama2_by_cat_sea.png" alt="Forest" style="float: left; width: 48%">
 </div>
 
 Compared with ChatGPT-3.5, our SeaLLM-13b model is performing 45% as good as ChatGPT for Thai.
@@ -137,12 +131,8 @@ For important aspects such as Safety and Task-Solving, our model nearly on par w
 
 
 <div class="row" style="display: flex; clear: both;">
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_chatgpt_by_lang.png" alt="Snow" style="width:100%">
-  </div>
-  <div class="column" style="float: left; width: 49%">
-    <img src="seallm_vs_chatgpt_by_cat_sea.png" alt="Forest" style="width:100%">
-  </div>
+  <img src="seallm_vs_chatgpt_by_lang.png" alt="Snow" style="float: left; width: 48%">
+  <img src="seallm_vs_chatgpt_by_cat_sea.png" alt="Forest" style="float: left; width: 48%">
 </div>
 
 ### M3Exam - World Knowledge in Regional Languages
```
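The recurring change in each hunk drops the two floated `div.column` wrappers and places the chart images directly inside the flex `div.row`. A minimal sketch of the resulting pattern, using the PolyLM comparison's filenames from the diff:

```html
<!-- Two comparison charts side by side: the flex container lays out
     its children in a row, so no per-column wrapper divs are needed. -->
<div class="row" style="display: flex; clear: both;">
  <img src="seallm_vs_polylm_by_lang.png" alt="Snow" style="float: left; width: 48%">
  <img src="seallm_vs_polylm_by_cat_sea.png" alt="Forest" style="float: left; width: 48%">
</div>
```

Note that `float` has no effect on flex items, so the `display: flex` container alone controls the side-by-side placement; the `float: left` in the new `<img>` styles is redundant but harmless.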