NTQAI/Nxcode-CQ-7B-orpo

nhanv committed on Apr 25, 2024

Commit 15165fa · verified · 1 Parent(s): 612e826

Update README.md

Files changed (1)

README.md +13 -4

@@ -21,12 +21,21 @@ Nxcode-CQ-7B-orpo is an ORPO fine-tune of Qwen/CodeQwen1.5-7B-Chat on 100k sampl
 * Supporting 92 coding languages
 * Excellent performance in text-to-SQL, bug fix, etc.
-## Evalplus(https://github.com/evalplus/evalplus)
-| human-eval | pass@1 |
+## [Evalplus](https://github.com/evalplus/evalplus)
+| EvalPlus | pass@1 |
 | --- | --- |
-| humaneval | 86.0 |
-| humaneval+ | 81.1 |
+| HumanEval | 86.0 |
+| HumanEval+ | 81.1 |
+[Evalplus Leaderboard](https://evalplus.github.io/leaderboard.html)
+| Models | HumanEval | HumanEval+|
+|------ | ------  | ------ |
+| GPT-4-Turbo (April 2024)|  90.2| 86.6|
+| GPT-4 (May 2023)|  88.4| 81.17|
+| GPT-4-Turbo (Nov 2023)|  85.4| 79.3|
+| CodeQwen1.5-7B-Chat|  83.5| 78.7|
+| claude-3-opus (Mar 2024)|  82.9| 76.8|
 ## Quickstart