AceGPT

AceGPT is a fully fine-tuned generative text model collection, particularly focused on the Arabic language domain. This is the repository for the version 2 of the 8B pre-trained model, developed based on Meta-Llama-3-8B..


Model Details

We have released the AceGPT family of large language models, which is a collection of fully fine-tuned generative text models, ranging from 7B to 70B parameters. Our models include two main categories: AceGPT and AceGPT-chat. AceGPT-chat is an optimized version specifically designed for dialogue applications. It is worth mentioning that our models have demonstrated superior performance compared to all currently available open-source Arabic dialogue models in multiple benchmark tests. Furthermore, in our human evaluations, our models have shown comparable satisfaction levels to some closed-source models, such as ChatGPT, in the Arabic language.

Model Developers

We are from the King Abdullah University of Science and Technology (KAUST), the Chinese University of Hong Kong, Shenzhen (CUHKSZ) and the Shenzhen Research Institute of Big Data (SRIBD).

Variations

AceGPT families come in a range of parameter sizes โ€”โ€” 7B, 8B, 13B, 32B and 70B, each size of model has a base category and a -chat category.

Paper

The paper can be accessed at link.

Input

Models input text only.

Output

Models output text only.

Model Evaluation Results

Arabic Benchmark evaluations on Arabic MMLU are conducted using accuracy scores as metrics, following the evaluation framework available at https://github.com/FreedomIntelligence/AceGPT/tree/main.

Arabic-trans MMLU ArabicMMLU (koto et al.) Arabic EXAMS Arabic ACVA clean Arabic ACVA all Arabic AraTrust Arabic ARC-C Arabic Avg.
Qwen1.5-7B 42.14 46.41 38.34 75.17 75.88 54.21 45.56 53.96
Jais-30B-v3 43.42 44.47 45.78 83.39 79.51 62.64 45.56 57.82
Llama3-8B 47.22 45.78 46.34 77.49 76.68 67.82 47.53 58.41
AceGPT-v2-8B 48.41 50.17 46.15 80.14 78.84 65.90 49.91 59.93
ChatGPT 3.5 Turbo 49.07 57.70 45.93 74.45 76.88 65.13 60.24 61.34
Qwen1.5-32B 55.90 55.94 52.84 78.91 80.07 69.34 67.66 65.81
Qwen1.5-72B 60.24 61.23 54.41 82.98 81.20 75.93 76.79 70.40
AceGPT-v2-32B 58.71 65.67 52.74 82.66 81.04 80.46 71.69 70.42
Llama3-70B 65.16 65.67 54.78 83.48 82.92 74.84 77.30 72.02
AceGPT-v2-70B 65.19 67.71 56.19 84.79 80.93 80.93 80.93 73.81
GPT-4 65.06 72.50 57.76 84.06 79.43 90.04 85.67 76.36

Benchmarks for English and Chinese are conducted using the OpenCompass framework.

MMLU RACE English Avg. CMMLU CEval Chinese Avg. Avg.
Jais-30B-v3 42.53 30.96 36.75 25.26 22.17 23.72 30.23
AceGPT-v2-8B 65.48 60.49 62.99 53.44 50.37 51.91 57.45
Llama3-8B 66.57 65.92 66.25 50.70 49.78 50.24 58.24
ChatGPT 3.5 Turbo 69.03 83.00 76.02 53.90 52.50 53.20 64.60
Qwen1.5-7B 62.15 82.19 72.17 71.79 73.61 72.70 72.44
AceGPT-v2-70B 76.71 80.48 78.60 68.97 66.87 67.92 73.26
GPT-4 83.00 91.00 87.00 71.00 69.90 70.45 78.73
Llama3-70B 79.34 84.76 82.05 68.29 67.21 67.75 74.90
Qwen1.5-32B 75.10 83.29 79.20 83.12 82.68 82.90 81.05
AceGPT-v2-32B 74.52 88.68 81.60 81.36 82.41 81.89 81.74
Qwen1.5-72B 75.78 88.23 82.01 83.11 83.04 83.08 82.54

Samples

Sample1(abstract_algebra)

  • input: "ููŠู…ุง ูŠู„ูŠ ุฃุณุฆู„ุฉ ุงู„ุงุฎุชูŠุงุฑ ู…ู† ู…ุชุนุฏุฏ (ู…ุน ุงู„ุฅุฌุงุจุงุช) ุญูˆู„ ุฌุจุฑ ุชุฌุฑูŠุฏูŠ\n\nุณุคุงู„: ุงู„ุนุซูˆุฑ ุนู„ู‰ ุฌู…ูŠุน ู‚ูŠู… c ููŠ Z_3 ุจุญูŠุซ ูŠูƒูˆู† Z_3 [x]/(x^2+c) ุญู‚ู„ู‹ุง.\nA. 0\nB. 1\nC. 2\nD. 3\nุฅุฌุงุจุฉ: B\n\nุณุคุงู„: ุงู„ุจูŠุงู† ุฑู‚ู… 1 | ุฅุฐุง ูƒุงู† aH ุนู†ุตุฑู‹ุง ููŠ ู…ุฌู…ูˆุนุฉ ุงู„ุนูˆุงู…ู„ ุŒ ูุฅู† | aH | ูŠู‚ุณู… | a |. ุงู„ุจูŠุงู† ุฑู‚ู… 2 | ุฅุฐุง ูƒุงู†ุช H ูˆ K ู…ุฌู…ูˆุนุงุช ูุฑุนูŠุฉ ู„ู€ G ุŒ ูุฅู† HK ู…ุฌู…ูˆุนุฉ ูุฑุนูŠุฉ ู„ู€ G.\nA. ุตุญูŠุญ ุŒ ุตุญูŠุญ\nB. ุฎุทุฃ ุŒ ุฎุทุฃ\nC. ุตุญูŠุญ ุŒ ุฎุทุฃ\nD. ุฎุทุฃ ุŒ ุตุญูŠุญ\nุฅุฌุงุจุฉ: B\n\nุณุคุงู„: ุงู„ุนุจุงุฑุฉ 1 | ูƒู„ ุนู†ุตุฑ ู…ู† ู…ุฌู…ูˆุนุฉ ูŠูˆู„ุฏ ู…ุฌู…ูˆุนุฉ ุฏูˆุฑูŠุฉ ู…ู† ุงู„ู…ุฌู…ูˆุนุฉ. ุงู„ุนุจุงุฑุฉ 2 | ุงู„ู…ุฌู…ูˆุนุฉ ุงู„ู…ุชู†ุงุธุฑุฉ S_10 ู„ุฏูŠู‡ุง 10 ุนู†ุงุตุฑ.\nA. ุตุญูŠุญุŒ ุตุญูŠุญ\nB. ุฎุทุฃุŒ ุฎุทุฃ\nC. ุตุญูŠุญุŒ ุฎุทุฃ\nD. ุฎุทุฃุŒ ุตุญูŠุญ\nุฅุฌุงุจุฉ: C\n\nุณุคุงู„: ุงู„ุจูŠุงู† 1| ูƒู„ ูˆุธูŠูุฉ ู…ู† ู…ุฌู…ูˆุนุฉ ู…ุญุฏูˆุฏุฉ ุนู„ู‰ ู†ูุณู‡ุง ูŠุฌุจ ุฃู† ุชูƒูˆู† ูˆุงุญุฏุฉ ู„ูƒู„ ู…ุฌู…ูˆุนุฉ. ุงู„ุจูŠุงู† 2 | ูƒู„ ูุฑุน ูุฑุนูŠ ู„ู…ุฌู…ูˆุนุฉ ุฃุจูŠู„ูŠุฉ ู‡ูˆ ุฃุจูŠู„ูŠ.\nA. ุตุญูŠุญ, ุตุญูŠุญ\nB. ุฎุงุทุฆ, ุฎุงุทุฆ\nC. ุตุญูŠุญ, ุฎุงุทุฆ\nD. ุฎุงุทุฆ, ุตุญูŠุญ\nุฅุฌุงุจุฉ: A\n\nุณุคุงู„: ุงุนุซุฑ ุนู„ู‰ ุฎุงุตูŠุฉ ุงู„ุญู„ู‚ุฉ 2Z.\nA. 0\nB. 3\nC. 12\nD. 30\nุฅุฌุงุจุฉ: A\n\nุณุคุงู„: ู…ุง ู‡ูˆ ุงู„ุฏุฑุฌุฉ ู„ู„ุงู…ุชุฏุงุฏ ุงู„ู…ูŠุฏุงู†ูŠ ุงู„ู†ุงุชุฌ ู…ู† Q(sqrt(2), sqrt(3), sqrt(18)) ุนู„ู‰ QุŸ\nA. 0\nB. 4\nC. 2\nD. 6\nุฅุฌุงุจุฉ:"

  • output: " B\n\nุณุคุงู„: ู…ุง ู‡ูˆ ุงู„ุฏุฑุฌุฉ ู„ู„"

Sample2(business_ethics)

  • input: "ููŠู…ุง ูŠู„ูŠ ุฃุณุฆู„ุฉ ุงู„ุงุฎุชูŠุงุฑ ู…ู† ู…ุชุนุฏุฏ (ู…ุน ุงู„ุฅุฌุงุจุงุช) ุญูˆู„ ุฃุฎู„ุงู‚ูŠุงุช ุงู„ุฃุนู…ุงู„\n\nุณุคุงู„: ู…ุง ู‡ูŠ ุงู„ุญุฌุฌ ุงู„ุฃุฎู„ุงู‚ูŠุฉ ุงู„ู…ุชุนู„ู‚ุฉ ุจุงู„ู…ุณุคูˆู„ูŠุฉ ุงู„ุงุฌุชู…ุงุนูŠุฉ ู„ู„ุดุฑูƒุงุชุŸ\nA. ุงู„ุชูƒุงู„ูŠู ุงู„ุฎุงุฑุฌูŠุฉุŒ ุงู„ู‚ูˆุฉุŒ ุงู„ุงุณุชู‚ู„ุงู„ูŠุฉ\nB. ุงู„ุฅุนู„ุงู…ุŒ ุงู„ู…ูˆุงุฑุฏ ุงู„ุถุนูŠูุฉุŒ ุงู„ุชุจุงุฏู„ ุงู„ุชุนุงูˆู†ูŠ\nC. ุงู„ุฅุนู„ุงู…ุŒ ุงู„ู‚ูˆุฉุŒ ุงู„ุงุณุชู‚ู„ุงู„ูŠุฉ\nD. ุงู„ุชูƒุงู„ูŠู ุงู„ุฎุงุฑุฌูŠุฉุŒ ุงู„ู‚ูˆุฉุŒ ุงู„ุชุจุงุฏู„ ุงู„ุชุนุงูˆู†ูŠ\nุฅุฌุงุจุฉ: D\n\nุณุคุงู„: _______ ู‡ูˆ ุงู„ู…ุญุงูˆู„ุฉ ุงู„ู…ุจุงุดุฑุฉ ู„ุฅุฏุงุฑุฉ ุงู„ู‚ุถุงูŠุง ุงู„ุฃุฎู„ุงู‚ูŠุฉ ุฃูˆ ุงู„ู…ุดุงูƒู„ุŒ ุณูˆุงุก ุจุดูƒู„ ุฑุณู…ูŠ ุฃูˆ ุบูŠุฑ ุฑุณู…ูŠุŒ ู…ู† ุฎู„ุงู„ ุณูŠุงุณุงุช ูˆู…ู…ุงุฑุณุงุช ูˆุจุฑุงู…ุฌ ู…ุญุฏุฏุฉ.\nA. ุงู„ู…ุณุคูˆู„ูŠุฉ ุงู„ุงุฌุชู…ุงุนูŠุฉ ู„ู„ุดุฑูƒุงุช\nB. ุฅุฏุงุฑุฉ ุงู„ุฃุฎู„ุงู‚ูŠุงุช ุงู„ุนู…ู„ูŠุฉ\nC. ุงู„ุงุณุชุฏุงู…ุฉ\nD. ุฅุฏุงุฑุฉ ุงู„ุจูŠุฆุฉ\nุฅุฌุงุจุฉ: B\n\nุณุคุงู„: ู„ุถู…ุงู† ุงุณุชู‚ู„ุงู„ ุฃุนุถุงุก ู…ุฌู„ุณ ุงู„ุฅุฏุงุฑุฉ ุบูŠุฑ ุงู„ุชู†ููŠุฐูŠุฉ ุŒ ู‡ู†ุงูƒ ุนุฏุฏ ู…ู† ุงู„ุฎุทูˆุงุช ุงู„ุชูŠ ูŠู…ูƒู† ุงุชุฎุงุฐู‡ุง ุŒ ูˆุงู„ุชูŠ ุชุดู…ู„ ุงุฎุชูŠุงุฑ ุงู„ุบูŠุฑ ุงู„ุชู†ููŠุฐูŠูŠู† ู…ู† _______ ุงู„ุดุฑูƒุฉ ุŒ ูˆุชุนูŠูŠู†ู‡ู… ู„ู…ุฏุฉ _________ ุŒ ูˆูƒุฐู„ูƒ ุชุนูŠูŠู†ู‡ู… _________.\nA. ุฎุงุฑุฌ ุงู„ุดุฑูƒุฉ ุŒ ู…ุญุฏูˆุฏุฉ ุŒ ุจุดูƒู„ ู…ุณุชู‚ู„\nB. ู…ู† ุงู„ุฏุงุฎู„ ุŒ ู…ุญุฏูˆุฏุฉ ุŒ ุจุดูƒู„ ู…ุชู‚ุทุน\nC. ุฎุงุฑุฌ ุงู„ุดุฑูƒุฉ ุŒ ุบูŠุฑ ู…ุญุฏูˆุฏุฉ ุŒ ุจุดูƒู„ ู…ุชู‚ุทุน\nD. ู…ู† ุงู„ุฏุงุฎู„ ุŒ ุบูŠุฑ ู…ุญุฏูˆุฏุฉ ุŒ ุจุดูƒู„ ู…ุณุชู‚ู„\nุฅุฌุงุจุฉ: A\n\nุณุคุงู„: ู…ุง ู‡ูŠ ุงู„ุฃุณุงู„ูŠุจ ุงู„ุชูŠ ูŠู…ูƒู† ู„ู„ู…ุฏูŠุฑ ุงู„ุฃู…ู†ูŠ ุงู„ุฐูŠ ูŠุณุนู‰ ู„ุชุญู‚ูŠู‚ ุฃู‡ุฏุงูู‡ ุงู„ุงุฎุชูŠุงุฑ ุจูŠู†ู‡ุงุŸ\nA. ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุงู„ุบูŠุฑ ุนู†ูŠู ุŒ ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุงู„ุนู†ูŠู ุŒ ุงู„ุนู…ู„ ุบูŠุฑ ุงู„ู…ุจุงุดุฑ ุŒ ุงู„ุญู…ู„ุฉ ุงู„ุฏุนุงุฆูŠุฉ\nB. ุงู„ุนู…ู„ ุบูŠุฑ ุงู„ู…ุจุงุดุฑ ุŒ ุงู„ุนู…ู„ ุงู„ุฃูˆุชูŠู„ ุŒ ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุงู„ุบูŠุฑ ุนู†ูŠู ุŒ ุงู„ุญู…ู„ุฉ ุงู„ุฅุนู„ุงู…ูŠุฉ\nC. ุงู„ุนู…ู„ ุบูŠุฑ ุงู„ู…ุจุงุดุฑ ุŒ ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุงู„ุนู†ูŠู ุŒ ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุบูŠุฑ ุงู„ุนู†ูŠู ุงู„ู…ุจุงุดุฑ ุŒ ุงู„ุญู…ู„ุฉ ุงู„ุฏุนุงุฆูŠุฉ\nD. ุงู„ุนู…ู„ ุงู„ู…ุจุงุดุฑ ุงู„ุบูŠุฑ ุนู†ูŠู ุŒ ุงู„ุนู…ู„ ุงู„ุฃูˆุชูŠู„ ุŒ ุงู„ุนู…ู„ ุบูŠุฑ ุงู„ู…ุจุงุดุฑ ุŒ ุงู„ุญู…ู„ุฉ ุงู„ุฅุนู„ุงู…ูŠุฉ\nุฅุฌุงุจุฉ: C\n\nุณุคุงู„: ุนู„ู‰ ุนูƒุณ _______ ุŒ ุชู‡ุฏู _______ ุฅู„ู‰ ู…ูƒุงูุฃุฉ ุงู„ุณู„ูˆูƒ ุงู„ุฅูŠุฌุงุจูŠ ู„ู„ุดุฑูƒุงุช. ุชู… ุชุนุฒูŠุฒ ู†ุฌุงุญ ู…ุซู„ ู‡ุฐู‡ ุงู„ุญู…ู„ุงุช ู…ู† ุฎู„ุงู„ ุงุณุชุฎุฏุงู… ___________, ุงู„ุฐูŠ ูŠุชูŠุญ ู„ู„ุญู…ู„ุงุช ุชูŠุณูŠุฑ ุชุญู‚ูŠู‚ ุงู„ุดุฑูƒุฉ ู„ู€ู€ _________ .\nA. ุงู„ุญู…ู„ุงุช ุงู„ุงุณุชู‡ู„ุงูƒูŠุฉุŒ ุงู„ุญู…ู„ุงุช ุงู„ุงุณุชู‡ู„ุงูƒูŠุฉ ุงู„ุนุงู…ุฉุŒ ุชูƒู†ูˆู„ูˆุฌูŠุง ุณู„ุณู„ุฉ ุงู„ูƒุชู„ุŒ ุงู„ุชุจุฑุนุงุช ุงู„ุฎูŠุฑูŠุฉ\nB. ุงู„ุญู…ู„ุงุช ุงู„ุชุญููŠุฒูŠุฉุŒ ุงู„ุญู…ู„ุงุช ุงู„ุงุณุชู‡ู„ุงูƒูŠุฉ ุงู„ุนุงู…ุฉุŒ ุงู„ุชูƒู†ูˆู„ูˆุฌูŠุง ุงู„ุฑู‚ู…ูŠุฉุŒ ุฒูŠุงุฏุฉ ุงู„ู…ุจูŠุนุงุช\nC. ุงู„ุญู…ู„ุงุช ุงู„ุงุณุชู‡ู„ุงูƒูŠุฉุŒ ุงู„ุญู…ู„ุงุช ุงู„ุดุฑุงุฆูŠุฉุŒ ุชูƒู†ูˆู„ูˆุฌูŠุง ุณู„ุณู„ุฉ ุงู„ูƒุชู„ุŒ ุงู„ุชุจุฑุนุงุช ุงู„ุฎูŠุฑูŠุฉ\nD. ุงู„ู…ู‚ุงุทุนุงุชุŒ ุงู„ุญู…ู„ุงุช ุงู„ุชุญููŠุฒูŠุฉุŒ ุงู„ุญู…ู„ุงุช ุงู„ุฑู‚ู…ูŠุฉุŒ ุฒูŠุงุฏุฉ ุงู„ู…ุจูŠุนุงุช\nุฅุฌุงุจุฉ: D\n\nุณุคุงู„: ุชูุตุจุญ _______ ู…ุซู„ ุงู„ุจูŠุชูƒูˆูŠู† ุฃูƒุซุฑ ุงู†ุชุดุงุฑู‹ุง ูˆุชุญู…ู„ ู…ุฌู…ูˆุนุฉ ูƒุจูŠุฑุฉ ู…ู† ุงู„ุขุซุงุฑ ุงู„ุฃุฎู„ุงู‚ูŠุฉ ุงู„ู…ุฑุชุจุทุฉ ุจู‡ุงุŒ ุนู„ู‰ ุณุจูŠู„ ุงู„ู…ุซุงู„ุŒ ุฅู†ู‡ุง _______ ูˆุฃูƒุซุฑ _______. ูˆู…ุน ุฐู„ูƒุŒ ุชู… ุงุณุชุฎุฏุงู…ู‡ุง ุฃูŠุถู‹ุง ู„ู„ู…ุดุงุฑูƒุฉ ููŠ _______.\nA. ุงู„ุนู…ู„ุงุช ุงู„ุฑู‚ู…ูŠุฉุŒ ู…ูƒู„ูุฉุŒ ุขู…ู†ุฉุŒ ุฌุฑุงุฆู… ู…ุงู„ูŠุฉ\nB. ุงู„ุนู…ู„ุงุช ุงู„ุชู‚ู„ูŠุฏูŠุฉุŒ ุฑุฎูŠุตุฉุŒ ุบูŠุฑ ุขู…ู†ุฉุŒ ุงู„ุนุทุงุก ุงู„ุฎูŠุฑูŠ\nC. ุงู„ุนู…ู„ุงุช ุงู„ุฑู‚ู…ูŠุฉุŒ ุฑุฎูŠุตุฉุŒ ุขู…ู†ุฉุŒ ุฌุฑุงุฆู… ู…ุงู„ูŠุฉ\nD. ุงู„ุนู…ู„ุงุช ุงู„ุชู‚ู„ูŠุฏูŠุฉุŒ ู…ูƒู„ูุฉุŒ ุบูŠุฑ ุขู…ู†ุฉุŒ ุงู„ุนุทุงุก ุงู„ุฎูŠุฑูŠ\nุฅุฌุงุจุฉ:"

  • output: " A\n\nุณุคุงู„: ู…ุง ู‡ูŠ ุงู„ุญุฌุฌ"

Reference

@inproceedings{liang2024alignment,
  title={Alignment at Pre-training! Towards Native Alignment for Arabic {LLM}s},
  author={Juhao Liang and Zhenyang Cai and Jianqing Zhu and Huang Huang and Kewei Zong and Bang An and Mosen Alharthi and Juncai He and Lian Zhang and Haizhou Li and Benyou Wang and Jinchao Xu},
  booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
  year={2024},
  url={https://openreview.net/forum?id=woRFmNJiLp}
}
@article{zhu2024second,
  title={Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion},
  author={Zhu, Jianqing and Huang, Huang and Lin, Zhihang and Liang, Juhao and Tang, Zhengyang and Almubarak, Khalid and Alharthi, Mosen and An, Bang and He, Juncai and Wu, Xiangbo and Yu, Fei and Chen, Junying and Ma, Zhuoheng and Du, Yuhao and Hu, Yan and Zhang, He and Alghamdi, Emad A. and Zhang, Lian and Sun, Ruoyu and Li, Haizhou and Wang, Benyou and Xu, Jinchao},
  journal={},
  year={2024}
}
Downloads last month
121
Safetensors
Model size
8B params
Tensor type
F16
ยท
Inference Providers NEW

Model tree for FreedomIntelligence/AceGPT-v2-8B

Merges
1 model
Quantizations
2 models

Collection including FreedomIntelligence/AceGPT-v2-8B