Commit 67b67aa
Parent: fc689fe
Update README.md
README.md CHANGED
@@ -26,7 +26,7 @@ tags:
 - [Released Weights](#released-weights)
 - [Benchmark Results](#benchmark-results)
 - [General Domain](#general-domain)
-- [
+- [Model Results](#model-results)
 - [Inference and Deployment](#inference-and-deployment)
 - [Dependency Installation](#dependency-installation)
 - [Notice](#notice)
@@ -86,7 +86,7 @@ In the general domain, we conducted 5-shot tests on the following datasets:
 - [CMMLU](https://github.com/haonan-li/CMMLU) is a comprehensive Chinese evaluation benchmark covering 67 topics, specifically designed to assess language models' knowledge and reasoning capabilities in a Chinese context. We adopted its [official](https://github.com/haonan-li/CMMLU) evaluation approach.
 
 
-###
+### Model Results
 **Performance Comparison on Commonsense Reasoning and Aggregated Benchmarks.** For a fair comparison, we report competing methods' results reproduced by us using their released models. PS: parameter size (billion). T: tokens (trillion). HS: HellaSwag. WG: WinoGrande.
 
 | Model | PS | T | BoolQ | PIQA | HS | WG | ARC-e | ARC-c | OBQA | MMLU | CMMLU | C-Eval |