Update README.md
Browse files
README.md
CHANGED
|
@@ -16,7 +16,7 @@ This checkpoint (CodeGen-Multi 16B) was firstly initialized with *CodeGen-NL 16B
|
|
| 16 |
## Training procedure
|
| 17 |
|
| 18 |
CodeGen was trained using cross-entropy loss to maximize the likelihood of sequential inputs.
|
| 19 |
-
The family of models are trained using
|
| 20 |
See Section 2.3 of the [paper](https://arxiv.org/abs/2203.13474) for more details.
|
| 21 |
|
| 22 |
## Evaluation results
|
|
|
|
| 16 |
## Training procedure
|
| 17 |
|
| 18 |
CodeGen was trained using cross-entropy loss to maximize the likelihood of sequential inputs.
|
| 19 |
+
The family of models are trained using multiple TPU-v4-512 by Google, leveraging data and model parallelism.
|
| 20 |
See Section 2.3 of the [paper](https://arxiv.org/abs/2203.13474) for more details.
|
| 21 |
|
| 22 |
## Evaluation results
|