Update README.md
README.md
@@ -16,7 +16,7 @@ This checkpoint (CodeGen-Multi 16B) was firstly initialized with *CodeGen-NL 16B
 ## Training procedure
 
 CodeGen was trained using cross-entropy loss to maximize the likelihood of sequential inputs.
-The family of models are trained using
+The family of models are trained using multiple TPU-v4-512 by Google, leveraging data and model parallelism.
 See Section 2.3 of the [paper](https://arxiv.org/abs/2203.13474) for more details.
 
 ## Evaluation results