Update README.md
Browse files
README.md
CHANGED
|
@@ -40,28 +40,28 @@ La Configuraci贸n 3, con un n煤mero de 茅pocas superior, logr贸 la convergencia
|
|
| 40 |
|
| 41 |
## Argumentos de entrenamiento
|
| 42 |
|
| 43 |
-
|
| 44 |
-
|
| 45 |
-
|
| 46 |
-
|
| 47 |
-
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
|
| 53 |
-
|
| 54 |
-
|
| 55 |
-
|
| 56 |
-
|
| 57 |
-
|
| 58 |
-
|
| 59 |
-
|
| 60 |
-
|
| 61 |
-
|
| 62 |
-
|
| 63 |
-
|
| 64 |
-
|
| 65 |
|
| 66 |
## Dataset
|
| 67 |
|
|
@@ -83,9 +83,11 @@ Splits disponibles:
|
|
| 83 |
|
| 84 |
Puedes cargar el dataset directamente desde Hugging Face:
|
| 85 |
|
| 86 |
-
|
|
|
|
| 87 |
ds_train = load_dataset("AngelaGijon/TFG-DockerCommands-Dataset", split="train")
|
| 88 |
-
print(ds_train[0])
|
|
|
|
| 89 |
|
| 90 |
Disponible en el repositorio: **"AngelaGijon/TFG-DockerCommands-Dataset"**
|
| 91 |
|
|
|
|
| 40 |
|
| 41 |
## Argumentos de entrenamiento
|
| 42 |
|
| 43 |
+
| Argumento | Valor |
|
| 44 |
+
| ----------------------------- | ---------------------: |
|
| 45 |
+
| `output_dir` | `output_dir` |
|
| 46 |
+
| `lr_scheduler_type` | `"linear"` |
|
| 47 |
+
| `num_train_epochs` | `15` |
|
| 48 |
+
| `per_device_train_batch_size` | `8` |
|
| 49 |
+
| `per_device_eval_batch_size` | `8` |
|
| 50 |
+
| `gradient_accumulation_steps` | `2` |
|
| 51 |
+
| `learning_rate` | `5e-5` |
|
| 52 |
+
| `weight_decay` | `0.01` |
|
| 53 |
+
| `warmup_steps` | `50` |
|
| 54 |
+
| `predict_with_generate` | `True` |
|
| 55 |
+
| `logging_dir` | `f"{output_dir}/logs"` |
|
| 56 |
+
| `logging_steps` | `10` |
|
| 57 |
+
| `eval_steps` | `10` |
|
| 58 |
+
| `save_total_limit` | `2` |
|
| 59 |
+
| `dataloader_num_workers` | `2` |
|
| 60 |
+
| `evaluation_strategy` | `"steps"` |
|
| 61 |
+
| `save_strategy` | `"steps"` |
|
| 62 |
+
| `load_best_model_at_end` | `True` |
|
| 63 |
+
| `fp16` | `True` |
|
| 64 |
+
| `seed` | `42` |
|
| 65 |
|
| 66 |
## Dataset
|
| 67 |
|
|
|
|
| 83 |
|
| 84 |
Puedes cargar el dataset directamente desde Hugging Face:
|
| 85 |
|
| 86 |
+
```
|
| 87 |
+
from datasets import load_dataset
|
| 88 |
ds_train = load_dataset("AngelaGijon/TFG-DockerCommands-Dataset", split="train")
|
| 89 |
+
print(ds_train[0])
|
| 90 |
+
```
|
| 91 |
|
| 92 |
Disponible en el repositorio: **"AngelaGijon/TFG-DockerCommands-Dataset"**
|
| 93 |
|