metadata
language: en
tags:
- fill-mask
Environmental Impact (CODE CARBON DEFAULT)
| Metric | Value |
|---|---|
| Duration (in seconds) | [More Information Needed] |
| Emissions (Co2eq in kg) | [More Information Needed] |
| CPU power (W) | [NO CPU] |
| GPU power (W) | [No GPU] |
| RAM power (W) | [More Information Needed] |
| CPU energy (kWh) | [No CPU] |
| GPU energy (kWh) | [No GPU] |
| RAM energy (kWh) | [More Information Needed] |
| Consumed energy (kWh) | [More Information Needed] |
| Country name | [More Information Needed] |
| Cloud provider | [No Cloud] |
| Cloud region | [No Cloud] |
| CPU count | [No CPU] |
| CPU model | [No CPU] |
| GPU count | [No GPU] |
| GPU model | [No GPU] |
Environmental Impact (for one core)
| Metric | Value |
|---|---|
| CPU energy (kWh) | [No CPU] |
| Emissions (Co2eq in kg) | [More Information Needed] |
Note
5 juillet 2024 !
My Config
| Config | Value |
|---|---|
| checkpoint | albert-base-v2 |
| model_name | fp_bs8_lr1e4_x4 |
| sequence_length | 400 |
| num_epoch | 6 |
| learning_rate | 0.0001 |
| batch_size | 8 |
| weight_decay | 0.0 |
| warm_up_prop | 0.0 |
| drop_out_prob | 0.1 |
| packing_length | 100 |
| train_test_split | 0.2 |
| num_steps | 81709 |
Training and Testing steps
| Epoch | Train Loss | Test Loss |
|---|---|---|
| 0.0 | 17.859739 | 13.066814 |
| 0.5 | 3.997183 | 3.593726 |
| 1.0 | 3.391947 | 3.265344 |
| 1.5 | 6.459209 | 6.977757 |
| 2.0 | 6.985903 | 6.983891 |
| 2.5 | 6.982025 | 7.001526 |
| 3.0 | 6.995377 | 6.998771 |
| 3.5 | 6.984867 | 6.995654 |
| 4.0 | 6.984822 | 6.994850 |
| 4.5 | 6.975999 | 6.984660 |
| 5.0 | 6.978274 | 6.982478 |
| 5.5 | 6.975072 | 6.977838 |