This model has 1 file scanned as unsafe.
- attn_layer_mapper=last, attn_loss_fn=mse, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
- attn_layer_mapper=layer-2, attn_loss_fn=cos, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
- attn_layer_mapper=layer-2, attn_loss_fn=mse, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
- dataset_sample_size=1000000
- lr_scheduler_type=cosine, warmup_ratio=0.5
- lr_scheduler_type=linear, warmup_ratio=0.5
- 0 Bytes
- 29.7 MB xet
- 588 Bytes xet