distily/distily_grad_log_perf

Tags: TensorBoard, Safetensors, Distily, gpt2, Generated from Trainer
distily_grad_log_perf / logs
27.8 kB
  • 1 contributor
History: 4 commits
lapp0
End of training
8a061ec verified over 1 year ago
  • attn_norm=layernorm, attn_projector=orthogonal, attn_weight=5, extra_grad_stats=False, learning_rate=0.0001, per_device_train_batch_size=8, warmup_ratio=0
    End of training over 1 year ago
  • attn_norm=layernorm, attn_projector=orthogonal, attn_weight=5, extra_grad_stats=True, learning_rate=0.0001, per_device_train_batch_size=8, warmup_ratio=0
    Training in progress, step 495 over 1 year ago